SnapAPI
  • Models
  • Docs
  • Pricing
  • Blog
Log InSign Up
Log In
SnapAPI

One unified API to access world-class AI models.

Contact: support@silverai.com
Products
  • Models
  • Docs
  • Blog
Resources
  • API Updates
  • Terms
  • Privacy
Image Processing
  • Remove Background
  • Virtual Tryon
  • Enhance & Upscale
  • Object Detection
Image Generation
  • FLUX Kontext Dev
  • Z-Image Turbo
  • Qwen Image Edit
  • Fairy AI

©2026 SnapAPI.AI. All rights reserved.

All services are online
  1. Home
  2. Models
  3. Text Detection
SilverAISilverAI

Text Detection

Detect text regions and return a mask, ready to feed the Remove Text API.

image

Overview

Text Detection by SilverAI locates text regions in an image and returns a precise mask covering every detected character and word. Rather than altering the image, it produces a pixel-accurate RGBA mask that marks exactly where text appears, leaving you in full control of what happens next. The mask can be reviewed, edited by hand, or fed directly into a downstream removal step.

This model is the natural companion to the Remove Text API. The mask returned here can be passed as the input_mask to Remove Text, optionally after manual touch-ups, giving you a clean, two-stage pipeline: detect first, then remove. Because detection and removal are decoupled, you gain transparency and fine-grained control over which text gets erased, retouched, or preserved.

Key Capabilities

  • Multi-language detection: Reliably detects both Latin and non-Latin scripts, including CJK, Cyrillic, Arabic, and more.

  • Tight, accurate masks: Returns pixel-precise RGBA masks that hug character edges for clean downstream results.

  • Pairs with Remove Text: The output mask drops straight into the Remove Text API as input_mask for an end-to-end removal pipeline.

  • Editable output: Masks can be manually refined before removal, so you decide exactly what stays and what goes.

  • Flexible input: Accepts an uploaded image file or a remote image URL.

  • Clear empty signal: Returns a null mask and detected: false when no text is present, making conditional logic simple.

Supported Tasks

Detecting printed and stylized text regions in photographs and graphicsGenerating masks for text-removal and inpainting pipelinesIdentifying watermark and caption areas for downstream editingAuditing images for the presence of any text before processing

Use Cases

Document and Sign Cleanup

Detect text on signs, documents, and labels to build masks that drive automated cleanup. Pair the detected mask with Remove Text to erase captions, watermarks, or overlaid wording while preserving the underlying scene.

Document and Sign Cleanup

Multilingual Content Preparation

With support for both Latin and non-Latin scripts, Text Detection prepares masks for global content workflows. Localize imagery by detecting source-language text before replacing or removing it for new markets.

Multilingual Content Preparation

E-commerce Image Standardization

Standardize product catalogs by detecting promotional text, watermarks, and stray labels across thousands of images. Use the masks to flag images for cleanup or feed them directly into a removal pipeline.

E-commerce Image Standardization
Specifications
model iddetect-text
vendorSilverAI
typeimage
inputRGB Image
outputRGBA Mask
endpoint/v1/images/detect-text
statusStable
version1.0
Usage Pricing
Pay only for what you use
~$0.0005 / image
  • credits: 1 credit / image
  • credit rate: from $0.0005 per credit (top up via dashboard)
  • free tier: Available with API key signup
  • volume discounts: Available for high-volume usage

Save up to 70% vs direct pricing

Aggregated volume discounts.