What types of text can the model detect?

The model can detect printed text, captions, product labels, signs, watermarks, subtitles, logos containing text, and many stylized text elements commonly found in photographs and graphic designs.

Which languages are supported?

The Text Detection API supports both Latin and non-Latin writing systems, including English, Chinese, Japanese, Korean, Arabic, languages written in Cyrillic script, and many others.

Can I use the output with Remove Text API?

Yes. The returned mask is fully compatible with the Remove Text API and can be passed directly as the input_mask parameter to create a detect-then-remove workflow.

What happens when no text is detected?

The API returns detected: false and a null mask, making it easy to skip unnecessary processing steps in automated workflows.
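
A minimal sketch of that skip logic follows. It assumes the response is a JSON object with a boolean `detected` field and a `mask` field that is null when nothing is found. The field names follow the wording above, but the exact response layout and the mask encoding are assumptions, and `mask_to_remove` is a hypothetical helper.

```python
import json
from typing import Optional


def mask_to_remove(response_body: str) -> Optional[str]:
    """Return the mask to pass on, or None when removal can be skipped.

    Assumes a JSON body with `detected` (bool) and `mask` (null, or an
    opaque mask value such as a URL or base64 string). Both the layout
    and the mask encoding are assumptions made for illustration.
    """
    data = json.loads(response_body)
    if not data.get("detected") or data.get("mask") is None:
        return None  # no text found: nothing to remove
    return data["mask"]


# Example bodies shaped like the behavior described above.
no_text = '{"detected": false, "mask": null}'
has_text = '{"detected": true, "mask": "<mask-data>"}'

for body in (no_text, has_text):
    mask = mask_to_remove(body)
    print("skip removal" if mask is None else "run removal")
```

Checking both `detected` and the mask keeps the caller safe if either signal is missing from a response.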

SilverAI

Text Detection

Detect text regions and return a mask, ready to feed the Remove Text API.

Overview

Text Detection by SilverAI locates text regions in an image and returns a precise mask covering every detected character and word. Rather than altering the image, it produces a pixel-accurate RGBA mask that marks exactly where text appears, leaving you in full control of what happens next. The mask can be reviewed, edited by hand, or fed directly into a downstream removal step.

This model is the natural companion to the Remove Text API. The mask returned here can be passed as the input_mask to Remove Text, optionally after manual touch-ups, giving you a clean, two-stage pipeline: detect first, then remove. Because detection and removal are decoupled, you gain transparency and fine-grained control over which text gets erased, retouched, or preserved.
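
The two-stage flow can be sketched as a small orchestration function. The sketch makes no assumption about how the HTTP calls are made: `detect` and `remove` are caller-supplied functions (hypothetical wrappers around the two APIs), and `edit_mask` is an optional hook for manual touch-ups. Only the `detected`, `mask`, and `input_mask` names come from this page.

```python
from typing import Callable, Optional


def detect_then_remove(
    image: bytes,
    detect: Callable[[bytes], dict],
    remove: Callable[[bytes, str], bytes],
    edit_mask: Optional[Callable[[str], str]] = None,
) -> bytes:
    """Run detection, optionally edit the mask, then run removal.

    `detect` is assumed to return a dict with `detected` and `mask`.
    `remove` receives the image and the mask to send as `input_mask`.
    """
    result = detect(image)
    if not result.get("detected"):
        return image  # nothing to erase, return the image unchanged
    mask = result["mask"]
    if edit_mask is not None:
        mask = edit_mask(mask)  # manual or programmatic touch-up
    return remove(image, mask)


# Stand-in stages so the flow can be exercised without network access.
def fake_detect(image: bytes) -> dict:
    return {"detected": True, "mask": "mask-v1"}


def fake_remove(image: bytes, input_mask: str) -> bytes:
    return image + b" cleaned with " + input_mask.encode()


print(detect_then_remove(b"photo", fake_detect, fake_remove))
```

Keeping the stages as separate callables mirrors the decoupling described above: the mask can be inspected or replaced between the two calls without touching either stage.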

Key Capabilities

Multi-language detection: Reliably detects both Latin and non-Latin scripts, including CJK, Cyrillic, Arabic, and more.
Tight, accurate masks: Returns pixel-precise RGBA masks that hug character edges for clean downstream results.
Pairs with Remove Text: The output mask drops straight into the Remove Text API as input_mask for an end-to-end removal pipeline.
Editable output: Masks can be manually refined before removal, so you decide exactly what stays and what goes.
Flexible input: Accepts an uploaded image file or a remote image URL.
Clear empty signal: Returns a null mask and detected: false when no text is present, making conditional logic simple.
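
As an illustration of the editable output, the sketch below clears a rectangle in a decoded mask so that text inside it is preserved. It assumes the mask has already been decoded into rows of (R, G, B, A) tuples and that a non-zero alpha value marks text. This page does not specify the mask encoding, so treat both points as assumptions; `preserve_region` is a hypothetical helper.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int, int]  # (R, G, B, A)
Mask = List[List[Pixel]]           # rows of pixels

CLEAR: Pixel = (0, 0, 0, 0)        # assumed to mean "no text here"


def preserve_region(mask: Mask, left: int, top: int, right: int, bottom: int) -> Mask:
    """Return a copy of `mask` with the box [left, right) x [top, bottom) cleared.

    Cleared pixels are no longer marked, so a later removal step would
    leave the text in that box untouched (under the assumed encoding).
    """
    edited = [row[:] for row in mask]  # copy rows, leave the input intact
    for y in range(max(top, 0), min(bottom, len(edited))):
        row = edited[y]
        for x in range(max(left, 0), min(right, len(row))):
            row[x] = CLEAR
    return edited


MARKED: Pixel = (255, 255, 255, 255)
mask = [[MARKED] * 4 for _ in range(2)]      # 4x2 mask, fully marked
edited = preserve_region(mask, 0, 0, 2, 2)   # keep text in the left half
print([px[3] for px in edited[0]])
```

In practice an imaging library would do this on real pixel buffers. The nested-list form is used here only to keep the example dependency-free.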

Supported Tasks

Detecting printed and stylized text regions in photographs and graphics
Generating masks for text-removal and inpainting pipelines
Identifying watermark and caption areas for downstream editing
Auditing images for the presence of any text before processing

Specifications

model id: detect-text
vendor: SilverAI
type: image
input: RGB Image
output: RGBA Mask
endpoint: /v1/images/detect-text
status: Stable
version: 1.0
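
A hedged sketch of how a request to this endpoint might be assembled with Python's standard library. Only the endpoint path comes from the specifications. The base URL, the bearer-token header, and the `image_url` JSON field are placeholders chosen for illustration and should be replaced with the values from your dashboard and the API reference.

```python
import json
import urllib.request

BASE_URL = "https://api.example.com"  # placeholder, not the real host
ENDPOINT = "/v1/images/detect-text"   # from the specifications


def build_detect_request(
    image_url: str, api_key: str, base_url: str = BASE_URL
) -> urllib.request.Request:
    """Build (but do not send) a POST request for a remote image URL.

    The `image_url` field name and the Authorization scheme are
    assumptions; check the API reference for the actual contract.
    """
    payload = json.dumps({"image_url": image_url}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + ENDPOINT,
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_detect_request("https://example.com/photo.jpg", api_key="YOUR_API_KEY")
print(req.get_method(), req.full_url)
# Sending is left to the caller, for example: urllib.request.urlopen(req)
```

Building the request separately from sending it makes the call easy to inspect and test before any credits are spent.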

Usage Pricing

Pay only for what you use

$0.002 / image

credits: 1 credit / image
credit rate: $0.002 per credit (top up via dashboard)
free tier: Available with API key signup
volume discounts: Available for high-volume usage
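
For budgeting, the list price above works out as images × 1 credit × $0.002. A small sketch of that arithmetic follows; it ignores the free tier and any volume discounts, whose terms are not given here, and `estimate_cost` is a hypothetical helper.

```python
from decimal import Decimal

CREDITS_PER_IMAGE = 1
USD_PER_CREDIT = Decimal("0.002")


def estimate_cost(image_count: int) -> Decimal:
    """List-price cost in USD for `image_count` detection calls."""
    if image_count < 0:
        raise ValueError("image_count must be non-negative")
    return image_count * CREDITS_PER_IMAGE * USD_PER_CREDIT


print(estimate_cost(25_000))  # 50.000
```

`Decimal` is used instead of floats so that per-image prices add up exactly across large batches.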

Save up to 70% vs direct pricing

Aggregated volume discounts.

Use Cases

Smart Selection Features via Text Detection API

Pinpoint embedded text regions with high accuracy to build flexible workflows for removal, translation, or selective editing.

Build Text Removal Workflows with Detection-First Control

Text Detection API is commonly used as the first step in text removal pipelines. Developers can review or edit the generated mask before sending it to Remove Text API, giving full control over which text regions are removed and which should be preserved.

Content Localization and Translation Automation

Global marketing and e-commerce platforms use Text Detection API to locate embedded text inside banners, product images, and promotional creatives before replacing it with translated content. This helps automate multilingual content localization workflows at scale.

Add Text Selection Features to Editing Applications

Photo editing apps, design tools, and AI image editors can integrate Text Detection API to let users automatically select text regions with a single click. The returned mask can then be used for text removal, blur effects, inpainting, or custom editing actions.
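
One way an editor might turn the mask into a selection is to compute the bounding box of all marked pixels. The sketch below assumes the mask is decoded into rows of (R, G, B, A) tuples and that non-zero alpha marks text. Neither detail is confirmed by this page, and `text_bounding_box` is a hypothetical helper.

```python
from typing import List, Optional, Tuple

Pixel = Tuple[int, int, int, int]  # (R, G, B, A)
Box = Tuple[int, int, int, int]    # (left, top, right, bottom), right/bottom exclusive


def text_bounding_box(mask: List[List[Pixel]]) -> Optional[Box]:
    """Smallest box enclosing every marked pixel, or None if none is marked.

    Assumption: a pixel marks text when its alpha channel is non-zero.
    """
    xs = [x for row in mask for x, px in enumerate(row) if px[3] > 0]
    ys = [y for y, row in enumerate(mask) if any(px[3] > 0 for px in row)]
    if not xs:
        return None
    return (min(xs), min(ys), max(xs) + 1, max(ys) + 1)


T: Pixel = (255, 255, 255, 255)  # marked
E: Pixel = (0, 0, 0, 0)          # empty
mask = [
    [E, E, E, E],
    [E, T, T, E],
    [E, E, E, E],
]
print(text_bounding_box(mask))  # (1, 1, 3, 2)
```

A single box is the simplest selection shape. An application that needs one selection per word or line would have to split the mask into connected regions first.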

Integration in 3 steps

Start Building In Minutes

Integrate the Text Detection API into your existing workflow via Snapedit with just a few simple steps. No credit card required to start.

Sign Up

Create your Snapedit account in 30 seconds and receive free credits instantly. No credit card required.

Create API Key

Generate a globally valid Snapedit key with easy management from your dashboard.

Start Calling

Update your Base URL and API key to start calling Text Detection with smart routing and cost optimization.

FAQ

Frequently Asked Questions

Everything you need to know about using the Text Detection API through Snapedit.

Does the Text Detection API perform OCR or extract the text content?

No. The Text Detection API only identifies where text appears in an image and returns a pixel-level mask. It does not perform OCR or extract the actual text content.

Explore other models you might find useful

Object Detection

Detect bounding boxes & masks for 353 object types with high accuracy.

image-to-image, $0.002/run

Wire Detection

Detect wires, cables and lines and return a mask for the Remove Wires API.

image-to-image, $0.002/run

AI Image Detection

Detect whether an image is likely AI-generated, with a confidence score for moderation pipelines.

image-to-image, $0.004/run

Remove Object Normal

Remove unwanted objects using GAN, optimized for simple backgrounds.

image-to-image, $0.004/run
