Edit and composite up to 3 input images with natural-language prompts.
Multi-Image Edit is an image editing model from the Alibaba Qwen Team, powered by Qwen-Image-Edit. Unlike single-image editors, it accepts up to three input images at once and blends or transforms them according to a natural-language prompt. This makes it ideal for compositing scenarios where subjects from different photos need to be merged into a single, coherent scene — for example, placing a person and their dog together inside a stadium.
The model interprets free-form editing instructions and applies them across all provided inputs, preserving subject identity while harmonizing lighting, perspective, and context. By exposing an explicit editing mode (such as editing for general-purpose edits), it gives developers predictable control over how the prompt is applied, enabling reliable photo compositing, subject merging, and scene reconstruction at scale.
Up to 3 input images: Provide one required image and up to two optional additional images to merge or edit together in a single request.
Natural-language prompts: Describe edits in plain language — no masks or manual selections required.
Compositing and merging: Combine subjects from separate photos into one unified scene with consistent lighting and perspective.
Editing modes: Select an explicit mode (e.g. editing) to control how the prompt is interpreted and applied.
Identity preservation: Keeps the appearance of people, animals, and objects consistent across the edit.
Qwen-Image-Edit backbone: Built on Alibaba's Qwen-Image-Edit model for high-fidelity, instruction-following edits.
Save up to 70% vs direct pricing
Aggregated volume discounts.
Use Cases
Scale your image production by seamlessly merging products, assets, and backgrounds without manual design work.
E-commerce platforms and product photography tools can combine product photos, lifestyle backgrounds, and promotional assets into a single marketing image. The Multi-Image Edit API helps automate creative production without manual Photoshop work.
Marketing teams can generate advertising visuals by merging products, people, logos, and campaign backgrounds from multiple images. Ideal for AI design tools, ad generators, and creative automation platforms.
Photo editing apps can allow users to combine family members, friends, or team members from different photos into a single realistic group picture. The API automatically matches lighting, perspective, and scene composition.
Integration in 3 steps
Integrate the Multi-Image Edit API into your existing workflow via Snapedit with just a few simple steps. No credit card required to start.
Create your Snapedit account in 30 seconds and receive free credits instantly. No credit card required.
Generate a globally valid Snapedit key with easy management from your dashboard.
Update your Base URL and API key to start calling Multi-Image Edit with smart routing and cost optimization.
FAQ
Everything you need to know about using the Multi-Image Edit API through Snapedit.
Explore other models you might find useful
Multi-image AI editing model with ControlNet support and superior text rendering.

Expand images in all 4 directions with AI-powered outpainting.

Get AI-generated pose suggestions from an input image for guided photography and fashion.

Detect bounding boxes & masks for 353 object types with high accuracy.