01 / Overview

ERNIE-Image at a glance.

01

Baidu shipped ERNIE 5.0 on January 22, 2026 as a 2.4-trillion-parameter mixture-of-experts model with under three percent of parameters active per token, a 128K-token context window, and native omni-modal input and output across text, image, audio, and video. The release pairs the frontier text tier with GenFlow 3.0 agents and a 200 million monthly active user footprint inside Baidu's consumer surface, and it sits at 1460 on LMArena (measured January 15, 2026) for rank one in China and rank eight globally. On the Artificial Analysis Intelligence Index the model scores 29, which lands below the current frontier aggregate of 57 but keeps pace with the top tier on Chinese reading, math, and code categories. You reach ERNIE 5.0 text and omni-modal generation through Baidu's Qianfan API at roughly $0.60 per one million input tokens and $2.10 per one million output tokens, and that is where every text deep dive on this site points its code samples.
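The per-token rates above make cost estimates simple arithmetic. A minimal sketch, assuming the quoted Qianfan rates ($0.60 input, $2.10 output per million tokens) as constants:

```typescript
// Back-of-envelope Qianfan cost estimate for an ERNIE 5.0 call.
// Rates are the ones quoted above and may change; treat them as assumptions.
const ERNIE5_INPUT_PER_M = 0.6; // USD per 1M input tokens
const ERNIE5_OUTPUT_PER_M = 2.1; // USD per 1M output tokens

function ernie5CostUSD(inputTokens: number, outputTokens: number): number {
  return (
    (inputTokens / 1_000_000) * ERNIE5_INPUT_PER_M +
    (outputTokens / 1_000_000) * ERNIE5_OUTPUT_PER_M
  );
}

// A 50K-token prompt with a 10K-token response:
// 0.05 * 0.60 + 0.01 * 2.10 = $0.051
console.log(ernie5CostUSD(50_000, 10_000).toFixed(4));
```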

02

ERNIE-Image is the image generation sibling and the endpoint you drive from the playground on this site. It is an 8 billion parameter Diffusion Transformer released under Apache 2.0 with open weights, and it is the reason this blog exists as a working surface rather than a pure reading room. On GenEval with the prompt enhancer enabled it posts 0.8728 and on LongTextBench it lands at 0.9733, both of which place it at the top of the open-weight field for typography density and CJK glyph fidelity. fal hosts five endpoints you can call today. fal-ai/ernie-image is the 50 step standard model at $0.03 per megapixel, fal-ai/ernie-image/turbo is the 8 step fast path at $0.01 per megapixel, fal-ai/ernie-image/lora and fal-ai/ernie-image/lora/turbo layer custom adapters on top of either base, and fal-ai/ernie-image-trainer lets you fit your own LoRA from a zip of reference frames.
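Because the fal endpoints bill per megapixel, cost scales with pixel area rather than image count. A quick sketch of the math, using the rates quoted above ($0.03 standard, $0.01 Turbo) as assumptions:

```typescript
// Per-megapixel billing: megapixels are raw pixel count divided by 1,000,000.
function imageCostUSD(width: number, height: number, ratePerMP: number): number {
  const megapixels = (width * height) / 1_000_000;
  return megapixels * ratePerMP;
}

// 1024x1024 is ~1.05 MP, so the standard endpoint bills ~$0.0315
// and Turbo bills ~$0.0105 for the same frame.
console.log(imageCostUSD(1024, 1024, 0.03).toFixed(4));
console.log(imageCostUSD(1024, 1024, 0.01).toFixed(4));
```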

03

The split matters because ERNIE 5.0 and ERNIE-Image solve different problems and bill on different meters. You use ERNIE 5.0 when you want long-context reasoning, Chinese-first knowledge, or omni-modal output from a single call, and you pay Baidu Qianfan per token for it. You use ERNIE-Image when you need a poster, a menu, a multi-panel comic, a dense keynote slide, or anything with legible Chinese type at scale, and you pay fal per megapixel for it. The editorial side of this blog covers ERNIE 5.0 benchmarks, agent workflows, and Qianfan setup. The playground side ships live pixels from fal-ai/ernie-image. Read a post for intelligence, hit the playground for images, and the pricing page tells you which meter you are about to spend on.

01 / Who it's for
  • 01 Chinese-market product teams shipping bilingual posters, packaging, and social
  • 02 Agencies building comic strips, dense menu cards, and multi-panel visual assets
  • 03 Research teams evaluating frontier omni-modal models against GPT-5 and Gemini 3.1
  • 04 Indie devs who want an open-weight image model with a hosted fal endpoint
  • 05 Localization teams migrating from DALL-E 3 or Imagen 4 to a CJK-native pipeline
02 / When to pick
  • 01 Your output has to render legible Chinese, Japanese, or Korean glyphs at poster scale
  • 02 You need dense text layouts, aligned columns, or multi-panel comics with consistent type
  • 03 You want an Apache 2.0 open-weight image model you can also self-host or fine-tune
  • 04 You want frontier-grade reasoning in a 128K window at a lower price than GPT-5
  • 05 You need one model for text, image, audio, and video in a single omni-modal call
03 / Infrastructure

fal hosts ERNIE-Image with both the 50 step standard endpoint and the 8 step Turbo, plus LoRA variants and a trainer, all on a single API key with async queues and webhooks, so you never touch a GPU to ship Chinese-first typography at production scale.
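The async queue workflow can be sketched as a payload you hand to the fal client. The field names below follow the fal client convention, and the webhook URL is a placeholder for your own receiver; this is a sketch, not the canonical integration:

```typescript
// Queue submission payload for fal's async API, built as a pure function
// so the shape is easy to test without touching the network.
type ErnieImageJob = {
  input: {
    prompt: string;
    aspect_ratio: string;
    num_inference_steps: number;
  };
  webhookUrl: string;
};

function posterJob(prompt: string, webhookUrl: string): ErnieImageJob {
  return {
    input: { prompt, aspect_ratio: "3:4", num_inference_steps: 50 },
    webhookUrl,
  };
}

// With @fal-ai/client this payload goes straight into the queue, roughly:
//   const { request_id } = await fal.queue.submit("fal-ai/ernie-image", posterJob(p, url));
//   const status = await fal.queue.status("fal-ai/ernie-image", { requestId: request_id, logs: true });
// fal POSTs the finished payload to webhookUrl, so bursts never block your process.

console.log(posterJob("Bilingual cafe poster", "https://example.com/fal-webhook").input.aspect_ratio);
```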

02 / Integration

Call ERNIE-Image in under 20 lines.

TypeScript · fal-ai/ernie-image
import { fal } from "@fal-ai/client";

fal.config({ credentials: process.env.FAL_KEY });

const result = await fal.subscribe("fal-ai/ernie-image", {
  input: {
    prompt:
      "A vertical coffee shop poster. Large centered headline reads 'MORNING POUR' in bold serif, Chinese subline 手冲咖啡 八点开门 in elegant brush script beneath. Three espresso cups line the bottom with prices 38元 48元 58元. Warm cream background, deep espresso brown type, thin gold accent rule across the middle.",
    aspect_ratio: "3:4",
    num_images: 1,
    enable_prompt_enhancer: true,
    num_inference_steps: 50,
  },
  logs: true,
  onQueueUpdate: (update) => {
    if (update.status === "IN_PROGRESS") {
      update.logs?.map((log) => log.message).forEach(console.log);
    }
  },
});

console.log(result.data.images[0].url);
console.log(`Seed: ${result.data.seed}`);
Expected output
{ images: [{ url: "https://v3.fal.media/files/ernie/..." }], seed: 412839 }
Full API reference
03 / Pricing

What ERNIE-Image costs on fal.ai.

01 fal-ai/ernie-image
$0.03 per megapixel

1 image at 1024x1024 (1.05 MP)

$0.0315
02 fal-ai/ernie-image/turbo
$0.01 per megapixel

1 image at 1024x1024 (8 steps)

$0.0105
03 fal-ai/ernie-image/lora
$0.03 per megapixel

1 image at 1024x1024 with custom LoRA

$0.0315
04 fal-ai/ernie-image/lora/turbo
$0.01 per megapixel

1 image at 1024x1024 with LoRA (8 steps)

$0.0105
05 Qianfan API: ERNIE 5.0 input
$0.60 per 1M tokens

50K input token prompt

$0.03
06 Qianfan API: ERNIE 5.0 output
$2.10 per 1M tokens

10K output token response

$0.021

ERNIE 5.0 text and omni-modal runs via Baidu Qianfan; ERNIE-Image image gen runs on fal.ai.

Official pricing page
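A real pipeline often hits both meters in one pass: an ERNIE 5.0 call drafts the layout, then ERNIE-Image renders it. A combined estimate, treating the rates in the table above as assumptions:

```typescript
// One pipeline, two meters: per-token Qianfan text plus per-megapixel fal image.
// Rates are the ones quoted on this page; swap in current prices before relying on this.
function pipelineCostUSD(
  inputTokens: number,
  outputTokens: number,
  imageWidth: number,
  imageHeight: number,
  imageRatePerMP: number,
): number {
  const textCost = (inputTokens / 1e6) * 0.6 + (outputTokens / 1e6) * 2.1;
  const imageCost = ((imageWidth * imageHeight) / 1e6) * imageRatePerMP;
  return textCost + imageCost;
}

// 2K-token prompt, 500-token layout draft, one 1024x1024 standard render:
// 0.002 * 0.60 + 0.0005 * 2.10 + 1.048576 * 0.03 ≈ $0.0337
console.log(pipelineCostUSD(2000, 500, 1024, 1024, 0.03).toFixed(4));
```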
04 / Comparison

ERNIE-Image vs the field.

01 · PRIMARY · fal-ai/ernie-image
ERNIE-Image
Res 2K (tiled) · Dur n/a · Price $0.03/MP · Score GenEval 0.87

Chinese-first typography, dense text, multi-panel comics

02 · fal-ai/flux-2-pro
Flux 2 Pro
Res 2K · Dur n/a · Price $0.06/MP · Score GenEval 0.85

Photoreal hero stills and general prompt fidelity

03 · -
GPT Image 2
Res 4096px · Dur n/a · Price $0.19/image · Score GenEval 0.83

ChatGPT workflow integration, safety-tuned output

04 · fal-ai/ideogram/v3
Ideogram 3
Res 2048px · Dur n/a · Price $0.08/image · Score GenEval 0.79

English typography and logo-ready posters

05 · fal-ai/imagen4
Imagen 4
Res 2K · Dur n/a · Price $0.04/image · Score GenEval 0.84

Photography realism and Google ecosystem integration

If your output has to carry legible Chinese characters, dense columns of type, or multi-panel comic layouts, ERNIE-Image wins on both quality and price. For English-only photoreal hero shots, Flux 2 Pro or Imagen 4 still sets the bar.
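The table mixes per-megapixel and per-image prices, so a fair comparison needs a common unit. A small normalizer, assuming a 1024x1024 output as the reference size:

```typescript
// Normalize a per-image price to an effective per-megapixel rate at a given
// output size. Prices are the ones listed in the comparison above; the result
// is only as meaningful as the assumed resolution.
function perImageToPerMP(pricePerImage: number, width: number, height: number): number {
  return pricePerImage / ((width * height) / 1_000_000);
}

// GPT Image 2 at $0.19/image rendered at 1024x1024 works out to ~$0.18/MP,
// roughly six times ERNIE-Image's $0.03/MP on the standard endpoint.
console.log(perImageToPerMP(0.19, 1024, 1024).toFixed(3));
```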

By the numbers

What this publication is and isn't, in numbers.

01 / Published posts
10

Each one is dated, second-person, and opinionated.

02 / Topic categories
7

Filter by the constraint you care about.

03 / Total reading time
67 min

Total length of every post in the archive.

04 / Em-dashes tolerated
0

Not a single U+2014 survives our ship check.

05 / Featured picks
1

Editor-selected cover stories.

06 / Posts illustrated
100%

Custom covers on every featured post.

05 / FAQ

Frequently asked.

01 How does ERNIE 5.0 score on benchmarks versus GPT-5 and Claude Opus 4.7?

ERNIE 5.0 posts 1460 on LMArena as of January 15, 2026, which puts it at rank one inside China and rank eight globally. On the Artificial Analysis Intelligence Index it scores 29 against a frontier aggregate of 57, so GPT-5, Claude Opus 4.7, and Gemini 3.1 Pro still lead on composite reasoning. ERNIE 5.0 leads on Chinese reading comprehension, long-context Chinese code tasks, and omni-modal generation from a single call. You reach the text tier through Baidu's Qianfan API at https://qianfan.cloud.baidu.com for roughly $0.60 input and $2.10 output per million tokens, which lands it below GPT-5 on price while keeping pace on most CJK evaluations.

02 How does ERNIE-Image quality compare on Chinese and CJK glyphs?

ERNIE-Image is the current open-weight leader on dense CJK typography. On LongTextBench it scores 0.9733, the highest published number for open weights, and on GenEval with the prompt enhancer it lands at 0.8728. You can drive it live from fal-ai/ernie-image and ask for multi-panel menus, bilingual posters, or dense keynote slides and it will render every character legibly, even at small sizes. Flux 2 Pro and Ideogram 3 both ship good English typography but trip on simplified and traditional Chinese glyphs beyond ten characters. If your layout is Chinese-first or bilingual, ERNIE-Image is the endpoint you want.

03 Why do text and image billing split between Qianfan and fal.ai?

They are different products on different infrastructure. ERNIE 5.0 is Baidu's frontier text and omni-modal tier served from Baidu Qianfan at https://qianfan.cloud.baidu.com with per-token billing at roughly $0.60 input and $2.10 output per million tokens. ERNIE-Image is the 8 billion parameter open-weight DiT hosted on fal at fal-ai/ernie-image, fal-ai/ernie-image/turbo, and the LoRA variants, all billed per megapixel at $0.03 standard or $0.01 on Turbo. The editorial side of this blog covers ERNIE 5.0 reasoning and agent workflows. The playground you see here runs fal-ai/ernie-image so you can ship pixels without a Baidu account.

04 Can I access ERNIE from outside China?

Yes for both tiers, with different paths. ERNIE-Image runs on fal at fal-ai/ernie-image with a single FAL_KEY, so you reach it from anywhere fal is reachable. ERNIE 5.0 text runs on Baidu Qianfan at https://qianfan.cloud.baidu.com, which now supports international sign-ups and non-CN billing for developers outside mainland China. If your stack is global-first and you only need the image tier, stay on fal. If you need ERNIE 5.0 text reasoning, create a Qianfan account, generate an API key, and call the chat completions endpoint like any OpenAI-compatible service.

05 How do I train a LoRA on ERNIE-Image?

Use fal-ai/ernie-image-trainer. You upload a zip of 15 to 50 reference frames, set the subject or style name, and the trainer fits an adapter you can then plug into either fal-ai/ernie-image/lora (50 step standard at $0.03 per megapixel) or fal-ai/ernie-image/lora/turbo (8 step fast at $0.01 per megapixel). The open-weight Apache 2.0 base means you can also pull the weights from Hugging Face and train locally on a single H100, but the fal trainer is the one-click path if you want a hosted adapter in under an hour.
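The train-then-serve flow above can be sketched as a pre-flight check plus the call shape. The 15-to-50 frame window comes from this section; the trainer's exact input field names (images_data_url, trigger_word, lora_url, loras) are assumptions for illustration, not the confirmed schema:

```typescript
// Pre-flight: enforce the reference-frame window this section describes.
function validateTrainingSet(frameCount: number): void {
  if (frameCount < 15 || frameCount > 50) {
    throw new Error(`expected 15-50 reference frames, got ${frameCount}`);
  }
}

// With @fal-ai/client, training then serving looks roughly like this
// (field names are placeholders; check the endpoint schema on fal):
//   const trained = await fal.subscribe("fal-ai/ernie-image-trainer", {
//     input: { images_data_url: "https://your.host/frames.zip", trigger_word: "brandmark" },
//   });
//   const image = await fal.subscribe("fal-ai/ernie-image/lora", {
//     input: { prompt: "brandmark on a poster", loras: [{ path: trained.data.lora_url }] },
//   });

validateTrainingSet(32); // passes; 10 or 60 would throw
console.log("training set ok");
```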

06 When does ERNIE-Image beat Flux 2 Pro or GPT Image 2?

Three scenarios. One, any layout that has to carry legible simplified or traditional Chinese characters at any scale beyond a short caption. ERNIE-Image posts 0.9733 on LongTextBench where Flux 2 Pro trips past ten CJK glyphs. Two, multi-panel comic strips and dense menu cards where consistent alignment across ten plus text blocks matters more than photoreal surface detail. Three, cost-sensitive production where you want to ship thousands of variants a day. fal-ai/ernie-image is $0.03 per megapixel versus Flux 2 Pro at $0.06. For English-only photoreal hero stills, Flux 2 Pro and Imagen 4 still win on skin, hair, and subtle lighting.

07 How do I migrate a DALL-E 3 pipeline to ERNIE-Image?

Swap the client and rewrite three parameters. DALL-E 3 takes size strings like '1024x1024'; fal-ai/ernie-image takes aspect_ratio as '1:1', '16:9', '9:16', '4:3', '3:4', '3:2', '2:3', or '21:9'. DALL-E 3's quality flag becomes num_inference_steps (50 on the standard endpoint, 8 on Turbo). DALL-E 3's style parameter maps to enable_prompt_enhancer; leave it true unless you want strict literal prompts. Keep your prompt text as-is, install @fal-ai/client, set FAL_KEY, and call fal.subscribe('fal-ai/ernie-image'). You drop from $0.04 per image on DALL-E 3 to $0.03 per megapixel on ERNIE-Image and pick up CJK typography as a bonus.
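The size-string swap is the only mapping that needs code. A migration shim, using the aspect-ratio list this FAQ quotes; picking the closest ratio by absolute difference is my choice, not a documented rule:

```typescript
// Map a DALL-E 3 size string to the nearest ERNIE-Image aspect_ratio.
const RATIOS: Array<[string, number]> = [
  ["1:1", 1 / 1], ["16:9", 16 / 9], ["9:16", 9 / 16], ["4:3", 4 / 3],
  ["3:4", 3 / 4], ["3:2", 3 / 2], ["2:3", 2 / 3], ["21:9", 21 / 9],
];

function toAspectRatio(size: string): string {
  const [w, h] = size.split("x").map(Number);
  const target = w / h;
  let best = RATIOS[0];
  for (const candidate of RATIOS) {
    if (Math.abs(candidate[1] - target) < Math.abs(best[1] - target)) best = candidate;
  }
  return best[0];
}

// The three DALL-E 3 sizes map to:
console.log(toAspectRatio("1024x1024")); // "1:1"
console.log(toAspectRatio("1792x1024")); // "16:9"
console.log(toAspectRatio("1024x1792")); // "9:16"
```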

08 How do I produce dense text layouts and multi-panel comics?

Three levers on fal-ai/ernie-image. One, keep enable_prompt_enhancer set to true. It bumps GenEval to 0.8728 and cleans up layout instructions. Two, write prompts as structured blocks. State the panel grid, the text in each panel, and the exact position of every headline and subline. Three, pick an aspect_ratio that matches the layout. Use '3:4' for vertical posters, '4:3' for menu cards, and '1:1' for 2x2 comic grids. For four-panel comics with consistent type, describe panel 1 through panel 4 explicitly and put the speech bubble text in quotes so the model treats it as glyphs rather than decoration.
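The "structured blocks" lever can be captured in a small prompt builder that states the grid, then each panel's quoted speech text. The phrasing template is mine, not an official prompt format:

```typescript
// Build a multi-panel comic prompt: grid first, then one block per panel,
// with speech text quoted so the model treats it as glyphs.
function comicPrompt(
  cols: number,
  rows: number,
  panels: Array<{ scene: string; speech: string }>,
): string {
  const lines = [`A ${cols}x${rows} comic grid, consistent type across all panels.`];
  panels.forEach((p, i) => {
    lines.push(`Panel ${i + 1}: ${p.scene}. Speech bubble reads '${p.speech}'.`);
  });
  return lines.join(" ");
}

console.log(
  comicPrompt(2, 2, [
    { scene: "Barista greets the morning rush", speech: "早上好" },
    { scene: "Pour-over close-up", speech: "手冲咖啡" },
    { scene: "Customer takes the first sip", speech: "太香了" },
    { scene: "Shop sign at dusk", speech: "明天见" },
  ]),
);
```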

09 How do I set up Baidu Qianfan for ERNIE 5.0 text calls?

Create a Qianfan account at https://qianfan.cloud.baidu.com, verify your identity, and create an application in the console. Grab the API key and secret key from the application detail page. Qianfan exposes an OpenAI-compatible chat completions endpoint, so you can use the official openai SDK by pointing base_url at the Qianfan endpoint and passing your API key. Call the ERNIE 5.0 model id as listed in the console. Billing is usage-based at roughly $0.60 per million input tokens and $2.10 per million output tokens, with a free tier for initial testing. For omni-modal requests pass image, audio, or video blocks in the messages array.
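The OpenAI-compatible setup above can be sketched over plain fetch, no SDK required. The base URL path and the model id below are placeholders; copy the exact values from your Qianfan console before shipping this:

```typescript
const QIANFAN_BASE = "https://qianfan.cloud.baidu.com/v2"; // placeholder: check your console

// Pure request-body builder, so the payload shape is testable offline.
function chatBody(model: string, userText: string) {
  return { model, messages: [{ role: "user", content: userText }] };
}

// The actual call, OpenAI-compatible chat completions over fetch (Node 18+).
async function askErnie(prompt: string): Promise<string | undefined> {
  const res = await fetch(`${QIANFAN_BASE}/chat/completions`, {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.QIANFAN_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(chatBody("ernie-5.0", prompt)), // model id: placeholder
  });
  const data: any = await res.json();
  return data.choices?.[0]?.message?.content;
}

// Only hit the network when a key is configured.
if (process.env.QIANFAN_API_KEY) {
  void askErnie("Summarize ERNIE 5.0 in one sentence.").then(console.log);
}
```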

10 Why run ERNIE-Image on fal.ai?

Eight reasons stack up. One, single FAL_KEY covers fal-ai/ernie-image, the Turbo variant, both LoRA endpoints, and the trainer. Two, async queues with webhooks handle bursts without cold starts. Three, per-megapixel billing at $0.03 standard and $0.01 Turbo beats hosted GPU rentals once you pass a few hundred images a day. Four, LoRA training and serving run on the same API key and URL scheme, so fine-tune to production is one line. Five, fal auto-scales, no instance warmup. Six, the endpoint sits alongside 600 plus other models, so you can chain ERNIE-Image with upscalers, video models, or LLMs in one pipeline. Seven, Apache 2.0 open weights means you can always self-host if fal pricing ever stops working. Eight, logs and queue status are first class, so you get observability without wiring Prometheus yourself.

Further reading