# Mistral OCR 3 发布

- 来源：Mistral AI：News（网页）
- 发布时间：2025-12-17 00:00
- AIHOT 分数：55
- AIHOT 链接：https://aihot.virxact.com/items/cmppdcr7d0e39slv41c4kl2tl
- 原文链接：https://mistral.ai/news/mistral-ocr-3

## AI 摘要

Mistral AI 发布 Mistral OCR 3，这是一款专为从各类文档中高保真提取文本与嵌入图像而设计的 OCR 模型。在表单、扫描文档、复杂表格和手写体处理等基准测试中，该模型的整体胜率达到 74%，超越了 Mistral OCR 2 以及企业级与 AI 原生 OCR 方案。作为一款体积更小的模型，其定价为行业领先的每 1,000 页 2 美元（通过 Batch API 可享 50% 折扣，即 1 美元）。开发者可通过 API（模型标识符 `mistral-ocr-2512`）进行集成，其输出为包含 HTML 表格标签的 markdown 格式，便于下游系统理解文档结构。该模型适用于高量级企业文档处理流水线。

## 正文

Frontier AI LLMs, assistants, agents, services | Mistral AI

[](https://mistral.ai/)

Start building

Menu

Products

Solutions

Models

Developers

Blog

Customers

Company

Contact salesStart building

Studio Build, test, and run AI agents and apps. Forge Train, align, and evaluate custom AI models. Vibe AI agent for long-horizon work. Vibe for code Coding agents in the terminal, IDE, and background. Compute Frontier-scale infrastructure for training and inference.

Pricing

PlansAPI pricingFor enterprises

Services

Delivery methodologyModel customization

Industries

Financial servicesPublic sector & governmentManufacturing

Use cases

Use case overviewCodingDocument intelligenceSpeech

Latest models

Mistral Medium 3.5 Mistral Small 4 Mistral 3 Voxtral TTS

See all models

DocsAPI ReferenceCookbooks

Latest posts

AI Now Summit 2026Vibe gets to work.Introducing physics AI at Mistral: the foundation for engineering acceleration.

Read all news

Categories

ProductResearchEngineeringSolutionsCompany

Featured stories

CiscoHSBCHTX

See all

Who we are

About usCareersBrand

Connect

CommunityPartnersHelp center

ProductsSolutionsModelsDevelopersBlogCustomersCompany

Studio Build, test, and run AI agents and apps. Forge Train, align, and evaluate custom AI models. Vibe AI agent for long-horizon work. Vibe for code Coding agents in the terminal, IDE, and background. Compute Frontier-scale infrastructure for training and inference.

Pricing

PlansAPI pricingFor enterprises

Services

Delivery methodologyModel customization

Industries

Financial servicesPublic sector & governmentManufacturing

Use cases

Use case overviewCodingDocument intelligenceSpeech

Latest models

Mistral Medium 3.5 Mistral Small 4 Mistral 3 Voxtral TTS

See all models

DocsAPI ReferenceCookbooks

Latest posts

AI Now Summit 2026Vibe gets to work.Introducing physics AI at Mistral: the foundation for engineering acceleration.

Read all news

Categories

ProductResearchEngineeringSolutionsCompany

Featured stories

CiscoHSBCHTX

See all

Who we are

About usCareersBrand

Connect

CommunityPartnersHelp center

Start building

StudioVibeVibe for Code

Contact sales

Back to Blog 3 min read

Blog

Research Introducing Mistral OCR 3

December 17, 2025

Mistral AI

Back to Blog 3 min read

Share this post

[](https://www.facebook.com/sharer/sharer.php?u=https%3A%2F%2Fmistral.ai%2Fnews%2Fmistral-ocr-3%2F)[](https://www.linkedin.com/sharing/share-offsite/?url=https%3A%2F%2Fmistral.ai%2Fnews%2Fmistral-ocr-3%2F)[](https://x.com/intent/post?url=https%3A%2F%2Fmistral.ai%2Fnews%2Fmistral-ocr-3%2F) Highlights Breakthrough performance: 74% overall win rate over Mistral OCR 2 on forms, scanned documents, complex tables, and handwriting. State-of-the-art accuracy, outperforming both enterprise document processing solutions as well as AI-native OCR solutions Now powers Document AI Playground in Mistral AI Studio, a simple drag-and-drop interface for parsing PDFs/images into clean text or structured JSON Major upgrade over Mistral OCR 2 in forms, handwritten content, low-quality scans, and tables Overview

Mistral OCR 3 is designed to extract text and embedded images from a wide range of documents with exceptional fidelity. It supports markdown output enriched with HTML-based table reconstruction, enabling downstream systems to understand not just document content, but also structure. As a much smaller model than most competitive solutions, it is available at an industry-leading price of $2 per 1,000 pages, with a 50% Batch-API discount, reducing the cost to $1 per 1,000 pages.

Developers can integrate the model (mistral-ocr-2512) via API, and users can leverage Document AI, a UI that parses documents into text or structured JSON instantly. Benchmarks

To raise the bar, we introduced more challenging internal benchmarks based on real business use-case examples from customers. We then evaluated several models across the domains highlighted below, comparing their outputs to ground truth using fuzzy-match metric for accuracy. Upgrades over previous generations of OCR models

Whereas most OCR solutions today specialize in specific document types, Mistral OCR 3 is designed to excel at processing the vast majority of document types in organizations and everyday settings. Handwriting: Mistral OCR accurately interprets cursive, mixed-content annotations, and handwritten text layered over printed forms. Forms: Improved detection of boxes, labels, handwritten entries, and dense layouts. Works well on invoices, receipts, compliance forms, government documents, and such. Scanned & complex documents: Significantly more robust to compression artifacts, skew, distortion, low DPI, and background noise. Complex tables: Reconstructs table structures with headers, merged cells, multi-row blocks, and column hierarchies. Outputs HTML table tags with colspan/rowspan to fully preserve layout.

Video 1

Mistral OCR 3 is a significant upgrade across all languages and document form factors compared to Mistral OCR 2. Recommend use cases and applications

Mistral OCR 3 is ideal for both high-volume enterprise pipelines and interactive document workflows. Developers can use it for: Extracting text and images into markdown for downstream agents and knowledge systems Automated parsing of forms, invoices, and operational documents End-to-end document understanding pipelines Digitization of handwritten or historical documents Any other document → knowledge transformation applications.

Our early customers are using Mistral OCR 3 to process invoices into structured fields, digitize company archives, extract clean text from technical and scientific reports, and improve enterprise search.

“OCR remains foundational for enabling generative AI and agentic AI,” said Tim Law, IDC Director of Research for AI and Automation. “Those organizations that can efficiently and cost-effectively extract text and embedded images with high fidelity will unlock value and will gain a competitive advantage from their data by providing richer context.” Available today

Access the model either through the API or via the new Document AI Playground interface, both in Mistral AI Studio. Mistral OCR 3 is fully backward compatible with Mistral OCR 2. For more details, head over to mistral.ai/docs.

For organizations with stringent data privacy requirements, Mistral OCR offers a self-hosting option. This ensures that sensitive or classified information remains secure within your own infrastructure, providing compliance with regulatory and security standards. If you would like to explore self-deployment with us, please let us know.

0% Products Vibe Vibe Code Studio Forge Compute Pricing Solutions Delivery methodology Model customization Coding Document intelligence Speech Mistral for finance Mistral for public institutions Mistral for manufacturing Why Mistral About us Careers Partners Our customers Our Models Brand Legal Terms of Service Privacy Policy Privacy choices) Data processing agreement Legal notice

[](https://www.linkedin.com/company/mistralai/)[](https://x.com/mistralai)[](https://www.youtube.com/@MistralAIOfficial)[](https://discord.gg/mistralai)[](https://www.reddit.com/r/MistralAI/) Get Mistral Vibe

[](https://apps.apple.com/us/app/le-chat-by-mistral-ai/id6740410176)[](https://play.google.com/store/apps/details?id=ai.mistral.chat)

Mistral AI © 2026

Select language

English