Mistral AI 发布 Mistral OCR：新一代文档理解 OCR API

2025-03-06 00:00·484天前

AI 摘要

Mistral AI 推出 Mistral OCR，一款专注于文档理解的光学字符识别 API。该模型支持图像和 PDF 输入，能高精度提取并理解文本、表格、公式及内联图像，输出有序的文本与图像内容。其在内部基准测试中总分 94.89，超越了 GPT-4o-2024-11-20（89.77）与 Gemini-2.0-Flash-001（88.69）。API 命名为 mistral-ocr-latest，定价为 1000 页每美元，批量推理时处理能力翻倍。该 API 已在 la Plateforme 上线，支持部分组织自托管。模型原生支持多语言，单节点处理速度可达每分钟 2000 页。

原文 · 未翻译

Throughout history, advancements in information abstraction and retrieval have driven human progress. From hieroglyphs to papyri, the printing press to digitization, each leap has made human knowledge more accessible and actionable, fueling further innovation.

Today, we’re at the precipice of the next big leap—to unlock the collective intelligence of all digitized information. Approximately 90% of the world’s organizational data is stored as documents, and to harness this potential, we are introducing Mistral OCR .Mistral OCR is an Optical Character Recognition API that sets a new standard in document understanding. Unlike other models, Mistral OCR comprehends each element of documents—media, text, tables, equations—with unprecedented accuracy and cognition. It takes images and PDFs as input and extracts content in an ordered interleaved text and images.

As a result, Mistral OCR is an ideal model to use in combination with a RAG system taking multimodal documents (such as slides or complex PDFs) as input.

We have made Mistral OCR as the default model for document understanding across millions of users on Le Chat, and are releasing the API mistral-ocr-latest at 1000 pages / $ (and approximately double the pages per dollar with batch inference). The API is available today on our developer suite la Plateforme , and coming soon to our cloud and inference partners, as well as on-premises.

Highlights

State of the art understanding of complex documents

Natively multilingual and multimodal

Top-tier benchmarks

Fastest in its category

Doc-as-prompt, structured output

Mistral AI：News（网页）

42导出 Markdown

Mistral AI 发布 Mistral OCR：新一代文档理解 OCR API

2025-03-06 00:00·484天前

阅读原文· mistral.ai

AI 摘要

原文 · 保持原样，未翻译