.png)
How to Accurately Classify Documents with Intelligent OCR? A Concrete Use Case on ID Documents
Case study
Last update:
April 9, 2025
5 minutes
Mistral AI and ChatGPT offer high-performance optical character recognition (OCR). But which is really the most accurate way to extract text from invoices and documents? Discover our comparative test and our detailed results
Mistral AI vs ChatGPT Precision, speed, reliability... Find out which model best extracts text from documents!
At Koncile, we are always looking for the latest advances in the field of visual language models (VLM) and we regularly put these new technologies to the test to better understand their limits in real conditions. It is in this dynamic that we have developed our own OCR powered by AI, in order to offer a more accurate and reliable solution for extracting complex data.
Today, Mistral AI unveiled its brand new OCR model, which they present as being at the cutting edge of technology (SOTA), based on as yet unpublished benchmarks. As is often the case, excitement quickly took over the internet. The model found itself at the top of discussions on Hacker News, and many users immediately claimed that extracting text from PDFs was now a problem solved once and for all.
It is with this in mind that we chose to evaluate Mistral OCR, by comparing it with GPT chat, another major player in the world of artificial intelligence. Although Mistral claims 94.9% accuracy for its OCR and other reports suggest that ChatGPT achieves similar scores (89.77%), our tests revealed a significant gap between this theoretical performance and the real results obtained on our own data set.
We analyzed a typical invoice using Mistral's new OCR model.
Here is the data extraction legend:
The results are shown below.
Here is the legend of the reliability table:
In summary, this legend gives us a clear overview of the types of errors the tool makes, how common they are, and how they impact overall reliability.
📌 Overall reliability rate: 63.75%
So we also analyzed a standard invoice using the ChatGPT template.
The results give us a clear overview of the types of errors the tool makes, how common they are, and how they impact overall reliability.
📌 Overall reliability rate: 57.5%
Despite promising claims, our tests reveal that neither Mistral AI (63.75% reliability) nor ChatGPT (57.5%) truly deliver on their OCR capabilities.
📌 Mistral AI excels in pure transcription with 98.75% accuracy, but struggles with 27.5% missing data.
📌 ChatGPT, while better at positioning data, loses even more essential information, with 42.5% missing data.
🔍 The verdict is clear: neither model guarantees reliable and complete data extraction, especially for complex documents like invoices.
At Koncile, we’ve developed a next-generation OCR that combines high-precision extraction with intelligent document understanding. Our optimized AI drastically reduces errors and ensures accurate data extraction, even from non-standardized documents.
💡 Why choose Koncile OCR?
Higher reliability – Our model is designed to minimize errors
Fewer missing data & better information structuring
Adapted for complex documents – Perfect for invoices, contracts, and reports
For businesses that rely on precise and structured data extraction, Koncile OCR is the superior alternative.
Resources
How to Accurately Classify Documents with Intelligent OCR? A Concrete Use Case on ID Documents
Case study
Compare 4 OCRs according to your business uses, types of documents, API integration, customization and business logic.
Blog
Complete comparison of the best OCR solutions: Performances, use cases, prices.
Blog