Mistral has released Mistral OCR 4, advancing document understanding with support for bounding boxes, block classification, and inline confidence scores. Alongside the textual output, each extracted content block is now localized, classified by type, and accompanied by per-page and per-word confidence metrics. The model supports 170 languages across 10 language groups, including rare and low-resource languages, addressing a gap in many existing solutions. Mistral OCR 4 also accepts common enterprise document formats (PDF, DOC, PPT, and OpenDocument), broadening its suitability for corporate workflows. For deployment, it runs as a single container and can be fully self-hosted. This design lets organizations run cost-sensitive or high-volume workloads while maintaining strict data sovereignty by keeping document processing within on-premises infrastructure. These capabilities allow the model to serve not only a...