# AI OCR models

> Our latest AI models bring advanced Optical Character Recognition (OCR) capabilities to your applications and support a variety of use cases.

### **How to Use**

Our AI OCR capabilities are available via the [POST endpoint](https://developers.file.ai/reference/publicapicontroller_uploadfilerequest#/).

* Send a file to receive structured text output (*exact bounding box coordinates for text position coming soon*)
* Fetch additional data from online sources via your custom AI schema (*cross-file retrieval coming soon*)
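
As a rough illustration of the "send a file" step, the sketch below builds a multipart upload request using only the Python standard library. This page does not specify the request format, so the URL, the `x-api-key` header, and the `file` and `model` form fields are all placeholders assumed for illustration; check the linked API reference for the real values before using it.

```python
import uuid
import urllib.request

# Hypothetical values: none of these come from this page.
API_URL = "https://api.example.com/files"  # placeholder, not the real endpoint
API_KEY_HEADER = "x-api-key"               # assumed header name


def build_upload_request(file_name, file_bytes, model, api_key):
    """Build (but do not send) a multipart/form-data POST request."""
    boundary = uuid.uuid4().hex
    parts = [
        # Assumed form field carrying the OCR model name.
        (
            f"--{boundary}\r\n"
            'Content-Disposition: form-data; name="model"\r\n\r\n'
            f"{model}\r\n"
        ).encode(),
        # Assumed form field carrying the file itself.
        (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
            "Content-Type: application/octet-stream\r\n\r\n"
        ).encode(),
        file_bytes,
        f"\r\n--{boundary}--\r\n".encode(),
    ]
    return urllib.request.Request(
        API_URL,
        data=b"".join(parts),
        method="POST",
        headers={
            "Content-Type": f"multipart/form-data; boundary={boundary}",
            API_KEY_HEADER: api_key,
        },
    )
```

Once the placeholders are replaced with the documented values, the request could be sent with `urllib.request.urlopen(request)` and the response body parsed as the API reference describes.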

### **List of AI OCR models available**

English Models:

* Beethoven\_ENG\_O5.6 - OpenAI v6
* Beethoven\_ENG\_G5.5 - Gemini v5
* Beethoven\_ENG\_GP25 - Gemini Pro 2.5
* Beethoven\_ENG\_GP25.1 - Gemini Pro 2.5 v1
* Beethoven\_ENG\_GP25.2 - Gemini Pro 2.5 PDF
* Beethoven\_ENG\_GP3 - Gemini Pro 3
* Beethoven\_CUS\_O5.1 - Custom OpenAI v8
* Beethoven\_CUS\_O5.2 - Custom Gemini v13
* Unified (google-document-ai-ocr-gemini-v10) - Unified model

Chinese Models:

* Beethoven\_ZH\_O5.9 - Chinese OpenAI v9

Japanese Models:

* Beethoven\_JP\_O5.3 - Japanese OpenAI v3
* Beethoven\_JP\_G5.4 - Japanese Gemini fine-tuned

Thai Models:

* Beethoven\_TH\_O5.1 - Thai OpenAI v1
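
For client code that picks a model by document language, the list above could be kept as a small lookup table. This is only a sketch: the grouping mirrors the headings on this page, the `models_for` helper is hypothetical, and whether the API accepts these exact strings as model identifiers is an assumption. The Unified model is left out because its identifier format is not clear from the list.

```python
from typing import Dict, List

# Model names copied from the list above, grouped by the page's headings.
MODELS_BY_LANGUAGE: Dict[str, List[str]] = {
    "english": [
        "Beethoven_ENG_O5.6",
        "Beethoven_ENG_G5.5",
        "Beethoven_ENG_GP25",
        "Beethoven_ENG_GP25.1",
        "Beethoven_ENG_GP25.2",
        "Beethoven_ENG_GP3",
        "Beethoven_CUS_O5.1",
        "Beethoven_CUS_O5.2",
    ],
    "chinese": ["Beethoven_ZH_O5.9"],
    "japanese": ["Beethoven_JP_O5.3", "Beethoven_JP_G5.4"],
    "thai": ["Beethoven_TH_O5.1"],
}


def models_for(language: str) -> List[str]:
    """Return the listed model names for a language, or an empty list."""
    return list(MODELS_BY_LANGUAGE.get(language.strip().lower(), []))
```

For example, `models_for("Japanese")` returns the two Japanese model names, and an unlisted language returns an empty list rather than raising an error.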
