How to Use
Our AI OCR capabilities are available via the /POST endpoint.
- Simply send a file to receive structured text output (exact bounding box coordinates for text position are coming soon)
- Fetch additional data from online sources via your custom AI schema (cross-file retrieval is coming soon)
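As a rough illustration of the "send a file" step, the sketch below builds a multipart POST request using only the Python standard library. The URL, the form field names (`model`, `file`), and the assumption that the response is JSON are all hypothetical placeholders, since this page does not specify them; check the API reference for the real values.

```python
import json
import urllib.request
import uuid

# Hypothetical endpoint URL: this page does not give the real one.
API_URL = "https://api.example.com/ocr"


def build_ocr_request(file_name, file_bytes, model, api_url=API_URL):
    """Build (but do not send) a multipart/form-data POST carrying one file.

    The field names "model" and "file" are assumptions for illustration.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        api_url,
        data=head + file_bytes + tail,
        method="POST",
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )


def send(request):
    """Send the request and decode the body, assuming the API returns JSON."""
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call might look like `send(build_ocr_request("scan.pdf", open("scan.pdf", "rb").read(), "Beethoven_ENG_GP25"))`. Keeping request construction separate from sending makes it possible to inspect the payload before anything goes over the network.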
List of AI OCR models available
English Models:
- Beethoven_ENG_O5.6 - OpenAI v6
- Beethoven_ENG_G5.5 - Gemini v5
- Beethoven_ENG_GP25 - Gemini Pro 2.5
- Beethoven_ENG_GP25.1 - Gemini Pro 2.5 v1
- Beethoven_ENG_GP25.2 - Gemini Pro 2.5 PDF
- Beethoven_ENG_GP3 - Gemini Pro 3
- Beethoven_CUS_O5.1 - Custom OpenAI v8
- Beethoven_CUS_O5.2 - Custom Gemini v13
- Unified (google-document-ai-ocr-gemini-v10) - Unified model
- Beethoven_ZH_O5.9 - Chinese OpenAI v9
- Beethoven_JP_O5.3 - Japanese OpenAI v3
- Beethoven_JP_G5.4 - Japanese Gemini fine-tuned
- Beethoven_TH_O5.1 - Thai OpenAI v1
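The `Beethoven_*` identifiers above appear to follow a `Beethoven_<LANGUAGE>_<VARIANT>` pattern. Assuming that pattern holds, a small lookup helper could pick the models for a given language code. The helper name `models_for_language` is hypothetical, and the Unified model is left out because the list does not make clear which string is its request identifier.

```python
# Model identifiers and descriptions copied from the list above.
MODELS = {
    "Beethoven_ENG_O5.6": "OpenAI v6",
    "Beethoven_ENG_G5.5": "Gemini v5",
    "Beethoven_ENG_GP25": "Gemini Pro 2.5",
    "Beethoven_ENG_GP25.1": "Gemini Pro 2.5 v1",
    "Beethoven_ENG_GP25.2": "Gemini Pro 2.5 PDF",
    "Beethoven_ENG_GP3": "Gemini Pro 3",
    "Beethoven_CUS_O5.1": "Custom OpenAI v8",
    "Beethoven_CUS_O5.2": "Custom Gemini v13",
    "Beethoven_ZH_O5.9": "Chinese OpenAI v9",
    "Beethoven_JP_O5.3": "Japanese OpenAI v3",
    "Beethoven_JP_G5.4": "Japanese Gemini fine-tuned",
    "Beethoven_TH_O5.1": "Thai OpenAI v1",
}


def models_for_language(code):
    """Return model IDs whose second underscore-separated field equals `code`.

    Relies on the assumed Beethoven_<LANGUAGE>_<VARIANT> naming pattern.
    """
    return [name for name in MODELS if name.split("_")[1:2] == [code]]
```

For example, `models_for_language("JP")` returns the two Japanese model identifiers in the order they are listed.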