How to Use

Our AI OCR capabilities are available via the /POST endpoint.
  • Send a file to receive structured text output. Exact bounding box coordinates for text position are coming soon.
  • Fetch additional data from online sources via your custom AI schema. Cross-file retrieval is coming soon.
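
As a rough illustration of the file upload described above, here is a minimal Python sketch that builds a multipart POST request using only the standard library. The URL, the form field names ("model" and "file"), and the JSON response format are all assumptions made for the example; none of them are specified on this page, so check them against your account's API reference before use.

```python
import json
import urllib.request
import uuid

# Hypothetical URL: this page does not give the full endpoint address.
OCR_URL = "https://api.example.com/ocr"


def build_ocr_request(file_bytes, filename, model, url=OCR_URL):
    """Build (but do not send) a multipart/form-data POST request.

    The field names "model" and "file" are assumptions for illustration.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")

    request = urllib.request.Request(url, data=head + file_bytes + tail, method="POST")
    request.add_header("Content-Type", f"multipart/form-data; boundary={boundary}")
    return request


def send_ocr_request(request):
    """Send the request and decode the body, assuming the service returns JSON."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

A call might look like `send_ocr_request(build_ocr_request(open("invoice.pdf", "rb").read(), "invoice.pdf", "Beethoven_ENG_GP25"))`. Authentication is omitted because this page does not describe it.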

Available AI OCR models

English Models:
  • Beethoven_ENG_O5.6 - OpenAI v6
  • Beethoven_ENG_G5.5 - Gemini v5
  • Beethoven_ENG_GP25 - Gemini Pro 2.5
  • Beethoven_ENG_GP25.1 - Gemini Pro 2.5 v1
  • Beethoven_ENG_GP25.2 - Gemini Pro 2.5 PDF
  • Beethoven_CUS_O5.1 - Custom OpenAI v8
  • Beethoven_CUS_O5.2 - Custom Gemini v13
  • Unified (google-document-ai-ocr-gemini-v10) - Unified model
Chinese Models:
  • Beethoven_ZH_O5.9 - Chinese OpenAI v9
Japanese Models:
  • Beethoven_JP_O5.3 - Japanese OpenAI v3
  • Beethoven_JP_G5.4 - Japanese Gemini fine-tuned
Thai Models:
  • Beethoven_TH_O5.1 - Thai OpenAI v1
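
One way to work with the list above in client code is a small lookup table keyed by language. The Python sketch below copies the Beethoven identifiers exactly as listed; the dictionary layout and the `models_for` helper are hypothetical conveniences, not part of the service. The Unified model is left out because the list does not make clear which string should be sent as its identifier.

```python
# Model identifiers copied from the list above, grouped as the page groups them.
# (The custom models appear under "English Models" on the page.)
MODELS = {
    "English": [
        "Beethoven_ENG_O5.6",
        "Beethoven_ENG_G5.5",
        "Beethoven_ENG_GP25",
        "Beethoven_ENG_GP25.1",
        "Beethoven_ENG_GP25.2",
        "Beethoven_CUS_O5.1",
        "Beethoven_CUS_O5.2",
    ],
    "Chinese": ["Beethoven_ZH_O5.9"],
    "Japanese": ["Beethoven_JP_O5.3", "Beethoven_JP_G5.4"],
    "Thai": ["Beethoven_TH_O5.1"],
}


def models_for(language):
    """Return the listed model identifiers for a language, or an empty list."""
    return list(MODELS.get(language, []))
```

For example, `models_for("Japanese")` returns both Japanese identifiers, and an unlisted language returns an empty list rather than raising an error.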