Upload a file to the server. This endpoint returns a presigned upload URL to be used for the upload.
The presignedUploadURL is valid for 300 seconds (5 minutes) and can be used multiple times.
The input should contain the following information:
fileName: the name of the file to be uploaded
fileType: the type of the file to be uploaded
isSplit: whether the file is a split file or not (optional, default: false)
isSplitExcel: whether to split Excel files by worksheets (optional, default: false)
callbackURL: the url that will be called after the file is uploaded
ocrModel: the OCR model to be used for file processing (optional). Available models:
- Beethoven_ENG_O5.6 - OpenAI v6
- Beethoven_ENG_G5.5 - Gemini v5
- Beethoven_ENG_GP25 - Gemini Pro 2.5
- Beethoven_ENG_GP25.1 - Gemini Pro 2.5 v1
- Beethoven_ENG_GP25.2 - Gemini Pro 2.5 PDF
- Beethoven_CUS_O5.1 - Custom OpenAI v8
- Beethoven_CUS_O5.2 - Custom Gemini v13
- Unified (google-document-ai-ocr-gemini-v10) - Unified model
- Beethoven_ZH_O5.9 - Chinese OpenAI v9
- Beethoven_JP_O5.3 - Japanese OpenAI v3
- Beethoven_JP_G5.4 - Japanese Gemini fine-tuned
- Beethoven_TH_O5.1 - Thai OpenAI v1

schemaLocking: whether the schema should be locked after the file is uploaded, must be one of true or false
directoryId: the ID of the directory the file should be uploaded to
| Property | Type | Required | Description |
|---|---|---|---|
| fileName | string | Yes | Original name of the file including extension (e.g., “document.pdf”) |
| fileType | string | Yes | MIME type of the file (e.g., “application/pdf”, “image/jpeg”) |
| isSplit | boolean | No | Whether the file should be processed as separate pages/sections (default: false) |
| callbackURL | string | No | HTTP endpoint to receive processing completion notifications |
| ocrModel | string | No | OCR engine to use for text extraction. Available models vary by plan |
| schemaLocking | boolean | Yes | Whether to lock the schema after processing. Must be true or false |
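
The properties listed above can be assembled into a request body along the following lines. This is a minimal sketch, not the official client: the helper name `build_upload_request` is hypothetical, the body is assumed to be JSON, and the endpoint URL and authentication header are deliberately left out because this page does not state them.

```python
import json
from typing import Any, Dict, Optional


def build_upload_request(
    file_name: str,
    file_type: str,
    schema_locking: bool,
    *,
    is_split: bool = False,
    is_split_excel: bool = False,
    callback_url: Optional[str] = None,
    ocr_model: Optional[str] = None,
    directory_id: Optional[str] = None,
) -> Dict[str, Any]:
    """Assemble the request body using the property names documented above.

    Hypothetical helper: the function and its argument names are illustrative,
    only the dictionary keys come from the documentation.
    """
    if not file_name or not file_type:
        raise ValueError("fileName and fileType are required")
    if not isinstance(schema_locking, bool):
        raise ValueError("schemaLocking must be true or false")

    body: Dict[str, Any] = {
        "fileName": file_name,
        "fileType": file_type,
        "isSplit": is_split,
        "isSplitExcel": is_split_excel,
        "schemaLocking": schema_locking,
    }
    # Optional properties are only sent when a value was provided.
    optional = {
        "callbackURL": callback_url,
        "ocrModel": ocr_model,
        "directoryId": directory_id,
    }
    body.update({key: value for key, value in optional.items() if value is not None})
    return body


example_body = build_upload_request(
    "file.pdf",
    "application/pdf",
    False,
    callback_url="https://example.com/callback",
    ocr_model="Beethoven_ENG_O5.6",
)
print(json.dumps(example_body, indent=2))
```

Sending `isSplit` and `isSplitExcel` explicitly, even at their default of false, keeps the request valid whether or not the server treats them as required.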
API key for authentication
File name
"file.pdf"
File type
"application/pdf"
Is split
false
Is split excel - whether to split Excel files by worksheets
false
Callback URL
"https://example.com/callback"
OCR model
Available options: Beethoven_ENG_O5.6, Beethoven_ENG_G5.5, Beethoven_ENG_GP25, Beethoven_ENG_GP25.1, Beethoven_ENG_GP25.2, Beethoven_CUS_O5.1, Beethoven_CUS_O5.2, Unified (google-document-ai-ocr-gemini-v10), Beethoven_ZH_O5.9, Beethoven_JP_O5.3, Beethoven_JP_G5.4, Beethoven_TH_O5.1
"Beethoven_ENG_O5.6"
Schema locking
false
Directory Id
"649e2d2d2d2d2d2d2d2d2d2d"
Get a presigned upload URL for uploading a file. After receiving the result, send a PUT request to the presignedUploadURL with the binary file as the request body. The presignedUploadURL is valid for 300 seconds (5 minutes) and can be used multiple times.
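
The PUT step can be sketched with the Python standard library as shown here. The helper names `build_put_request` and `upload_file` are hypothetical, and sending a `Content-Type` header that matches `fileType` is an assumption; this page only states that the binary file is sent with a PUT method.

```python
import urllib.request


def build_put_request(
    presigned_upload_url: str, data: bytes, file_type: str
) -> urllib.request.Request:
    """Build a PUT request whose body is the raw file bytes."""
    return urllib.request.Request(
        presigned_upload_url,
        data=data,
        method="PUT",
        # Assumption: the presigned URL expects the same MIME type as fileType.
        headers={"Content-Type": file_type},
    )


def upload_file(
    presigned_upload_url: str, path: str, file_type: str, timeout: float = 60.0
) -> int:
    """Read the file from disk, PUT it to the presigned URL, return the HTTP status."""
    with open(path, "rb") as handle:
        data = handle.read()
    request = build_put_request(presigned_upload_url, data, file_type)
    # urlopen raises urllib.error.HTTPError for 4xx/5xx responses, for example
    # once the 300-second validity window of the URL has passed.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.status
```

Because the URL can be used multiple times within its 5-minute window, a failed upload can be retried with the same presignedUploadURL; after that, request a new one.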