Monite OCR
Maximize the potential of the Monite OCR engine.
Overview
Monite’s OCR provides a flexible, standalone endpoint for extracting text data from documents in multiple formats, including PDF, JPG, and PNG. It returns a structured JSON response with all detected text data for contextual analysis, enabling use cases such as receipt parsing, contract analysis, or custom form data extraction. The response retains the extracted content without post-processing, so you can integrate and manipulate the data according to your specific requirements.
This OCR endpoint is generic and is not meant to replace the existing OCR for Accounts Payable (AP).
Considerations and limitations
- OCR processing is asynchronous, and the processing time varies with the document size.
- The maximum image file size is 10 MB.
- Multipage PDF files can have up to 10 pages.
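The limits above can be checked on the client before a file is submitted. The following is a minimal sketch, not part of the Monite API: the function name `check_ocr_file` is hypothetical, the size limit is applied to every file type (the list above states it for images only), and "10 MB" is assumed to mean 10 MiB. The 10-page PDF limit is not checked, because that would need a PDF parsing library.

```python
from pathlib import Path

# Limits taken from the list above. The extension set mirrors the formats
# named in the overview; ".jpeg" is an assumed alias for JPG.
MAX_FILE_BYTES = 10 * 1024 * 1024  # assumption: "10 MB" means 10 MiB
ALLOWED_SUFFIXES = {".pdf", ".jpg", ".jpeg", ".png"}


def check_ocr_file(path: str) -> list[str]:
    """Return the problems found; an empty list means the file looks acceptable."""
    file = Path(path)
    problems = []
    if file.suffix.lower() not in ALLOWED_SUFFIXES:
        problems.append(f"unsupported format: {file.suffix or '(none)'}")
    if file.stat().st_size > MAX_FILE_BYTES:
        problems.append("file is larger than 10 MB")
    return problems
```

Calling `check_ocr_file("receipt.png")` before an upload lets you reject unsuitable files without spending an API call.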
Covered languages
Here is the list of languages that Monite OCR supports:
- Afrikaans
- Arabic
- Armenian
- Bulgarian
- Catalan
- Chinese
- Croatian
- Czech
- Danish
- Dutch
- English
- Estonian
- Finnish
- French
- German
- Greek
- Hungarian
- Icelandic
- Indonesian
- Italian
- Japanese
- Korean
- Lao
- Latvian
- Lithuanian
- Malay
- Norwegian
- Polish
- Portuguese
- Romanian
- Russian
- Serbian
- Slovak
- Slovenian
- Spanish
- Swedish
- Turkish
- Ukrainian
- Vietnamese
Process files via URL
To process a file and create an OCR task, call POST /ocr_tasks. The query parameter document_type is mandatory, and its possible values are invoice, credit_note, and receipt.
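As an illustration, the request could be assembled as follows. This is a sketch under stated assumptions rather than the documented request: the base URL, the bearer-token header, and the `file_url` body field are placeholders that this page does not specify, and any additional headers your Monite environment requires are omitted. Only the path and the `document_type` query parameter come from the text above.

```python
import json
import urllib.parse
import urllib.request

# Assumptions, not taken from this page: the base URL, the header names,
# and the "file_url" body field are placeholders to adapt to your setup.
BASE_URL = "https://api.example.com/v1"
DOCUMENT_TYPES = {"invoice", "credit_note", "receipt"}


def build_ocr_task_request(
    file_url: str, document_type: str, token: str
) -> urllib.request.Request:
    """Build (but do not send) a POST /ocr_tasks request."""
    if document_type not in DOCUMENT_TYPES:
        raise ValueError(f"document_type must be one of {sorted(DOCUMENT_TYPES)}")
    # document_type travels as a query parameter, as described above.
    query = urllib.parse.urlencode({"document_type": document_type})
    body = json.dumps({"file_url": file_url}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/ocr_tasks?{query}",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )
```

Passing the returned object to `urllib.request.urlopen` sends the request. Validating `document_type` locally surfaces a typo before the call is made.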
The successful response contains the information found in the file.
Upload from file
You can upload files in PDF, PNG, or JPG format by calling POST /ocr_tasks/upload_from_file. The query parameter document_type is mandatory, and its possible values are invoice, credit_note, and receipt.
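A file upload of this kind is typically sent as multipart/form-data. The sketch below builds such a request with the Python standard library only. It rests on assumptions this page does not confirm: the base URL and bearer-token header are placeholders, and the form field name `file` is a guess to verify against the API reference.

```python
import mimetypes
import urllib.parse
import urllib.request
import uuid

BASE_URL = "https://api.example.com/v1"  # placeholder, not a real Monite URL


def build_upload_request(
    filename: str, content: bytes, document_type: str, token: str
) -> urllib.request.Request:
    """Build (but do not send) a multipart POST /ocr_tasks/upload_from_file request."""
    boundary = uuid.uuid4().hex
    content_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    # One form part named "file" (assumed field name), then the closing boundary.
    body = b"\r\n".join([
        f"--{boundary}".encode(),
        f'Content-Disposition: form-data; name="file"; filename="{filename}"'.encode(),
        f"Content-Type: {content_type}".encode(),
        b"",
        content,
        f"--{boundary}--".encode(),
        b"",
    ])
    query = urllib.parse.urlencode({"document_type": document_type})
    return urllib.request.Request(
        f"{BASE_URL}/ocr_tasks/upload_from_file?{query}",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

In practice an HTTP client library that handles multipart encoding for you is usually more convenient. The manual version is shown here only to keep the example dependency-free.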
The successful response contains the id and other parameters of the file.
List all OCR tasks
To obtain a list of all OCR tasks, call GET /ocr_tasks.
Retrieve a specific OCR task
To obtain information about a specific OCR task, call GET /ocr_tasks/{task_id}.
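Because OCR processing is asynchronous, a client usually calls this endpoint repeatedly until the task finishes. The following is a minimal polling sketch, not a documented client. `fetch_task` stands for any function of yours that performs GET /ocr_tasks/{task_id} and returns the parsed JSON. The `status` field and the value `processing` are hypothetical, since this page does not list the task statuses.

```python
import time
from typing import Callable

# Hypothetical status values: this page does not list them, so check the
# actual response schema before relying on these names.
PENDING_STATUSES = {"processing"}


def wait_for_ocr_task(
    fetch_task: Callable[[str], dict],
    task_id: str,
    interval_s: float = 2.0,
    timeout_s: float = 120.0,
) -> dict:
    """Poll fetch_task(task_id) until the task leaves a pending status."""
    deadline = time.monotonic() + timeout_s
    while True:
        task = fetch_task(task_id)
        # A missing "status" key is treated as finished and returned as-is.
        if task.get("status") not in PENDING_STATUSES:
            return task
        if time.monotonic() >= deadline:
            raise TimeoutError(f"OCR task {task_id} still pending after {timeout_s}s")
        time.sleep(interval_s)
```

Injecting `fetch_task` keeps the loop independent of any particular HTTP client, which also makes it easy to exercise with a stub. Adjust the interval and timeout to the document sizes you expect.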