Monite’s OCR provides a flexible, standalone endpoint for extracting text data from documents across multiple formats, including PDF, JPG, and PNG. The Monite OCR returns a structured JSON response with all detected text data for contextual analysis, enabling use cases such as receipt parsing, contract analysis, or custom form data extraction. The response retains the extracted content without post-processing, allowing you to integrate and manipulate the data according to your specific requirements.
This OCR endpoint is a generic one and is not meant to replace the existing OCR for Accounts Payable (AP).
While this is not an exhaustive list, Monite OCR guarantees tested coverage for the following languages:
There are two ways you can send a document for OCR recognition:
The OCR scanning might take up to 10-15 seconds, so it is recommended to poll the document every 5 seconds. In the meantime, the document’s status field is set as processing and the recognized_data field is not populated yet.
The webhook ocr_task.finished is triggered once the OCR has finished processing the file and storing the extracted data. The webhook will be triggered only if the x-monite-entity-id header parameter is provided.
To process a file, thus creating an OCR task, call POST /ocr_tasks. The query parameter document_type is optional and its possible values are invoice, credit_note, and receipt. If document_type is not specified, the system will attempt to determine it automatically:
The 202 Accepted response contains the information found in the file:
If the file cannot be recognized, the system returns an error.
You can upload files in the PDF, PNG, or JPG format by calling POST /ocr_tasks/upload_from_file. The query parameter document_type is optional and its possible values are invoice, credit_note, and receipt. If document_type is not specified, the system will attempt to determine it automatically:
The sucessful 202 Accepted response contains the id and other parameters of the file:
If the file cannot be recognized, the system returns an error.
To obtain a list of all OCR tasks, call GET /ocr_tasks.
To obtain information about a specific OCR task, call GET /ocr_tasks/{task_id}.