Document Extraction

Overview

Send the image with a POST request to the Extract API endpoint and FormX will recognize the information from the document.

FormX will use the extractor of your choice to extract and return the data in a JSON format. The extractor can be specified by the extractor_id parameter. An Access Token should also be included. They can be obtained from the web portal dashboard.

Postman Example

You can use the Postman app to test the API. Remember to set the Access Token and Extractor ID variables in the collection scope to configure your extractor.

Link to Postman Collection

Image Sizing Recommendations

The images submitted to the extraction API should be of sufficient size so that the text and features can be easily distinguished. It is recommended to use an image minimum of 1000x750 pixels or 100 DPI.

Common image file types such as JPEG, PNG, PDF are supported. For the full list, please check out our guide Here.

Limits

The file size limit of the image is 10MB if you upload the file via body param.

If you provide the file via image_url, there's no file size limit.

The image dimension limit is 10,000 pixels on one side and total number of pixels < 75M, this applies to all upload methods.

Using the Asynchronous mode

If the request takes too long to complete, you can use the asynchronous mode to avoid timeout. This can be enabled by the X-WORKER-ASYNC parameter in the header.

Job ID

If the async job is successfully created, a 202 Accepted response will be returned with the job_id and request_id

202 Accepted 
{
    "job_id": "<string>",
    "request_id": "<string>",
    "status": "ok"
}

To get the extraction result, please refer to Get async extraction result