Document Extraction

Recent Requests
Log in to see full request history
TimeStatusUser Agent
Retrieving recent requests…
LoadingLoading…

Overview

Send the image with a POST request to the Extract API endpoint and FormX will recognize the information from the document.

FormX will use the extractor of your choice to extract and return the data in a JSON format. The extractor can be specified by the extractor_id parameter. An Access Token should also be included. They can be obtained from the web portal dashboard.

Image Sizing Recommendations

The images submitted to the extraction API should be of sufficient size so that the text and features can be easily distinguished. To obtain the best results, it is recommended to use an image where each side is between 1000 and 2000 pixels, or a minimum of 100 DPI.

Common image file types such as JPEG, PNG, PDF are supported. For the full list, please check out our guide Here.

Limits

The file size limit of the image is 10MB if you upload the file via body param.

If you provide the file via image_url, there's no file size limit.

The image dimension limit is 10,000 pixels on one side and total number of pixels < 75M, this applies to all upload methods.

Using the Asynchronous mode

If the request takes too long to complete, you can use the asynchronous mode to avoid timeout. This can be enabled by the X-WORKER-ASYNC parameter in the header.

Job ID

If the async job is successfully created, a 202 Accepted response will be returned with the job_id and request_id

202 Accepted 
{
    "job_id": "<string>",
    "request_id": "<string>",
    "status": "ok"
}

To get the extraction result, please refer to Get async extraction result

Body Params
Headers
string
required

Extractor ID

string

URL of the image to be processed in jpg or png format, or URL of a pdf file

string
enum
Defaults to raw

Encoding of the request body, allowed raw or base64

Allowed:
number
Defaults to 150

DPI of the uploaded pdf file

string
enum

OCR Engine

Allowed:
string

original file name

string
enum
Defaults to false

Flag for using the asynchronous mode

Allowed:
string
enum
Defaults to true

Flag for auto adjusting image size for better extraction result, it will take a longer for extraction if enabled

Allowed:
string
enum
Defaults to false

Flag for returning OCR result in the response.

Allowed:
string
enum
Defaults to per-page

Processing mode: If set, it will override the processing mode setting on the specified extractor. Please note that the per-file mode is not supported by all models. When it is not applicable, only the first page of the file will be processed.

Allowed:
string
length ≤ 2048

Pass extra data as ASCII string to FormX. For structured data, consider represent it as JSON. For string containing non-ASCII string, consider encode it with base64

string
enum
Defaults to false

should import to accuracy center

Allowed:
string
enum
Defaults to image/*

Generated from available request content types

Allowed:
Responses

Language
Credentials
Header
URL
LoadingLoading…
Response
Click Try It! to start a request and see the response here! Or choose an example:
application/json