Extract data from documents
Extraction uses AI to pull structured data from unstructured documents.
Run an extraction job
- Go to Extractions in the sidebar
- Click New Extraction
- Select an extraction model (or create one)
- Choose the documents to process
- Click Run
OCR to Excel
For scanned documents or images, use OCR to Excel to convert them into structured spreadsheets.
Best for:
- Scanned bank statements
- Image-based PDFs
- Tables in scanned documents
Validate and export
After extraction, review the results alongside the source document:
- Open the extraction results
- Click any value to see its grounding in the source document
- Correct any errors directly in the interface
- Export to Excel once the results are correct
Batch extraction
To process many similar documents at once:
- Create or select an appropriate extraction model
- Select multiple documents
- Run extraction as a batch
- Review results in bulk
Processing limits
Extraction processes only the first 200 pages of each document. Split longer documents into parts of 200 pages or fewer before processing.
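As a rough illustration of splitting around the 200-page limit, the sketch below computes the page ranges for each part. It is not part of the product: the function names chunk_page_ranges and split_pdf are hypothetical, and split_pdf assumes the documents are PDFs and that the third-party pypdf library is installed.

```python
from pathlib import Path
from typing import List, Tuple

PAGE_LIMIT = 200  # page limit stated in this section


def chunk_page_ranges(total_pages: int, limit: int = PAGE_LIMIT) -> List[Tuple[int, int]]:
    """Return 1-based inclusive (first, last) page ranges of at most `limit` pages each."""
    if total_pages < 0:
        raise ValueError("total_pages must not be negative")
    if limit < 1:
        raise ValueError("limit must be at least 1")
    return [
        (start, min(start + limit - 1, total_pages))
        for start in range(1, total_pages + 1, limit)
    ]


def split_pdf(path: str, limit: int = PAGE_LIMIT) -> List[Path]:
    """Hypothetical helper: write one PDF per page range next to the original file.

    Assumes the input is a PDF and that pypdf is available; the import is kept
    inside the function so the range calculation above works without it.
    """
    from pypdf import PdfReader, PdfWriter

    source = Path(path)
    reader = PdfReader(source)
    outputs = []
    for first, last in chunk_page_ranges(len(reader.pages), limit):
        writer = PdfWriter()
        # pypdf pages are 0-indexed, so shift the 1-based range by one.
        for index in range(first - 1, last):
            writer.add_page(reader.pages[index])
        out_path = source.with_name(f"{source.stem}_p{first}-{last}.pdf")
        writer.write(out_path)
        outputs.append(out_path)
    return outputs


# A 450-page document would need three parts.
print(chunk_page_ranges(450))  # [(1, 200), (201, 400), (401, 450)]
```

Each resulting part can then be uploaded and processed as its own document. A document of exactly 200 pages yields a single range and needs no splitting.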