An extraction template defines the structure of data you want to extract from documents. It specifies which fields to extract, their data types, and how to transform the data into your required format.
You can manually edit the template schema and adjust it to fit your specific document types.
Templates are flexible and customizable. You can create multiple variations of the same document type and use AI Instructions to guide extraction for each variation.
The AI Instructions is a prompt that guides the AI on how to extract the data from the document.
Fields represent individual data points extracted from documents (e.g., invoice date, vendor name, total amount).
Tables represent collections of related data points (e.g., line items in an invoice, transactions in a bank statement).
Each field has a type that determines how its value is obtained:
Each field also has a return type that defines the data format:
Manual fields require you to enter data manually. They are not extracted by AI. Use them for comments, notes, or other data that isn't present in the document.
AI extractible fields extract data from documents using AI. Use AI Instructions to specify how the AI should extract each field.
Use cases:
For every numeric field in a Table, a Sum field is automatically generated. It calculates the total of all values in that field.
Formula Code runs after AI extraction. Use it for:
You can write custom code or use the built-in AI assistant to generate it.
Field-level formulas: Create a field in the Fields section that calculates values based on other fields or table data. You can modify other fields, but it's recommended to return a value (e.g., a boolean indicating success).
Table-level formulas: Create a field in the Tables section that calculates values for each row in the table.
Notifications are displayed in the UI as a badge in the notification bell and on the document page.
Use Enum for fields with predefined categories. Define each category as an enum option, then use AI Instructions to guide the AI on how to map document values to the correct enum option.