Logo Parsio Knowledge Base

OCR Parsing of PDF Files and Images

Parsio can extract data from text PDFs, scanned PDFs and images. It uses Machine Learning for OCR and data extraction. There is a set of prebuilt AI models to automatically extract data from some commonly-used document types: Invoices Receipts Busine ...

Create Your First Template

After creating a mailbox and forwarding your first email you'll need to tell Parsio what data you want to extract. It's time to create your first template. In most of the cases this process is simple and straightforward. 1. From the Inbox page, o ...

HTML / Raw / Preview Tabs

Parsio's template page has 3 tabs: HTML, Raw and Preview. This short article explains the difference between them. The HTML tab displays the rendered version of your email. It allows you to highlight the valuable ...

How to Fix Unparsed (Failed to Process) Documents

Before going further, it's important to understand some basics of how Parsio works. When you create a template and select fields to extract, Parsio will "remember" the context, e.g. the data (either plain text or HTML tags) right before and after each fi ...

Regular vs Meta Fields. Field Selection Rules

Parsio allows you to extract any text data from your emails and documents. Sometimes you may also want to extract metadata: sender's email address, email received date and time etc. Regular fields Regular fields is probably what you will ...

Constant Fields

While regular fields are used to extract dynamic data from incoming emails, PDF files, etc., constant fields allow you to set predefined (constant) values. This is helpful when you need to "tag" some documents or export a value that doesn't exist in docum ...

Extracting Tables and Repetitive Data

Being able to parse emails with tables or repetitive data can be quite helpful when you don’t know in advance the number of items you want to extract from an email. Here’s when table parsing can come to your rescue. Extracting repetitive data ...

Field Types. Data Formatting and Normalization

Data normalization is the process of structuring parsed data. You may want to eliminate unwanted spaces and special characters, convert comma-separated numbers to decimal-separated etc. This can be done using Field Types. Parsio supports 9 built-in types ...

Field Mask

When you create a field, Parsio will remember the "context" around it to be able to parse similar emails. By default, Parsio will only look for the context and match any found field value. However, in some cases it's useful t ...

Processing PDF Files

Importing PDF Files to Parsio Parsio provides 4 main options to import PDF files for processing: Send them as email attachments. Upload them manually from the "Inbox" page. Download from the external link. Import using the API. The easiest way i ...

Processing CSV and Excel Files

CSV and Excel files can be uploaded or sent as email attachments. To upload a file, click on the “Import documents” button in the upper right corner of your inbox and drop the files that you want to be parsed. CSV and Excel files are parsed ...

Automatically Collect Emails

Parsio can automatically find and collect contacts from all your incoming emails. It processes email fields (From, To, Cc, Reply to), as well as the email body itself. To set up the automatic email collection, open your Mailbox and go to Settings: The ...

Optional (Empty) Fields

Sometimes you want to mark a few fields as "optional" or "empty" because they don't exist in some of your emails. By design, Parsio must find all the fields in your document to consider it as successfully parsed. However, there are two main solutions to ...

FAQ & Troubleshooting

How to send all my old emails from Gmail to Parsio? If you are using the Gmail web interface, you can forward all your old emails to Parsio at once. To do that, select all the emails you need to send, then click the three dots icon and select "Forward a ...