LogoLogo
About
  • Home
  • Guides
  • BDK
  • REST API
  • Release Notes
  • Airtable
    • Fetching Airtable Data
    • Creating and Modifying Airtable Records
    • Creating Airtable Reports
    • Exporting Airtable Reports
    • File Management
    • Submit Airtable Records for Approval
  • AWS S3
    • Listing Files
    • Retrieving Files
    • Uploading Files
    • Uploading CSV
    • Uploading Strings
    • Deleting Files
    • Deleting CSV
  • Azure Blob Storage
    • Uploading Files
    • Retrieving Files
  • Azure Service Bus
    • Sending Messages to Azure Service Bus Queue
    • Receiving Messages from Azure Service Bus Queue
  • Azure Translate
    • Translating A Document
  • Database
    • Retrieving Database Records
    • Updating Database Records
    • Creating Database Records
    • Deleting Database Records
  • Document Processing
    • Document Processing: Workflow Patterns and Best Practices
    • Document Processing with OCR
    • Extract Data from a Document
    • Extract Pages from a Document
    • Extract Tables from a Document
    • Extract a Subdocument
    • Extract Subdocuments
    • Getting Fields from a Document
    • Getting Tables from a Document
  • Google Cloud Storage
    • Dumping Files to Google Cloud Storage
    • Purging Files from Google Cloud Storage
    • Dumping CSVs to Google Cloud Storage
    • Purging CSVs from Google Cloud Storage
    • Enriching Questions
  • HTML
    • Extracting Tables from HTML
  • HTTP
    • GET Request
    • POST Request
    • DELETE Request
    • PATCH Request
    • PUT Request
    • HEAD Request
  • Hubspot
    • Fetching Data from Hubspot
  • Microsoft Excel
    • Opening Excel Files
    • Converting a File to Excel Format
    • Getting a Table from Excel
    • Converting Tables to Excel Workbooks
    • Renaming Excel Files
    • Creating a Merged Spreadsheet from Multiple Files
    • Getting an Excel Worksheet's Row
    • Getting an Excel Worksheet's Row's Cell
    • Getting an Excel Worksheet Cell's Column Label
    • Setting an Excel Worksheet Cell to a String
    • Getting an Excel Worksheet's Changelog
    • Apply Changelogs to an Excel Worksheet
    • Saving An Excel Worksheet
  • Microsoft Outlook
    • Email Operations
    • Getting Schedules
    • Moving Emails
  • Microsoft Power BI
    • Working with Power BI Objects
  • Microsoft SharePoint
    • Retrieving SharePoint Items
    • Uploading a File to SharePoint
    • Converting Data to SharePoint String
    • Moving a File to a Folder
    • Deleting a Folder
    • Getting a Folder
    • Uploading a String to a Folder
    • Creating and Retrieving a Folder
    • Deleting a File
  • Oracle Cloud Fusion
    • Fetching Data from Oracle Cloud Fusion
    • Adding New Records
    • Updating Records
    • Deleting Oracle Fusion Objects
    • Managing Attachments
    • Downloading Files from Oracle Fusion Objects
    • Detaching Files from Oracle Fusion Objects
  • Paycom
    • Working with Employee Details
    • Managing Punch Entries
  • Salesforce
    • Working with Salesforce Objects
    • Submitting Sales Objects for Approval
    • Creating Salesforce Reports
    • File Management
  • Servicenow
    • Retrieving Data
    • Manipulating Data
  • Slack
    • Sending Slack Messages
    • Reading Slack Messages
  • Stripe Pay
    • Fetch Data from Stripe
    • Add New Records
    • Working with Invoices
    • Deleting Objects from Stripe
  • Zendesk
    • Creating a Ticket
    • Updating a Ticket
    • Assigning a Ticket
    • Deleting a Ticket
    • Getting a Ticket
Powered by GitBook
On this page
  • Prerequisites
  • Learning the Document Processing Book
  • Introducing Documents and File Objects
  • To Upload a Local File
  • Video Example: Extracting Information

Was this helpful?

Export as PDF

Document Processing

Learn how to process documents and extract information from them in Kognitos.

Last updated 1 month ago

Was this helpful?

Prerequisites

Learning the Document Processing Book

To begin extracting information from documents, follow these steps to first learn the Document Processing Book:

  1. In the left sidebar, click on Books.

  2. Use the search bar to find "Document Processing."

  3. Click on +Book to open the Add New Book pop-up.

  4. Click on Add to finish adding the book.

Add the Document Processing Book

Introducing Documents and File Objects

To introduce a document or file object in your automation, use any of the following lines:

the file
get the file
use the file
the document
get the document
use the document

These lines will raise a Question in Kognitos, prompting you to Please provide the file or document, as the system requires the specified object to proceed.

To Upload a Local File

  1. Click on Select a method to open the drop-down menu.

  2. Select Upload files. Refer to the table below for supported file types.

  3. Upload your file or document and click Submit.

After uploading, you can perform additional operations on the document or file within the platform.

Category
Supported File Types for Upload

Text Documents

.pdf, .docx

ERP Document

.edi

Image

.jpeg, .jpg, .png, .tif, .tiff

Data

.txt, .json, .yml, .yaml, .csv

Spreadsheets

.xlsx, .xls, .csv

HTML

.html

Email

.eml

Audio

.mp3, .wav

📘 Working with .tif and .tiff Files

Use the following syntax to upload and convert .tif and .tiff files to PDF format:

read the file as a pdf

Once uploaded and converted, you can use PDF automation procedures on the file object.

Video Example: Extracting Information

This video walks through an example of extracting information from an SAP Sales Order.

Troubleshooting Performance

If your process is taking longer than usual, compare the document sizes with previous runs to identify any changes that could be affecting performance.