Getting Tables from a Document

Learn how to extract tables from documents using the get and table keywords.

Syntax

get the document's tables

If the document contains several sections that resemble tables, they are returned as a list. The sections that follow show how to narrow the result down to a specific table.

Extracting a Specific Table

If a document contains multiple tables, you can specify which one to extract by referencing a column name:

get the document's tables whose columns contain "Column Name"
get the above as the items table

You can also use ordinal keywords such as first, second, third, or last to target a table by its position in the document:

get the document's first table
get the document's third table
get the document's last table

For more precise filtering, you can use multiple column names:

get the document's table whose columns contain "Column1", "Column2", and "Column3"
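
The selection rules in this section can be modelled outside the platform. The following is a minimal Python sketch, not the platform's implementation: the dictionary layout and the helper names `tables_with_columns` and `nth_table` are hypothetical, and it assumes that column names must match exactly and that every listed name must be present. This guide does not state how the platform handles case or partial matches.

```python
# Conceptual model only: each table is a plain dict with a "columns" list.
# The helper names below are hypothetical and are not platform keywords.

def tables_with_columns(tables, *required):
    """Keep the tables whose columns include every required name (exact match assumed)."""
    return [t for t in tables if all(name in t["columns"] for name in required)]

def nth_table(tables, position):
    """Pick a table by 1-based position; pass -1 to select the last table."""
    return tables[-1] if position == -1 else tables[position - 1]

tables = [
    {"name": "summary", "columns": ["Date", "Total"]},
    {"name": "items", "columns": ["Item", "Quantity", "Price"]},
    {"name": "taxes", "columns": ["Item", "Rate"]},
]

matches = tables_with_columns(tables, "Item", "Price")  # only "items" has both columns
first = nth_table(tables, 1)    # the "summary" table
last = nth_table(tables, -1)    # the "taxes" table
```

Listing more column names narrows the match, which is why the multi-column form is useful when several tables share a header such as "Item".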

Extracting Data With Directional Keywords

Directional keywords pinpoint specific lines in a document by their position relative to a reference line.

  • below: Looks beneath the reference line
  • above: Looks above the reference line
  • left: Looks to the left of the reference line
  • right: Looks to the right of the reference line

Example

get the document's first line which contains "recipe"
get the lines below that as the recipe text
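
As a rough mental model of the example above, the Python sketch below treats a document as a list of lines ordered top to bottom. It is an illustration under stated assumptions, not the platform's behavior: the function names are hypothetical, the search is assumed to be case-insensitive, and "below" is assumed to return every line after the reference line. The left and right keywords are omitted because they would need horizontal position data that this simple model does not have.

```python
# Conceptual model only: a document is a list of text lines, top to bottom.
# The function names are hypothetical and are not platform keywords.

def first_line_containing(lines, text):
    """Return the index of the first line containing `text` (case-insensitive by assumption)."""
    needle = text.lower()
    for index, line in enumerate(lines):
        if needle in line.lower():
            return index
    raise ValueError(f"no line contains {text!r}")

def lines_below(lines, index):
    """All lines after the reference line (assumed meaning of 'below')."""
    return lines[index + 1:]

def lines_above(lines, index):
    """All lines before the reference line (assumed meaning of 'above')."""
    return lines[:index]

document = ["Grandma's Kitchen", "Pancake Recipe", "2 cups flour", "1 egg"]

reference = first_line_containing(document, "recipe")  # index of "Pancake Recipe"
recipe_text = lines_below(document, reference)         # the lines under the reference
```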

Last updated 1 month ago
