Extract Subdocument
Extracts a subdocument from a document or file based on page numbers or markers.
Overview
This procedure extracts a subdocument from a document or file. A subdocument is a contiguous range of pages within a larger document. You can specify the range to extract with page numbers or with markers that describe the content of the boundary pages. The extracted subdocument can then be processed independently in your automation workflow.
Make sure to add the Document Processing Book to your agent before using this automation procedure.
Examples
1. Extract by Page Numbers
In this example, pages 2 through 5 are extracted as a single subdocument.
extract subdocument from the document where
    the start page is 2
    the end page is 5
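To make the range semantics concrete, here is a minimal Python sketch of what "pages 2 through 5" selects. It is not the product's implementation: the function name `extract_by_page_numbers` and the list-of-strings page model are hypothetical, and the sketch assumes page numbers are 1-based and that the end page is included.

```python
def extract_by_page_numbers(pages, start_page, end_page):
    """Return pages start_page..end_page (assumed 1-based and inclusive)."""
    if start_page < 1 or end_page < start_page:
        raise ValueError("expected 1 <= start_page <= end_page")
    # Convert the 1-based inclusive range into a 0-based half-open slice.
    return pages[start_page - 1:end_page]


# Hypothetical 7-page document, one string per page.
document = [f"page {n}" for n in range(1, 8)]
subdocument = extract_by_page_numbers(document, 2, 5)
print(subdocument)  # ['page 2', 'page 3', 'page 4', 'page 5']
```

Under these assumptions the result contains four pages, not three, because both boundary pages are kept.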
2. Extract by Content Markers
Extracts from the page containing 'Introduction' up to, but not including, the page containing 'Conclusion'.
extract subdocument from the document where
    the start page marker is "Page containing the text 'Introduction'"
    the excluded end page marker is "Page containing the text 'Conclusion'"
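The effect of an excluded end marker can be sketched in Python as follows. This is an illustration only: `extract_before_marker` is a hypothetical name, literal substring search stands in for however the procedure actually interprets marker descriptions, and running to the last page when the end marker is missing is an assumption, not documented behavior.

```python
def extract_before_marker(pages, start_text, excluded_end_text):
    """Return pages from the start match up to, but not including, the end match."""
    # Literal substring search is a stand-in for real marker matching.
    start = next((i for i, page in enumerate(pages) if start_text in page), None)
    if start is None:
        raise ValueError("start marker not found")
    end = next(
        (i for i in range(start + 1, len(pages)) if excluded_end_text in pages[i]),
        len(pages),  # assumption: run to the last page if the end marker is absent
    )
    return pages[start:end]


# Hypothetical document, one string per page.
document = ["Title page", "Introduction: scope", "Methods", "Results", "Conclusion: summary"]
subdocument = extract_before_marker(document, "Introduction", "Conclusion")
print(subdocument)  # ['Introduction: scope', 'Methods', 'Results']
```

The page that matches the end marker is left out, which is what distinguishes this form from an inclusive end page marker.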
3. Extract with Size Limit
Extracts a single page, starting at the page that contains the specified form title.
extract subdocument from the file where
    the start page marker is "Page containing the text 'MERCHANT SERVICES BANK ACCOUNT CHANGE REQUEST FORM.'"
    the subdocument size is 1
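As a rough model of the size limit, the Python sketch below takes a fixed number of pages beginning at the start marker. The name `extract_with_size_limit` is hypothetical, substring matching again stands in for real marker matching, and the sketch assumes the size is counted in pages and includes the start page.

```python
def extract_with_size_limit(pages, start_text, size):
    """Return at most `size` pages, beginning with the page that matches start_text."""
    if size < 1:
        raise ValueError("size must be at least 1")
    start = next((i for i, page in enumerate(pages) if start_text in page), None)
    if start is None:
        raise ValueError("start marker not found")
    # Slicing past the end of the list is safe: Python truncates the slice.
    return pages[start:start + size]


# Hypothetical three-page file, one string per page.
file_pages = [
    "Cover letter",
    "MERCHANT SERVICES BANK ACCOUNT CHANGE REQUEST FORM. Account details",
    "Terms and conditions",
]
form = extract_with_size_limit(
    file_pages, "MERCHANT SERVICES BANK ACCOUNT CHANGE REQUEST FORM.", 1
)
print(len(form))  # 1
```

A size limit is useful when the end of the section is hard to describe but its length is known in advance, as with a one-page form.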
4. Extract by Page Numbers with an OpenAI Model
Extracts pages 2 through 5 from the transcript, with the OpenAI model set to gpt-4o-mini.
extract subdocument from the transcript where
    the start page is 2
    the end page is 5
    the openai model is "gpt-4o-mini"
5. Extract Section by Inclusive End Marker
Extracts from Section 2 start through Section 2 end (inclusive).
extract subdocument from the document where
    the start page marker is "The start of the Section 2"
    the end page marker is "The end of the Section 2"
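Because these markers describe a position rather than quote literal text, the Python sketch below models each marker as a predicate over a page. Everything here is hypothetical (the function `extract_through_marker`, the page strings, and the two predicates); the only behavior taken from the example is that the page matching the end marker is included.

```python
def extract_through_marker(pages, is_start, is_end):
    """Return pages from the first is_start match through the first is_end match."""
    start = next((i for i, page in enumerate(pages) if is_start(page)), None)
    if start is None:
        raise ValueError("start marker not found")
    end = next((i for i in range(start, len(pages)) if is_end(pages[i])), None)
    if end is None:
        raise ValueError("end marker not found")
    # The +1 keeps the page that matched the end marker (inclusive end).
    return pages[start:end + 1]


# Hypothetical document, one string per page.
document = [
    "Section 1: Overview",
    "Section 2: Terms",
    "Section 2 (continued)",
    "Section 2 (end): Signatures",
    "Section 3: Appendix",
]
section_2 = extract_through_marker(
    document,
    is_start=lambda page: page.startswith("Section 2:"),
    is_end=lambda page: page.startswith("Section 2 (end)"),
)
print(len(section_2))  # 3
```

With an excluded end page marker instead, the same boundaries would yield two pages, since the final Section 2 page would be dropped.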