Question
Is there a component or a function that supports the following processes for PDF files?
- Extracting specific words from a PDF file.
- Splitting the PDF file into sections up to and after the page containing the extracted words, and then linking the split files.
Answer
HULFT Square does not have a component or function that extracts specific words from a PDF file, or that splits a PDF file and links the resulting files together.
As a workaround, word extraction from a PDF file can be implemented with the Google Drive Connector as follows.
1. Upload the PDF file to Google Drive, open it with Google Docs, extract the text, and save it as a text file.
2. In HULFT Square, go to DATA INTEGRATION > Connections and create a connection resource for Google Drive, selecting Google Workspace as the connector and Google Drive OAuth Authentication as the type.
3. Create a script and read the text file saved in step 1 with the Read File/Folder operation of the Google Drive Connector (Cloud > Google Drive > Read File/Folder).
4. Extract the specific words contained in the text file with Mapper.
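For reference, the matching logic of the final extraction step can be illustrated outside HULFT Square. The following is a minimal Python sketch, not HULFT Square or Mapper code. The function name `find_words` and the sample text are hypothetical, and it assumes that a case-insensitive substring match is an acceptable definition of "contains the word".

```python
def find_words(text, targets):
    """Return (line_number, word) pairs for every target word found in text.

    Hypothetical helper for illustration only. Matching is a
    case-insensitive substring search, which is an assumption.
    """
    hits = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        lowered = line.lower()
        for word in targets:
            if word.lower() in lowered:
                hits.append((line_no, word))
    return hits


# Sample text standing in for the file exported from Google Docs.
sample = "Invoice No. 123\nTotal amount: 500\nThank you"
matches = find_words(sample, ["invoice", "total"])
```

In this sketch `matches` holds one entry per line and word that matched, which corresponds to the kind of result the Mapper step is expected to produce from the text file.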
For splitting PDF files, please consider using an external service such as Adobe Acrobat.
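If the text of each page is available separately, the page ranges to pass to such an external tool can be worked out in advance. The following is a minimal Python sketch under that assumption. It does not split a PDF itself, the function name `split_ranges` is hypothetical, and it assumes the page containing the word belongs to the first part. Note that text exported through Google Docs may not preserve the original page boundaries.

```python
def split_ranges(page_texts, keyword):
    """Return ((first_start, first_end), (second_start, second_end)) as
    1-based page ranges, split after the first page containing keyword.

    Hypothetical helper for illustration only. The second range is None
    when the keyword is on the last page; the result is None when the
    keyword is not found on any page.
    """
    total = len(page_texts)
    for page_no, text in enumerate(page_texts, start=1):
        if keyword.lower() in text.lower():
            head = (1, page_no)
            tail = (page_no + 1, total) if page_no < total else None
            return head, tail
    return None


# Sample per-page text standing in for a four-page document.
pages = ["cover", "summary section", "details", "appendix"]
ranges = split_ranges(pages, "Summary")
```

Here `ranges` gives pages 1 to 2 for the first part and pages 3 to 4 for the second part, which can then be entered in the external tool that performs the actual split.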