Pdf extraction data

suraj_gaikwad · May 19, 2023, 5:49am

Hello,

I’m trying to extract text from multiple pdf files but specific text only so i have done till reading all pdf files by using Document understanding so how I do get that text only which I want using Document understanding

These are ss of du and I’m writing pdf text in text file but there is an problem is that first pdf reading it’s write in write text den for second pdf writing and data remove from text file

vrdabberu · May 19, 2023, 5:53am

Hi @suraj_gaikwad

As the write text file is within the for each loop it creates a new file at each run time so remove the write text file activity from the for each loop and create a file at the start of the program.

Hope it works !!

yikwen.goo · May 19, 2023, 5:54am

You could actually get a DataTable containing the extraction results using the Export Extraction Results activity, then get the header fields and table fields by using DataSet.Tables("Simple Fields") and DataSet.Tables("Line Items")

Thereafter just use Excel Write Range activities to create one Excel file per PDF, so that the data is not overwritten.

suraj_gaikwad · May 19, 2023, 5:59am

We don’t want to write in Excel sheet

suraj_gaikwad · May 19, 2023, 6:02am

In data extraction scope it’s selected all pdf page that’s why it’s writting all pages so how to remove that bcz I’m trying to remove it’s not working

suraj_gaikwad · May 19, 2023, 9:34am

Dat is not extracting after write text outside of try catch ?

yikwen.goo · May 22, 2023, 2:25am

The checkbox just means which fields you’ve validated in validation station. Could you please explain the problems you’re facing in greater detail?

Dilip_Wakdikar_1996 · May 22, 2023, 3:44am

Hello Suraj,
Got To Data Extraction Scope Activity and click on Configure Extractor. Inside that you would be able to select what field you want to extract and what not.

suraj_gaikwad · May 22, 2023, 11:24am

I have done this by using match text not du bcz nota able to get output

system · May 25, 2023, 11:25am

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Document understanding solution Documentation docs , question	17	1075	May 8, 2023
Extract data from different files Help	5	996	January 2, 2020
Extract document data y for each file in folder Activities pdf , help , for-each-file , extract-document-data	5	48	March 21, 2025
Extract pdf data to excel Studio studio , question , activities_panel , read-pdf , extract-pdf	2	588	July 17, 2023
Extract PDF data to excel without Document Understanding Activities excel , studio , regex , question , pdf-extraction , pdf-conversion	4	661	June 2, 2023

Pdf extraction data

Related topics