Screen Resolution issues for PDF data extraction

Dominic · April 18, 2017, 7:32am

I am much aware of extracting PDF content in a text file/storing in a variable using activities available in UIPath (Read PDF text/with OCR), while I use this "Read PDF with OCR "activity for a simple readable PDF (content not as an image blocks) resulting with an error “Scrape returned empty text” or sometimes nothing will be extracted if the selected text doesn’t fit the screen or hiding behind/down. How do I solve this issue ?

Gabriel_Tatu · April 18, 2017, 1:18pm

is this happening with a specific pdf file or with any? can you try with a simple pdf created by you?
are you up to date with the pdf activities pack? and with Studio version also?

Changeponder_Tester · May 30, 2018, 7:53am

@Dominic, Me also facing the same kind of issues while using ‘Get OCR Text’ activity to scrape particular data from the PDF file when that particular data is non visible.Also, It sometimes scraping wrong data while changing the screen resolution.Means that PDF pixel position is not adjusting as per the screen resolution adjustment.Kindly let me know how to solve this issue?

Dominic · May 30, 2018, 8:11am

Hi Changeponder_Tester,

Earlier, I wasn’t aware of other workarounds like Read PDF with text and making a string manipulation and so on. Finally I have made it to work with the help of some activities like Anchor base, Find Relative Element. Also do note that we proceed further with an assumption of fixed screen resolution.

Changeponder_Tester · May 30, 2018, 11:41am

Hi Dominic, Fixed resolution is working fine, Moreover i am expecting Permanent solution to work with any resolution.Actually i have used Get OCR Text activity with ‘Google OCR engine’ i. It is scraping the accurate data when corresponding data is visible and screen resolution is not changed,Else it throws error like ‘Scrape returned empty text’ or returning the wrong value. Is there any best way to scrape from PDF? Also, is there any best OCR engine to scrape data from the PDF when corresponding data is non visible?

Gabriel_Tatu · May 30, 2018, 4:01pm

Are you scraping from PDF directly or using PDF activities?

Changeponder_Tester · May 31, 2018, 5:52am

@Gabriel_Tatu, I am scraping data from the PDF directly using anchor base with Get OCR text activity. Because my PDF is scanned format,So, i can’t Use Find Element,etc…I can use only OCR and Image based activities alone for getting particular data.Suppose that particular data is non visible then getting pbm. Means that particular data is present in the current page alone but have to scroll for visible it.Is there solution for this one?

Gabriel_Tatu · May 31, 2018, 6:34am

Yes, use the activities

Changeponder_Tester · August 28, 2018, 9:13am

I have solved this one using Read pdf text and Read pdf with OCR text activity. Actually scraped all the data’s and then applied string manipulation techniques to get necessary data.

SriRana · October 9, 2018, 6:49pm

Hello guys,

I am actually having issues with extracting accurate data using ‘get text’ from PDF

my PDFs are not downloaded into my system they popup when I click View and from there I extract data, sometimes I get accurate data and sometimes its not I just get wrong invoice amount.

I cant use Read PDF as PDFs are not downloaded, and PDF is not image based doc so no OCR. I did read about string manipulation technique from couple posts can you example me how this works with ‘Get Text’ I am a beginner at UiPath.

Thanks,
Sri

Topic		Replies	Views
PDF error while changing the screen resolution Help	0	1070	May 31, 2018
Read specific part of PDF with OCR Studio	1	1375	June 6, 2020
Screen scraping is not capturing the text from the PDF Studio	6	1184	April 11, 2020
PDF - Scrape Text Returns Empty Text Help pdf , ocr , activities	7	8130	October 29, 2018
PDF Data Extraction Using OCR Help pdf , ocr , activities , question	2	1827	November 16, 2019

Screen Resolution issues for PDF data extraction

Related topics