Error at looping through pdf files of different formats to scrap data

Chaitanya_podilapu · December 20, 2019, 11:16am

Hi, I’m trying to scrap ocr data from multiple pdf files. All of the files are having the same content but are not on same format. Those are scanned pdf’s.
Here i’m having a thought

find the anchor image.
Where ever the image found, I want to scrape the data using OCR from the certain size of the region next to the anchor image from that pdf.

Help me here with the steps of workflow. Or any other suggestions will be appreciated.
Here is what i’ve tried so far.

tried to make a standard selector but getting some other data from ocr when it is working on other pdf files
Tried Anchor base activity but didn’t work.

Some times what i’m tring to scrap might be on top or middle or bottom of the pdf, So Even i try to read pdf with OCR it won’t work.

Palaniyappan · December 20, 2019, 11:19am

Hi
Did we try with CV activities

Cheers @Chaitanya_podilapu

Chaitanya_podilapu · December 23, 2019, 9:07am

Hey @Palaniyappan. How’s your weekend! Never tried CV elements before and I didn’t see any relevant activity useful to scrap data from pdf. Installed IntellegentOCR and MachineLearningExtractor packages. Not understanding how to use them and.
Inside CV scope i’ve got this error
response from this server not valid [404]
Coppied my api key in double quotes
and URL as “https://platform.uipath.com/[domainname]”
Gone through some topics but not rectified

Thank you!

Palaniyappan · December 23, 2019, 10:01am

Kindly have a view on this thread

@Chaitanya_podilapu

Chaitanya_podilapu · December 23, 2019, 10:17am

Yeah! But i’m already on latest version 2019.11.0 beta and recently updated and installed those packages. Tried with stable version also! But couldn’t clear that error

Topic		Replies	Views
Error after used Data Scraping for PDF TO CSV Help pdf	16	4195	June 14, 2018
PDF Data Scrapping issue Help pdf , activities , data_scraping , error	5	1157	November 30, 2019
Problem with data scraping in PDF Activities pdf , activities , data_scraping , question	5	1318	October 18, 2021
UI element error while screen scrapping Pdf file Help	11	1855	October 7, 2019
Screen Scraping multiple PDFs in a ForEach loop Help	6	2228	February 5, 2020

Most Active Users - Yesterday
prashant1603765
ashokkarale
mively
anjasing
Yoichi
sonaliaggarwal47
lrtetala
V_Roboto_V
pikorpa
sharazkm32
More details...

Error at looping through pdf files of different formats to scrap data

Related topics