How to delete OCR scraping Cache, in order make the second round work properly

Artur_Martirosyan · November 21, 2018, 12:48pm

Hello Everyone,
I am using OCR scraping method to get some information from PDF (picture type), Specifically I am using “scrape relative” function.
There is a loop in my automation, on the first round for the “PDF A” it works properly but on the second round for the “PDF B”, it brings some error such as “Assign: Index was outside the bounds of the array.” or " Scrape return empty text". But After running second time after the errors the “PDF B” also works properly, then comes the same errors for the “PDF C” and so on.
So I found out that it is because the bot saves some scraping Cache from the first round and because of this these errors occur. I tried to reassign back to empty string or to “Nothing” all the variables that I use or I scrape but not use, however still there is something that stays from the first round.
So how I can set the scraping OCR recording part to the initial stage like if you start the bot?
thank you in advance.

rado · November 21, 2018, 1:21pm

Hi,

It could help just create separeted workflow for OCR and run it with isolated session.

Artur_Martirosyan · November 21, 2018, 1:23pm

Hi Rado
But in reality, it could be hundred different pdf files. I can’t create separate workflow for each of them.

rado · November 21, 2018, 1:28pm

What i meant was to separate method which is OCR your document.

Artur_Martirosyan · November 21, 2018, 1:35pm

To be honest I don’t really get what you mean.

rado · November 21, 2018, 1:43pm

Ok, then please see attached picture which is showing what i was thinking of. Let’s just assume that DataTable contains paths to PDFs.

And remember to check Isolated in properties.
GetOcrText2

Artur_Martirosyan · November 21, 2018, 1:45pm

So you mean, to invoke the part that deals with OCR scraping and the check the box of isolated right?

rado · November 21, 2018, 1:46pm

Exactly.

Artur_Martirosyan · November 21, 2018, 1:47pm

Will try now, hopefully will help.
Will let you know
thanks

Artur_Martirosyan · November 21, 2018, 1:55pm

Thank you so much,
it worked
was working 2 days on this
Thanks a lot

rado · November 21, 2018, 2:05pm

No problem

system · November 24, 2018, 2:19pm

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Problem with scraping after first PDF processed Activities pdf , activities , data_scraping , question	5	1181	October 15, 2021
Looped "Read PDF with OCR" -- process breaks when attempting to use try catch for exception handling Help pdf , ocr , activities , data_scraping	8	6687	April 30, 2019
Problem with data scraping in PDF Activities pdf , activities , data_scraping , question	5	1148	October 18, 2021
Error at looping through pdf files of different formats to scrap data Help selector , pdf , activities , data_scraping	4	1220	December 23, 2019
Error in get text using ocr Learn	8	884	May 25, 2020

Most Active Users - Yesterday
Anil_G
ashokkarale
jinal.shah
Gautham_Pattabiraman
postwick
chandreshsinh.jadeja
vrdabberu
Ajay_Mishra
sven.wullum1
Vyshnavi_Nalumachu
More details...

How to delete OCR scraping Cache, in order make the second round work properly

Related Topics