I have applied all the settings described under “Change the reading options” for the PDF.
Why does the program not recognize the individual numbers, but only the entire number block? I also pressed F4 and tried all the variants, but none of them works.
Does Data Scraping work with PDF documents? I get the following error message: “This control does not support data extraction. Please select a table cell”.
We have also observed that Data Scraping does not work for some PDF documents.
Could you let us know whether the PDF is a digital document or a scanned one? If it is a digital document, we could instead extract the data using the PDF activities together with String/Regex manipulation.
Let us know if a different approach is possible in your case. Samples of the document, or of the data extracted from the PDF, would also help us analyse the problem better.
Use the Read PDF Text activity with PreserveFormatting set to True. The output is a String, stored for example in a variable pdfText.
We can now use Regex operations on this text to get the data you need. First we identify the pattern present in the data. The observable pattern is that each item in the table is separated from the next by at least two spaces. Hence, we can use this pattern to capture the values separately.
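The matching idea can be sketched outside UiPath as follows. This is only an illustration in Python, not the workflow itself: the sample text, the names pdf_text, CELL and split_cells, and the exact pattern are assumptions, since the real layout of your PDF is not known. The pattern treats a cell as a run of non-space characters that may contain single spaces, so two or more consecutive spaces end the cell. The same pattern syntax should also be accepted by the .NET regex engine that UiPath uses, but please verify it against your own data.

```python
import re

# Hypothetical sample standing in for the String returned by Read PDF Text;
# the real column layout of the poster's PDF is an assumption.
pdf_text = (
    "Item        Qty    Price\n"
    "Blue pen    12     1.50\n"
    "Notebook    3      4.25\n"
)

# A cell is a run of non-space characters that may contain single spaces.
# Two or more consecutive spaces therefore act as the column separator.
CELL = re.compile(r"\S+(?: \S+)*")


def split_cells(line):
    """Return the cells found in one line of text."""
    return CELL.findall(line)


# One list of cells per non-empty line.
rows = [split_cells(line) for line in pdf_text.splitlines() if line.strip()]

for row in rows:
    print(row)
```

Note that a value containing a single space, such as "Blue pen", stays in one cell, which is why the separator has to be two or more spaces rather than any whitespace.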
Next, we use a For Each activity to iterate through the matches and add each one as a row to OutputDT.
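The loop that fills the table can be sketched in the same way. Again this is a Python illustration under assumptions, not the UiPath workflow: build_table, the column names and the sample text are hypothetical, and a list of dictionaries stands in for the DataTable OutputDT. Lines that do not yield the expected number of cells are skipped, which is one simple way to ignore headings and blank lines.

```python
import re

# Same assumption as before: cells are separated by two or more spaces.
CELL = re.compile(r"\S+(?: \S+)*")


def build_table(pdf_text, columns):
    """Return one dict per line that has exactly len(columns) cells."""
    table = []
    for line in pdf_text.splitlines():
        cells = CELL.findall(line)
        if len(cells) != len(columns):
            # Blank lines, headings, or anything that is not a full row.
            continue
        table.append(dict(zip(columns, cells)))
    return table


# Hypothetical sample text and column names.
sample = "Invoice 2024\n\nBlue pen    12     1.50\nNotebook    3      4.25\n"
output_dt = build_table(sample, ["Item", "Qty", "Price"])

for row in output_dt:
    print(row)
```

In the workflow itself, the equivalent of table.append would be adding a data row to OutputDT inside the For Each body.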
Visuals from Debug :
The above approach is based on an assumption about your data pattern; we haven’t yet properly analysed your data.
Please try the approach described and let us know if you are not able to get the required output, including any error message you receive.