Extract table from PDF - Document Understanding

Hi @Yoichi,

I attempted to use Document Understanding, but my expertise in this area is limited.

Please review the attached PDF and the Excel output from my workflow.

Also, I found a website where I can convert PDF to excel and it exactly converted the table into excel. Please see 3rd Party Website.xlsx (23.3 KB) (Table 4). That’s the output i’m trying to do in UiPath.

Is this a native PDF? If so, pls try extract table using PDF activities. If it didn’t work, Please look at this thread. It will help you in extraction the table from PDF using document understanding. How to Extract PDF Table through Document Understanding in UiPath

Will try this on my workflow, i just need to change the extractor from Machine Learning extractor to Form extractor. Ty!

Hi @Ralph_Sena26
please find workflow solution I have used form extractor which can extract table from the given pdf
Document_Understanding.zip (559.1 KB)
Let me know if its work for you

1 Like

@Akash_Javalekar1 I tried using the form extractor, but the issue is that the number of rows in the table can vary with each invoice. Is there a way to extract table rows that change dynamically per invoice?

@Ralph_Sena26 you can go with machine learning extractor as in your given pdf having two table you need to train in AI center and this will extract even if rows dynamically change