I’m currently facing an issue with converting my PDF file to an Excel sheet for some data manipulations. After searching, I resorted to converting it to a text file, which I now aim to transform into an Excel sheet. Despite trying various methods, I haven’t achieved the desired results. In the attached images, you can see the original PDF file converted to a text file and the intended format for the Excel file.
**The Desired Tabular form in Excel**
I would greatly appreciate it if someone could provide a detailed solution for this conversion. Alternatively, if there is an easier approach to convert PDF to Excel, please share your insights.
Thank you all for your cooperation and assistance.
To deal with PDF files (structured or unstructured), UiPath provides the Document Understanding package, with which you can extract data after labelling documents and training the pre-built model.
Document Understanding - DU
Could you please attach the PDF here, if possible, so that we can check alternative approaches? There is no straightforward method.
Also, you can have a look at this post from the Forum: PDF to Excel
Can you try the following expression? (Replace the previous one with the following.)
mc = System.Text.RegularExpressions.Regex.Matches(strData,"(?<Order_ID>[A-Za-z]+-\d+-\w+)\s+(?<Order_Date>\S+)\s+(?<Customer_Name>.*?)\s+(?<Country>\S+)\s+(?<Item>\S+)\s+(?<Price>\$\s+[.\d]+)\s+(?<Quantity>\d+)\s*")
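To see what this expression captures outside UiPath, here is a minimal sketch of the same pattern in Python. It is an illustration only, not the workflow itself: Python needs `(?P<name>...)` instead of .NET's `(?<name>...)`, and the two sample lines are hypothetical, made up to fit the pattern rather than taken from the attached PDF.

```python
import re

# Same pattern as the .NET expression above, ported to Python's
# named-group syntax (?P<name>...).
ROW_PATTERN = re.compile(
    r"(?P<Order_ID>[A-Za-z]+-\d+-\w+)\s+"
    r"(?P<Order_Date>\S+)\s+"
    r"(?P<Customer_Name>.*?)\s+"   # lazy, so multi-word names still work
    r"(?P<Country>\S+)\s+"
    r"(?P<Item>\S+)\s+"
    r"(?P<Price>\$\s+[.\d]+)\s+"
    r"(?P<Quantity>\d+)\s*"
)


def parse_rows(text):
    """Return one dict per matched order line, keyed by group name."""
    return [m.groupdict() for m in ROW_PATTERN.finditer(text)]


# Hypothetical sample text (assumed layout, not the real file).
sample_text = (
    "ORD-2023-A1 01/02/2023 John Smith Egypt Laptop $ 1200.50 2\n"
    "ORD-2023-B7 03/02/2023 Maria Lopez Garcia Spain Mouse $ 15.00 10\n"
)

rows = parse_rows(sample_text)
for row in rows:
    print(row)
```

Note that Country and Item are matched with `\S+`, so the pattern assumes each of them is a single word; a country such as "United States" would shift the columns.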
Dear Yoichi,
Sorry for asking too many questions.
Could you help me create a column named “Total Price” and calculate the total price for each order (Total Price = Item Price x Quantity)? How can I achieve this?
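For reference, the arithmetic itself can be sketched as follows in Python. This is only an illustration under assumptions: the rows are hypothetical, the Price values are assumed to be text such as "$ 1200.50" (as the regex above captures them), and `add_total_price` is a made-up helper name. In UiPath the same idea would be to add a "Total Price" column to the DataTable and fill it row by row.

```python
from decimal import Decimal


def add_total_price(rows):
    """Return copies of the rows with a "Total Price" entry added.

    Total Price = Item Price x Quantity. The "$" sign and surrounding
    spaces are stripped from Price before converting it to a number.
    """
    result = []
    for row in rows:
        unit_price = Decimal(row["Price"].replace("$", "").strip())
        quantity = int(row["Quantity"])
        result.append({**row, "Total Price": str(unit_price * quantity)})
    return result


# Hypothetical rows (assumed values, not from the real file).
sample_rows = [
    {"Order_ID": "ORD-2023-A1", "Price": "$ 1200.50", "Quantity": "2"},
    {"Order_ID": "ORD-2023-B7", "Price": "$ 15.00", "Quantity": "10"},
]

for row in add_total_price(sample_rows):
    print(row["Order_ID"], row["Total Price"])
```

`Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.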