Hello, I have been building a series of data scraping processes that take data from a Word document and output it to Excel. The files I have now been given are PDFs, and for consistency I am trying to keep all the inputs as Word documents. So far, I have tried reading the PDF, and then using a word …

Got a solution. A long and slow solution, but a solution nonetheless. I opened Word with Start Process, then selected Open and typed in the PDF file path. Once the file had loaded, I saved it as a .docx, closed it, and reopened it. Data can now be scraped from it, and it still has all the formatting/table…
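For anyone who would rather script the same open-in-Word-then-save-as-.docx steps than drive the UI, here is a rough Python sketch. It is not what was done in the workflow above. It assumes Windows, an installed copy of Word recent enough to open PDFs, and the pywin32 package. The function names `convert_pdf_to_docx` and `run_with_word` are made up for illustration, and setting `DisplayAlerts` to 0 is only an attempt to suppress Word's PDF conversion prompt, which may still appear on some setups.

```python
from pathlib import Path

# Values from Word's object model (WdSaveFormat / WdSaveOptions enums).
WD_FORMAT_DOCUMENT_DEFAULT = 16  # save in Word's default format (.docx)
WD_DO_NOT_SAVE_CHANGES = 0       # close without saving again


def convert_pdf_to_docx(word_app, pdf_path, docx_path):
    """Open a PDF in an already running Word instance and save it as .docx.

    word_app is a Word.Application COM object (hypothetical helper; it is
    passed in so the caller controls Word's lifetime).
    """
    # Word resolves relative paths against its own working directory,
    # so hand it absolute paths.
    pdf_path = Path(pdf_path).resolve()
    docx_path = Path(docx_path).resolve()

    # Documents.Open(FileName, ConfirmConversions, ReadOnly)
    doc = word_app.Documents.Open(str(pdf_path), False, True)
    try:
        # SaveAs2(FileName, FileFormat)
        doc.SaveAs2(str(docx_path), WD_FORMAT_DOCUMENT_DEFAULT)
    finally:
        doc.Close(WD_DO_NOT_SAVE_CHANGES)
    return docx_path


def run_with_word(pdf_path, docx_path):
    """Start Word, convert one file, and quit Word again."""
    import win32com.client  # from pywin32; Windows only

    word = win32com.client.Dispatch("Word.Application")
    word.Visible = False
    word.DisplayAlerts = 0  # assumption: reduces the conversion prompt
    try:
        return convert_pdf_to_docx(word, pdf_path, docx_path)
    finally:
        word.Quit()
```

Calling `run_with_word("input.pdf", "output.docx")` (placeholder file names) would start Word, convert one file, and quit. How well tables and formatting survive depends on Word's own PDF conversion, the same as with the manual route.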

Convert .pdf to .docx and keep tables/formatting

william.coulson (Will Coulson) March 17, 2020, 11:52am 3

Hi @msan,

I am trying to open the PDFs with Word, or convert the PDFs to .docx files, and scrape data from them.
