Unwanted spaces when using "replace text in word document"

sanghavivatsal74 · January 8, 2022, 6:54pm

The task is to read text from PDF, extract only the relevant content, and lastly, replace some text within a word document with this new extracted relevant data. The workflow adopted by me -
“Read PDF Text” to get the pdf data in a string format - Everything works fine.
Use “matches” and configure regex to extract relevant content - successfully able to do it.
Use “Replace text in Document” to replace a string within word doc with the new extracted data - ISSUE LIES HERE!!!

WORD OUTPUT -

As you can see in the attached snapshots, words “accompanying” and “mental” are at the very end of sentence within PDF. When adding content to word, that is exactly where there are unwanted spaces. Please help!!!

sanghavivatsal74 · January 8, 2022, 6:55pm

PDF INPUT -

Vinit_Mhatre · January 8, 2022, 7:19pm

System.Text.RegularExpressions.Regex.Replace(str,“\s”,“”)
Try this

sanghavivatsal74 · January 31, 2022, 6:58pm

Wouldn’t that eliminate the spaces between title - “Brief description of Services” and the start of paragraph too ? I only want to eliminate the spaces which are weirdly inserted mid-sentence. I think this has something to do with the READ PDF TEXT activity which is reading each pdf sentence as a separate line and not as a continuous text. Hope this makes sense.

Topic		Replies	Views
Regex to remove space at the end Activities pdf	3	1128	November 21, 2021
Kill whitespaces in extraction Academy Feedback datatable , selector , uiautomation , activities , question	3	634	April 15, 2020
How to remove space between line in word doc? Activities activities , question , word	8	1592	March 4, 2022
Removing unwanted spaces in .docx files Activities activities , word	1	1051	October 19, 2020
New PDF activity reads extra space Help	2	1245	May 25, 2019

Unwanted spaces when using "replace text in word document"

Related topics