Hi Everyone,
I want to get the amounts from a PDF file (1927.00 and 2100.00). Their position in the document is always the same, but when I split the extracted text, the value I get is not accurate. Please refer to the attached screenshot.
Thank you
Hi @rpaforum
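Use Regex to extract the amount directly, for example:
\d+(?:,\d{3})*\.\d{2}
Note the escaped dot (\.): an unescaped . matches any character, so the pattern can stop short on a value like 1927.00 and return 1927. This version matches the amounts with or without thousands separators (1927.00 or 1,927.00, 2100.00 or 2,100.00), regardless of where they fall in the extracted text.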
If needed, switch to Read PDF with OCR for more consistent text extraction.
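To make the regex idea concrete outside of UiPath, here is a minimal Python sketch. The sample text is hypothetical (the real extracted text is only in the screenshot), and the function name extract_amounts is just an illustration, not a UiPath activity. It assumes amounts always have exactly two decimals and that any commas are thousands separators.

```python
import re
from decimal import Decimal

# One or more digits, optional ",ddd" groups, a literal dot, two decimals.
AMOUNT_RE = re.compile(r"\d+(?:,\d{3})*\.\d{2}")


def extract_amounts(text):
    """Return every amount found in text as a Decimal, commas removed."""
    return [Decimal(m.replace(",", "")) for m in AMOUNT_RE.findall(text)]


if __name__ == "__main__":
    # Hypothetical stand-in for the text returned by the PDF read step.
    sample = "Invoice Total 1927.00\nAmount Due 2,100.00"
    print(extract_amounts(sample))
```

In a UiPath workflow the same pattern should work in the Matches activity or with System.Text.RegularExpressions.Regex.Matches, since .NET supports this syntax as well.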
How did you try to extract?
When reading the PDF, you can check or uncheck the Preserve Formatting option. The two settings produce different text layouts, so use whichever suits you better.
It would be helpful if you could post the extracted text as-is so we can check it.
Cheers
Create a sample workflow.
Hello @rpaforum,
Whether or not the location of the data in the document is fixed, you can use the Extract Document Data activity and select Generative Predefined.
For each particular field, you can also enter a single-line prompt describing the value you want (here, 1927.00 and 2100.00).
Try this method and you should get your result.
Regards
Dheerandra Vishwakarma