How to redact data in scanned/native PDF documents?

Kalees9486 · October 26, 2023, 4:42am

Hi,

Is there a package in UiPath to redact sensitive information in native and scanned PDFs or image documents?

Thanks

Dilli_Reddy · October 26, 2023, 4:46am

For scanned PDFs or images, you need to perform OCR to extract text from the documents. UiPath has built-in activities for OCR. You can use the “Read PDF with OCR” or “Screen Scraping” activities to extract text.
Once you have the text, you can use UiPath activities to search for sensitive information and replace it with redacted text, for example with the “Replace” function.
You can also use UiPath activities to manipulate PDFs. You may need to extract the text, redact it, and then recreate the PDF.
If you need to automate more complex redactions, you can use UiPath to open image files and apply the redaction using image editing libraries.
Ensure that the redacted information is securely handled and stored to maintain data protection standards and comply with privacy regulations.
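
The search-and-replace step can be sketched outside UiPath as follows. This is a minimal Python illustration and not a UiPath activity: the two patterns, the `SENSITIVE_PATTERNS` name, and the `redact_text` helper are assumptions made up for the example, and real rules depend on what counts as sensitive in your documents.

```python
import re

# Hypothetical patterns for illustration only; adjust to your own data.
SENSITIVE_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}


def redact_text(text: str, mask: str = "[REDACTED]") -> str:
    """Replace every match of every sensitive pattern with the mask."""
    for pattern in SENSITIVE_PATTERNS.values():
        text = pattern.sub(mask, text)
    return text


sample = "Contact john.doe@example.com, SSN 123-45-6789."
print(redact_text(sample))  # Contact [REDACTED], SSN [REDACTED].
```

The same idea maps to a regex replace on the string returned by the OCR step. It only changes the extracted text, so the PDF still has to be rebuilt from the redacted text afterwards.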

Cheers…!

Kalees9486 · October 26, 2023, 4:52am

@Dilli_Reddy

Thank you for the quick reply.

I am aware of data extraction using ‘Read PDF Text’ or ‘Read PDF with OCR’. In this case, I expect the redaction to be done by placing a rectangular box with a green highlight over the data, rather than by replacing specific words. We have a third-party package called Encodian that performs a similar kind of redaction, but for native PDFs only. I am looking for something that works on scanned PDFs and image documents as well.

Any suggestions?

Dilli_Reddy · October 26, 2023, 4:59am

You might need to use a combination of OCR for scanned documents and a PDF manipulation library for both types of documents:
- OCR for Scanned Documents
- PDF Manipulation for Redaction
- Conditional Redaction
- Overlay Rectangles
- Save Redacted PDF
- Document Audit
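
The conditional redaction and rectangle overlay steps above can be sketched like this. It is a rough Python illustration under stated assumptions, not a UiPath package: it assumes the OCR step returns each word with a pixel bounding box (the `WordBox` shape is hypothetical), it uses a made-up pattern for what counts as sensitive, and it models the page as a plain grid of RGB tuples instead of a real image or PDF library.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class WordBox:
    """One OCR word with its bounding box in pixels (hypothetical shape)."""
    text: str
    x: int
    y: int
    width: int
    height: int


# Hypothetical rule: treat 3-2-4 digit groups as sensitive.
SENSITIVE = re.compile(r"^\d{3}-\d{2}-\d{4}$")
GREEN = (0, 255, 0)
WHITE = (255, 255, 255)


def find_redaction_boxes(words, pattern=SENSITIVE, pad=1):
    """Return padded (x0, y0, x1, y1) rectangles for words matching the pattern."""
    return [
        (w.x - pad, w.y - pad, w.x + w.width + pad, w.y + w.height + pad)
        for w in words
        if pattern.match(w.text)
    ]


def fill_rect(pixels, box, color=GREEN):
    """Overwrite the pixels inside box (clipped to the image) with a solid color."""
    height, width = len(pixels), len(pixels[0])
    x0, y0, x1, y1 = box
    for row in range(max(y0, 0), min(y1, height)):
        for col in range(max(x0, 0), min(x1, width)):
            pixels[row][col] = color


# A tiny 12x6 white "page" and two OCR words, one of them sensitive.
page = [[WHITE] * 12 for _ in range(6)]
words = [WordBox("SSN", 0, 1, 3, 2), WordBox("123-45-6789", 5, 1, 5, 2)]
for box in find_redaction_boxes(words):
    fill_rect(page, box)
```

Because the pixels are overwritten rather than covered, the original content is gone from the image itself. In a real workflow the filled page would then be written back out as an image or PDF page by whatever imaging library you use.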

Kalees9486 · October 26, 2023, 5:05am

@Dilli_Reddy

It would be helpful if you could share any documentation, procedure, or video for this.

Dilli_Reddy · October 26, 2023, 5:17am

Kalees9486 · October 26, 2023, 6:49am

Hello @Dilli_Reddy ,

It would be helpful if you could share a demo video or documentation for the redaction part.

Thanks

Dilli_Reddy · October 26, 2023, 7:09am

Kalees9486 · October 30, 2023, 6:27am

@Dilli_Reddy, the package is not available in the Marketplace with support for the latest version.

Is there any other package available?

Parth_Trivedi1 · April 4, 2024, 8:04pm

Was this resolved for you? I am looking for a solution to the same problem.
