PDF Redaction Custom Activity

That’s a great idea. Interesting feature request. I will add this to the feature request for an upcoming version.

Hi Bokyeong - I’ll keep you posted, and let once the new DU plugin is published.


The Omnipage engine is able to read asian characters if it’s engine pack is set as ‘Extended’ (not ‘Basic’).
How can I change the Omnipage engine pack setting to ‘Extended’ in ‘PDF Redaction’ activity?

Can I get a notice when the new vesion is released?

Thank you so much.

Excellent point. Thank you for pointing this out. I will adjust this and add this capability in the next release. It will be published very soon.

Hey Bernard,

Is there a link to download your sample workflow file that was used in the demo video? I downloaded the activity and it looks great.

Great tool, thank you for creating and sharing it Bernard! Some feature requests from myself (in the UK):

  • Ability to provide an array of regex patterns in the Formula property rather than only a single pattern
  • Support for phrases/patterns with spaces as mentioned in this thread
  • Support for non-US pre-defined patterns in the FormulaAuto property. For example, other currencies than USD, and phone number patterns other than US format

Out of interest, is there any limit on how many entries in the Keywords string array, or is it the .NET limit itself?

Hi @lawes

Could you share the sample workflow again.
The sample workflow is removed from this post.

Thank You!

Hi @lawes ,

I get the error when I use this activity:
The UiPath version that we are using is 2020.10.2
Used the package “UiPathTeam.PDFRedaction.Activities” package version 2.0.0 from the marketplcae link that you had shared.

Can you let us know if am missing something, i have filled all the required parameters

Redact

Hello Mr. Lawes
Has any of my suggestions been added?

  1. adding confidential mark
  2. redacting asian characters with omnipage extended
  3. redacting phrases

Hi Bokyeong

Yes. In fact, all of these have been added to the upcoming new release. I’ll share more once it is published in the next couple of weeks

Thank you so much.

@lawes

Hello Bernard,

We tested this in our project and we see there are a few limitations ,

  • It works on a pdf with upto 64 lines or pdf spanning for few pages.

  • It doesn’t work on a pdf more than 64+ lines or on a pdf spanning multiple pages. We get a blurred pdf document as the output without masking

  • Multiple occurrences of the same keyword are getting highlighted.

We are on a critical project, this functionality is awesome and very helpful. We really want to move ahead with this.

But we see there are limitations, How can we progress on this?

We can connect to discuss more on this.

Thank You!

Pleased to announce that I will be releasing the latest version of PDF Redaction Activity,
version 2.0.21

Release notes include:

  1. Fixed a number of bugs reported since the last major release
  2. Fixed document resolution blurring issue
  3. Simplified a number of arguments.
  4. Added ability redact documents processed by UiPath Document Understanding by passing in the Document Object Model and the Extraction Result.

The newly added DU Redaction Plugin Activity is a step up from the basic Redaction Activity in that it provides for

  • more intelligent redaction options through use of Document Understanding extraction tools and AI Center
  • Asian language OCR Options
  • handwriting capability,
  • ability to redact paragraphs rather than just words

This version is currently under review by our team, I will update you on this forum once it is finally released.

Manisha - thank you for reaching out to me directly. I’m glad we were able to resolve your challenges. Let me know if I can be of any additional assistance.

Best

Bernard

Does the new version include ‘adding confidential mark to redaction box’, by any chance?

Hi Bernard,

I’m unable to redact/highlight phrases in PDF using the new DU Redaction plugin. Only word redact is possible. Does the new version have the feature to redact phrases?

Thanks in advance!

Ithikash

Hi Ithikasha,

If you are able to extract a phrase with Document Understanding, then the entire phrase can be redacted with the plugin. Does your DU extraction result include captured ‘phrases’?

Best
Bernard