I followed the Document Understanding directions (set up taxonomy, load taxonomy, digitize document, classify document, data extraction, and present validation), but I cannot get the Regex Based Extractor to match any values. I’ve configured the expressions and tested that they work, and I’ve configured the extractors to check the boxes for the fields that should be extracted from the Regex Extractor. But when the validation station pops up, it says all values were not extracted. Exporting the extraction results shows the same–nothing pulled out. I’ve done this with position-based and ML extractors and haven’t had issues. What could I be missing?
Hi welcome to the community!
Is there anything more detailed that you can share with us?
Here’s several screenshots of the process.
I have the ML extractor and Regex extractor inside the Data Extraction Scope:
I input all of the variables to this scope activity from the previous steps according to the Doc Understanding instructional video:
I set the Configure Extractors to take NAIC ID from the Regex Extractor:
Here’s the Regex expression with the actual text that’s being read by the OCR as the sample, and there is in fact a match:
Why is there still no value extracted from the Regex portion?
any updates on this ?
I have no solution yet. Are you having the same issue?
Yes, I am noticing this issue. If there are multiple extractors, I see only the first one is working.
Could you get regex to work being the only extractor? That was failing for me as well.
I’m thinking of trying a workaround of doing the regex Matches activity on the DocText string, and then feeding the result into the ExtractionResults variable. Not sure if that’s easily modifiable, though.
Let me know if it works.
I will try extractor at a time and publish my results here as and when I complete
Hi, I am also facing same situation. followed the same steps but still during present validation station I have to manually select items always
I don’t think my idea is feasible to edit the ExtractionResults variable and add the Matches text.
I’d like to hear from a UiPath person who can speak to the issue with the Regex Based Extractor.
@alexcabuz It will be much appreciated, if you can take a look at this.I am also facing the exact same issue when using multiple extractors. My regex when tested separately it’s a match, but when during present validation station no data is getting extracted.
@btc653 I’m having this same issue and couldn’t edit the extraction results either. I got around it by exporting the extraction results into a dataset and then going through the tables and filling in any missing values using the matches activity. However this makes the Regex extractor useless so it’d be good a hear back from someone on support on whether it’s user error or just some delayed feature fixes.
For the record I’d also like to add that I can’t get the regex extractors to work by themselves or with other extractors, but the regex wizard does confirm a match (I’m at the point where I just match a word literal to try and get something in the results).