Hello,
I have a pdf where I need to extract “Effective Date” “Partnumber” “Total Cost”
I have extracted these data points using read pdf and regex and later according to my workflow I must place total cost value in excel row by filtering partnumbers. My output excel looks like this.
We have a Regex Datatable / Excel data, where you have extracted the values and the Part Number and the Total Price are co-related. In your case we have this data already as a Datatable, say RegexDT.
Next, we would have to read the Excel sheet data where we need to update the values. Get it as a Datatable, let’s Say Sheet2DT.
Now we can loop through the RegexDT using a For Each Row activity and inside it we can use the LookUp Datatable activity to get the row index of the matching part number value present in the Sheet2DT. The row index variable name be rowIndex.
Next, we can Check if the rowIndex value = -1, if so skip the update, meaning that the Part Number is not present in the Sheet2DT. If not -1, then we can update the Sheet2DT row value using the below Assign statement :
Another way…We can directly write on excel as well using below method
Use find/replace and search for partnumber in column A and store it to varpartcell
Use find/replace and then search for date you need in first row and save the output cell in vardatecell
Use write cell activity to write the data and give the cell number as followSystem.Text.RegularExpressions.Regex.Match(vardatecell,"\w+).Value + System.Text.RegularExpressions.Regex.Match(varpartcell,"\d+).Value
I am extracting row number from part and column from date variables