PDF Search Automation

Hey Team
I want to search like 4.1, 4.2 and copy the data in that point , Is there any automation for this, Please help me to copy the data

1 Like

Hi

There are many ways to do it

  1. Simple READ PDF or READ PDF WITH OCR
    And get the output as text and then use Regex method to identify the text u want
    For Regex cheat sheet refer this
  1. Next u can use document understanding with Regex extractor

Cheers @Kuldeep_Pandey

@Kuldeep_Pandey

You can use read pdf text and then regex to extract the required data

can try something like this

(?<=4\.1\.1\.2.*\n).*

cheers

4.1.1 Is not fixed have to change this every time from excel input

also taking these no from excel row

@Kuldeep_Pandey

You can use variable like this

Use this inside for each row in Datatable activity

System.Text.RegularExpressions.Regex.Match(str,"(?<=" + CurrentRow("ColumName").ToString + " .*\n).*".Replace(".","\.")).Value

CurrentRow(“ColumName”).ToString - this is the column containing the numbers needed

cheers

Its giving me blank value
Not giving Output

@Kuldeep_Pandey

Is it Possible to send a sample data here…

Also please try this

System.Text.RegularExpressions.Regex.Match(str,"(?<=" + CurrentRow("ColumName").ToString + " .*\n).*".Replace(".","\."),RegexOptions.MultiLine).Value

cheers

6.2.1 See ISO 9001:2015 requirements.
No Ford Customer-Specific Requirement for this section.
6.2.2 See ISO 9001:2015 requirements.
No Ford Customer-Specific Requirement for this section.
6.2.2.1 Quality objectives and planning to achieve them — supplemental
No Ford Customer-Specific Requirement for this section.
6.3 Planning of changes
See ISO 9001:2015 requirements.

Sample data

We need all the data before any dynamic no

Like this
No Ford Customer-Specific Requirement for this section.

Hi Anil

Can you check the replace

seems like it will replace all the . with . so i guess .replace should be places after .tostring

System.Text.RegularExpressions.Regex.Match(str,"(?<=" + CurrentRow("ColumName").ToString.Replace(".","\.")+ " .*\n).*",RegexOptions.MultiLine).Value

Reagrds

@Kuldeep_Pandey

@LAKSHMI_NARAYANA_PEMMASAN is correct…please use the updated one

System.Text.RegularExpressions.Regex.Match(str,"(?<=" + CurrentRow("ColumName").ToString.Replace(".","\.") + " .*\n).*",RegexOptions.MultiLine).Value

Yes This is working But one update is there
like In 8.1 I got
8 Operation
8.1 Operational planning and control
Statement of Work
Appropriate to the organization’s responsibilities, the organization shall meet the requirements
of the Statement of Work(s). There may be an Engineering Statement of Work (available from
the Ford Product Development Engineer), an Assembly Statement of Work, a Manufacturing
Statement of Work or other types available from the appropriate Ford organization. See the
Global Product Development System (GPDS) for specific timing.
APQP
The External Supplier APQP/PPAP Readiness Assessment (Schedule A) is available through
https://web.qpr.ford.com/sta/APQP.html
The organization shall submit completed Schedule A’s as specified in the Schedule A
notification letter for each program (monthly and after any significant change in APQP status).
This applies to priority and non-priority suppliers, see Supplier Engagement Process on
https://web.qpr.ford.com/sta/GPDSSupplierEngagement.html
Even if the organization has not received a Schedule A notification letter for a program but has
New Tooled End Items (NTEIs) for a Ford program launch, the organization is still required to
complete a Schedule A for each program milestone for all NTEIs and retain the final Schedule A
in the PPAP file for the life of part (production and service) plus one year.
Page 19 of 40 December 2021
Copyright © 2021 Ford Motor Company.
Prototypes
When the organization is also sourced with the production of prototypes, effective use should be
made of data from prototype fabrication to plan the production process. The organization
records the dimensional data per the Prototype Build Control Plan, reviews the measured
characteristics with Ford PD Engineer and obtains approval on the results from the Ford PD
Engineer with confirmed acceptance of parts. If prototype parts are not fully compliant to
specification, Ford PD Engineering can approve use of the part with a WERS Alert.
The organization should use the APQP/PPAP Evidence Workbook to record prototype part data
for Ford PD review. The APQP/PPAP Evidence Workbook is available through
Home Realm Discovery .
Prototype Tooling
To dispose of Prototype Tooling, the organization must contact Global Asset Management
(GAM) within 30 days after PPAP2 completion and request a Prototype Tooling Disposal form.
The organization submits the completed form to the Ford PD Engineering supervisor for
approval and the approved form is to be returned to GAM for disposal authorization and scrap
processing.
Organization(s) must contact GAM via email: GAMNA1@ford.com
(Organizations with Tooling in Europe, contact GAM at gamfoe@ford.com)
8.1.1 Operational planning and control — supplemental
No Ford Customer-Specific Requirement for this section.
8.1.2 Confidentiality
No Ford Customer-Specific Requirement for this section

How to take 8.1

@Kuldeep_Pandey

So badically you mean 8.1 is having multiple lines? Instead of one line?

Cheers

Yes Correct many of have multiple lines

Any Regex Statement for this?

try this

System.Text.RegularExpressions.Regex.Match(str,"(?<=" + CurrentRow("ColumName").ToString.Replace(".","\.")+ "[\S\s]+?(?=\r?\n\d|\z)",RegexOptions.MultiLine).Value

mostly it should work for all the cases

Regards


Errror

try this

System.Text.RegularExpressions.Regex.Match(str,"(?<=" + CurrentRow("ColumName").ToString.Replace(".","\.")+ ")[\S\s]+?(?=\r?\n\d|\z)",RegexOptions.MultiLine).Value

Regards