Regexr formula NEEDS MAJOR HELP PLEASE :( anyone -NEEDS URGENT HELP!

Hi all. Is anyone able to help extract all of the director’s name (RAUL ANBULLY PAKURDAD, SAMBABABIN HENTHAL MUDYR, and TAKHALA SAI PANGSAI) here using RegExr.com? Need major help! Thank you all so much!

ACCOUNTING AND CORPORATE REGULATORY AUTHORITY
(ACRA)

WHILST EVERY ENDEAVOR IS MADE TO ENSURE THAT INFORMATION PROVIDED IS UPDATED AND CORRECT. THE AUTHORITY
DISCLAIMS ANY LIABILITY FOR ANY DAMAGE OR LOSS THAT MAY BE CAUSED AS A RESULT OF ANY ERROR OR OMISSION.

Business Profile (Company) of WHAT INTHEWO PTE. LTD. (123456674M) Date: 19/02/2020

The Following Are The Brief Particulars of :

Registration No. : 123456674M

Company Name. : WHAT INTHEWO PTE. LTD.

Former Name if any :

Incorporation Date. : 10/01/2017

Company Type : HELVATE COMPANY LIMITED BY SHARES

Status : Dead Company

Status Date : 10/01/2017

Principal Activities

Activities (I) : INFORMATION TECHNOLOGY CONSULTANCY (EXCEPT HELPRSECURITY) (61231)

Description : SOFTWARE CONSULTING, SOFTWARE APPLICATION DEVELOPMENT

Activities (II) :

Description :

Capital

Issued Share Capital Number of Shares * Currency Share Type

(AMOUNT)

25321 9999 TAIWANESE, DOLLARS ORDINARY

  • Number of Shares includes number of Treasury Shares

Paid-Up Capital Number of Shares Currency Share Type

(AMOUNT)

25321 TAIWANESE, DOLLARS ORDINARY

COMPANY HAS THE FOLLOWING ORDINARY SHARES HELD AS TREASURY SHARES

Number Of Shares Currency

Authentication No. : L12001832Z

Page 1 of 4ACCOUNTING AND CORPORATE REGULATORY AUTHORITY
(ACRA)

WHILST EVERY ENDEAVOR IS MADE TO ENSURE THAT INFORMATION PROVIDED IS UPDATED AND CORRECT. THE AUTHORITY
DISCLAIMS ANY LIABILITY FOR ANY DAMAGE OR LOSS THAT MAY BE CAUSED AS A RESULT OF ANY ERROR OR OMISSION.

Business Profile (Company) of WHAT INTHEWO PTE. LTD. (123456674M) Date: 19/02/2020

Registered Office Address : 4 SHELDON WAY
#01-02
GPS CENTRE II
TAIWAN (43212)

Date of Address : 10/01/2017

Date of Last AGM : 03/02/2020

Date of Last AR : 18/02/2020

FYE As At Date of Last AR : 31/12/2018

Audit Firms

NAME

LKKK LLP

Charges

Charge No. Date Registered Currency Amount Secured Chargee(s)

Officers/Authorised Representative(s)

Name ID Nationality Source of Date of Appointment
Address
Address Position Held

RAUL ANBULLY PAKURDAD S3451233E TAIWANESE CITIZEN OSCARS 25/11/2017

36 TURF CLUB ROAD Director
SINGAPORE (287978)

SAMBABABIN HENTHAL MUDYR S3123442E INDIAN ACRA 18/07/2019

80 JOLLYHELI ROAD Director
#06-11
CITYLIGHTS
TAIWAN (141532)

TAKHALA SAI PANGSAI F4123451Q INDIAN ACRA 10/01/2017

36 STURDY ROAD Director
#01-53
RIVERDALE
TAIWAN (754352)

TAKHALA SAI PANGSAI F4123451Q INDIAN ACRA 10/01/2017

Authentication No. : L12001832Z

Page 2 of 4ACCOUNTING AND CORPORATE REGULATORY AUTHORITY
(ACRA)

WHILST EVERY ENDEAVOR IS MADE TO ENSURE THAT INFORMATION PROVIDED IS UPDATED AND CORRECT. THE AUTHORITY
DISCLAIMS ANY LIABILITY FOR ANY DAMAGE OR LOSS THAT MAY BE CAUSED AS A RESULT OF ANY ERROR OR OMISSION.

Business Profile (Company) of WHAT INTHEWO PTE. LTD. (123456674M) Date: 19/02/2020

Officers/Authorised Representative(s)

Name ID Nationality Source of Date of Appointment
Address
Address Position Held

36 STURDY ROAD Secretary
#01-53
RIVERDALE
TAIWAN (754352)

Shareholder(s)

Name ID Nationality/Place of Source of Address Changed
incorporation/Origin Address
Address

1 HELLS TECHNOLOGIES PTE. LTD. 1234161241H TAIWANESE ACRA

4 SHELDON WAY
#12-03
GPS CENTRE II
TAIWAN (068807)

Ordinary(Number) Currency

9999 TAIWAN, DOLLARS

Abbreviation

UL - Local Entity not registered with ACRA

UF - Foreign Entity not registered with ACRA

AR - Annual Return

AGM - Annual General Meeting

FS - Financial Statements

FYE - Financial Year End

OSCARS - One Stop Change of Address Reporting Service by Immigration & Checkpoint Authority.

Note :

  • The information contained in this Business Profile is extracted from lodgements filed by this entity with ACRA.

Authentication No. : L12001832Z

Page 3 of 4ACCOUNTING AND CORPORATE REGULATORY AUTHORITY
(ACRA)

WHILST EVERY ENDEAVOR IS MADE TO ENSURE THAT INFORMATION PROVIDED IS UPDATED AND CORRECT. THE AUTHORITY
DISCLAIMS ANY LIABILITY FOR ANY DAMAGE OR LOSS THAT MAY BE CAUSED AS A RESULT OF ANY ERROR OR OMISSION.

Business Profile (Company) of WHAT INTHEWO PTE. LTD. (123456674M) Date: 19/02/2020

  • The list of officers for this entity is available for online authentication within 30 days from the date of purchase of this Business Profile. Please scan
    the QR code available on the last page of this profile to access the authentication page. For more information, please visit www.acra.gov.sg.

FOR REGISTRAR OF COMPANIES AND BUSINESS NAMES
SINGAPORE

RECEIPT NO. : ACRA12344534544124 (Free Business Profile by ACRA)

DATE : 19/02/2020

This is computer generated. Hence no signature required.

Authentication No. : L12001832Z

Page 4 of 4

Hi @Kian
All director names seem to be followed by what looks like their employee ID.
It has a certain pattern, which is - [1 letter] [7 digits] [1 letter]

RAUL ANBULLY PAKURDAD S3451233E
SAMBABABIN HENTHAL MUDYR S3123442E
TAKHALA SAI PANGSAI F4123451Q

If you write your regex to find any string that ends with this pattern, you should only find the director names.
Here is the solution, with the explanation shown on the same page:
https://regex101.com/r/Hoa2D3/1/

Also, may I suggest you refer to the Regex megapost by our resident RegEx expert @Steven_McKeering

1 Like

Hi @RPAForEveryone thanks for replying! However, I am required to use the word ‘director’ as an anchor since I am required to not only extract the director’s name for this pdf, but also for other pdf’s as well of different IC formats.

Is there any other way? :frowning:

Here’s another pattern using “Director” as anchor:

^[^\d\n\r]+(?= .+(\r?\n){2,}.+Director)

Note that you need to specify the multiline option since we want ^ to match the beginning of each line.

Do you have a sample of the other pdfs?

1 Like

Yes I do @ptrobot , but to simplify things here, below are 2 more samples based on the Director’s part.
Furthermore, I’ve tried inputting this formula into Regexr Tester and UiPath, however, when I swapped to another PDF, it does not work. Is there any possible way to solve this?

1.------------------------------------------------------------------
Officers/Authorised Representative(s)

Name ID Nationality/Citizenship Source of Date of Appointment
Address
Address Position Held

RAMASATU SUBARUSUZUKI IYER 55123534G AMERICAN CITIZEN ACRA 09/09/2020
@RAMADAN

14 FLORADADA ROAD Director
#21-12
AZALEA PARK BANGOLAW
SINGAPORE (523423)

ELAHAA HUA SUALA BEEHOON S3213467H SINGAPORE CITIZEN OSCARS 09/09/2020

113 CHAOGA RIS YEEYEET Secretary
#15-129
THE PALAY
TAIWAN (123434)

Shareholder(s)

2.-------------------------------------------------------------------------
Officers/Authorised Representative(s)

Name ID Nationality/Citizenship Source of Date of Appointment
Address
Address Position Held

CHAN CHUA REN S2315852F SINGAPORE CITIZEN ACRA 17/04/2018

105 TOWER ROAD Director
#10-380
SINGAPORE (411109)

KAN SHUN TEHO AB1333545 THAI ACRA 17/04/2018

27/1 SOI CHAEN CHI, LONGTON NUER Director
SUB-DISTRICT
WATAMA DISTRICT, MAHANI NAHON 10519,
THAILAND

GOH BEEN CHYI (WU WENI) S0135567Q SINGAPORE CITIZEN ACRA 17/04/2018

5D ANG MEN KIE STREET 52 Secretary
#00-37
CITY VIEW @ CHEG SAN
SINGAPORE (501496)

Authentication No. : D23086009L

Page 2 of 4ACCOUNTING AND CORPORATE REGULATORY AUTHORITY
(ACRA)

WHILST EVERY ENDEAVOR IS MADE TO ENSURE THAT INFORMATION PROVIDED IS UPDATED AND CORRECT. THE AUTHORITY
DISCLAIMS ANY LIABILITY FOR ANY DAMAGE OR LOSS THAT MAY BE CAUSED AS A RESULT OF ANY ERROR OR OMISSION.

Business Profile (Company) of TEXIP INTERN PRIVATE LIMITED Date: 26/06/2020
(202212984B)

Shareholder(s)

The pattern should work fine for sample 2 but not sample 1 since it contains the line “@RAMADAN” between the name row and director row. What the pattern does is that it takes the first row above Director row (that is not empty) and extract the first part of the name row that does not contain any digit.

Is the name row always in the same format?

Firstname MiddleName Lastname A1234567B ..... 00/00/0000

Hi @ptrobot .Yes, it’s always as of this format. However, sometimes the (A1234567B) would be either (AB123456 or 123456789). As shown in the additional example below. Is there any possible way to solve this?

Officers/Authorised Representative(s)

Name ID Nationality/Citizenship Source of Date of Appointment
Address
Address Position Held

CIA KIM SENG, NICHOAS SA1232735 SINGAPORE CITIZEN ACRA 02/08/1999

9 COOLER LOSE Director
SERANGOON EDEN ESTATE
SINGAPORE (586109)

CHIA CHEN HEY MIKEY 12546809 SINGAPORE CITIZEN ACRA 06/06/1995

6 LIAN WAN STATE Director
GOLDEN HILL ESTATE
SINGAPORE (560017)

LIM AH BENG S1338180A SINGAPORE CITIZEN OSCARS 06/06/1995

17 PHONIX DRIVE Director
SINGAPORE (608129)

CHIA QUEEN SANG CHRISTOPHER S6456563E SINGAPORE CITIZEN ACRA 31/07/2007

139 GILSTEADY ROAD Director
#11-25
GILSTEADY BOOKS
SINGAPORE (311083)

Authentication No. : W29070211B

Page 2 of 4ACCOUNTING AND CORPORATE REGULATORY AUTHORITY
(ACRA)

WHILST EVERY ENDEAVOR IS MADE TO ENSURE THAT INFORMATION PROVIDED IS UPDATED AND CORRECT. THE AUTHORITY
DISCLAIMS ANY LIABILITY FOR ANY DAMAGE OR LOSS THAT MAY BE CAUSED AS A RESULT OF ANY ERROR OR OMISSION.

Business Profile (Company) of PMS CONTROLLING STEM PTE LTD (109514989Z) Date: 29/10/2020

Officers/Authorised Representative(s)

Name ID Nationality/Citizenship Source of Date of Appointment
Address
Address Position Held

CHIA QUEEN SANG CHRISTOPHER S6456563E SINGAPORE CITIZEN ACRA 31/07/2007

139 GILSTEADY ROAD SECRETARY
#11-25
GILSTEADY BOOKS
SINGAPORE (311083)

Shareholder(s)

@ptrobot Adding onto the previous post, since I am incorporating Regexr formula into UiPath, UiPath is unable to read the formula which contains multiline, is there any way to solve this? :frowning: Please help me :frowning:

@Kian If I may, Can I ask as to How many different types of PDF format do you have and want to process ?

Test with this updated pattern. It will skip the “@RAMADAN” line and will only match if the line ends with a date.

^[^\d\n\r]+(?= .+\d{2}/\d{2}/\d{4}(.*\r?\n.*)?(\r?\n){2,}.+Director)

Make sure that you have the Multiline option checked in the Matches activity.

image

image

RegexDirector.xaml (7.8 KB)

1 Like

Hi @supermanPunch ,currently I am required to process 41 Pdf’s. However since they contain confidential client details, I am not able to post all of the Pdf’s here.
However, all of the 41 Pdf’s has the same design as the per Pdf shown below. Just that some Pdf’s will have lets say, 6 directors, and that some Pdf’s have lets say 6 pages in total where some only 3. Therefore I am required to perform Regexr extraction as it is the most viable method as of now.

Example.pdf (115.6 KB)

Omg @ptrobot. Thank you so very much! This formula works! Thank you so so so much!!

1 Like

@Kian You’re welcome! Glad to hear that it’s working. :grinning:

Hi @ptrobot, if I have anymore questions that require your assistance, can I send you a private message to enquire your assistance?

@Kian Yes, you’re welcome to PM me. I will try to help if I can. But in general I would recommend that you create new topics/posts in the forum. There are alot of helpful people here and most of them have more experience with UiPath than I have.

Hi @ptrobot thank you so much!!! :smiley:

This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.