I really need help. I am working on scraping web data from different URLs.
This is my workflow:
I got the results below. How can I split out the first two companies from the list and store them in different variables?
Example:
Company name (The Design Firm (TDF) (6 months)), job title (Digital Marketing Director), and date (1/2022 - 8/2022).
Note
The Design Firm (TDF) (6 months)Digital Marketing Director1/2022 - 8/2022
11thAgency (1 year, 3 months)Project Manager4/2022
StoriCamPK (1 year)Marketing Communications Manager7/2022
Cloud Innovators Solution (1 year, 2 months)Digital Marketing Executive1/2021 - 4/2022
Growthmates (4 months)Project Manager9/2020 - 1/2021
Cloud Innovators Solution (9 months)Social Media Marketing Specialist1/2020 - 10/2020
Star Marketing Pvt. Ltd (1 year, 1 month)Associate Digital media exective 11/2018 - 12/2019
Vivotic Solucions (9 months)Content Writer3/2017 - 12/2017
Thanks.
Anil_G
(Anil Gorthi)
August 14, 2023, 6:25am
2
@Gowthaman_Gajakutti
Can you please tell me which is the input string and what data you need from it?
Cheers
rlgandu
(Rajyalakshmi Gandu)
August 14, 2023, 6:37am
3
@Gowthaman_Gajakutti
Use the Find Matching Patterns activity.
Company name:
job title:
Dates
Hi @Anil_G ,
1. His sample input
11thAgency (1 year, 3 months)Project Manager4/2022
2. Sample output
First Company name: 11thAgency (1 year, 3 months)
First Company Job Title: Project Manager
First Company Experience Date: 4/2022
Company Name:- .*(?=\))
Company Role:- ((?<=\))(\w*\s*)*(?=\d/|\d\d/))
Date:- \d{1,2}/\d{4}
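For anyone who wants to check the split outside UiPath, here is a minimal sketch in Python. It does not use the three patterns above; it assumes a single hypothetical pattern (`LINE_PATTERN`) where the company ends at the last ")", the date or date range sits at the end of the line, and the job title is whatever lies between. The function name `split_line` is also just an illustration.

```python
import re

# Hypothetical pattern (not from this thread): company runs up to the last ")",
# the date or date range is anchored at the end, the title is the text in between.
LINE_PATTERN = re.compile(
    r"^(?P<company>.*\))"
    r"(?P<title>.*?)\s*"
    r"(?P<date>\d{1,2}/\d{4}(?:\s*-\s*\d{1,2}/\d{4})?)$"
)


def split_line(line):
    """Return (company, title, date) for one scraped line, or None if it does not fit."""
    match = LINE_PATTERN.match(line.strip())
    if match is None:
        return None
    return match.group("company"), match.group("title"), match.group("date")


company, title, date = split_line("11thAgency (1 year, 3 months)Project Manager4/2022")
print(company)  # 11thAgency (1 year, 3 months)
print(title)    # Project Manager
print(date)     # 4/2022
```

Because the date part allows an optional "- M/YYYY" tail, lines such as "1/2022 - 8/2022" come back as one date range rather than only the first date.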
Thanks dude. It is very useful.
How do I get the details of the first two companies from the list?
My workflow
Dynamic experience details from different URLs
The Design Firm (TDF) (6 months)Digital Marketing Director1/2022 - 8/2022
11thAgency (1 year, 3 months)Project Manager4/2022
StoriCamPK (1 year)Marketing Communications Manager7/2022
Cloud Innovators Solution (1 year, 2 months)Digital Marketing Executive1/2021 - 4/2022
Growthmates (4 months)Project Manager9/2020 - 1/2021
Cloud Innovators Solution (9 months)Social Media Marketing Specialist1/2020 - 10/2020
Star Marketing Pvt. Ltd (1 year, 1 month)Associate Digital media exective 11/2018 - 12/2019
Vivotic Solucions (9 months)Content Writer3/2017 - 12/2017
Hey, if you use the Find Matching Patterns activity and take its output from the Results field in the Properties panel, you can then use the index to get the first two values.
Can you tell me how to find the index in my workflow? Thanks.
Anil_G
(Anil Gorthi)
August 14, 2023, 7:04am
9
@Gowthaman_Gajakutti
I believe the regex is already provided.
In the For Each loop's properties there is an Index property to which you can assign an integer variable. Then use the condition index<2 in an If to handle only the first two items, and on the Else side use Break to stop the loop after two.
cheers
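The index<2 idea can be sketched like this in Python, just to show the logic; in UiPath the same thing would be a For Each with its Index property, an If, and a Break. The variable names (`lines`, `first_two`, `first_company`, `second_company`) are assumptions for illustration.

```python
# Scraped lines, copied from the sample in this thread (shortened to four entries).
lines = [
    "The Design Firm (TDF) (6 months)Digital Marketing Director1/2022 - 8/2022",
    "11thAgency (1 year, 3 months)Project Manager4/2022",
    "StoriCamPK (1 year)Marketing Communications Manager7/2022",
    "Growthmates (4 months)Project Manager9/2020 - 1/2021",
]

first_two = []
for index, line in enumerate(lines):
    if index < 2:
        # Then side: keep the item while the index is 0 or 1.
        first_two.append(line)
    else:
        # Else side: stop the loop once two items have been collected.
        break

# Store the two entries in separate variables.
first_company, second_company = first_two
print(first_company)
print(second_company)
```

The unpacking on the last step assumes the list has at least two entries; with fewer, it would need a length check first.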
Hi @Gowthaman_Gajakutti
Can you try it like this, using the regex?
Steps:
Take an Assign activity and set the value like this: System.Text.RegularExpressions.Regex.Matches(str,"(?<=[)])[A-Za-z\s]+|.*(?=[)])|\d{1,2}[/]\d{4}")
After that, you can pick each value by its index.
For reference, see the screenshot below.
Alternatively, use a separate Matches activity for each field and pass the regex patterns like this:
.*(?=[)]) for the company name
(?<=[)])[A-Za-z\s]+ for job role
\d{1,2}[/]\d{4} for the date
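As a quick check of these three patterns, here is a small sketch that runs them one by one on the 11thAgency sample. It uses Python's `re` module as a stand-in for the Matches activity, so treat it as an approximation of what UiPath returns, not a confirmed result.

```python
import re

line = "11thAgency (1 year, 3 months)Project Manager4/2022"

# Company: everything before the last ")". The lookahead does not consume
# the ")" itself, so the match ends just before it.
company = re.search(r".*(?=[)])", line).group()

# Job role: letters and whitespace that directly follow a ")".
role = re.search(r"(?<=[)])[A-Za-z\s]+", line).group()

# Date: the first M/YYYY or MM/YYYY occurrence.
date = re.search(r"\d{1,2}[/]\d{4}", line).group()

print(company)  # 11thAgency (1 year, 3 months
print(role)     # Project Manager
print(date)     # 4/2022
```

Two things to watch for in this sketch: the company value comes back without its closing ")", so it may need to be appended afterwards, and on a line like "The Design Firm (TDF) (6 months)..." the first job-role match is the single space after "(TDF)", so the last match is the one to take there.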