I've run out of ideas with this case and need your support. Let me try to explain it.
The use case is based on e-mails reaching a shared mailbox. The body of those mails is as follows:
Each line contains information that has to be retrieved and then entered into a web page.
My problem is that the body of those mails comes in HTML format, and I haven’t been able to extract the data with any of the methods I can think of. Here are the “insides” of the HTML file. I marked in yellow the tag associated with each piece of data I want to retrieve (the data itself is in green):
I tried several methods (Get Full Text, screen scraping…) and I’m not able to get a structured entity to start working from. If Regex is the best solution here, I need your knowledge and expertise, as I’m not that good with Regex.
Thanks everyone for your suggestions and, please, let me know if something is unclear.
Thanks for your suggestion. I was already able to save the body of the mail in a text file. My problem comes right after that: I don’t know how to get rid of all the rubbish (unwanted tags and symbols) and leave only the data I need.
If some Regex ‘guru’ could lend a hand here I would really appreciate it.
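One way to drop the unwanted tags without writing any Regex is to let an HTML parser do the stripping. Here is a minimal sketch using Python's standard `html.parser`. The sample body, the `TextExtractor` class and the `html_to_lines` helper are invented for illustration, since the real mail HTML isn't reproduced here, and the set of tags treated as line breaks is an assumption.

```python
from html.parser import HTMLParser

# Assumption: these tags mark the end of one "line" of the mail body.
BLOCK_TAGS = {"p", "div", "tr", "li", "br", "table"}


class TextExtractor(HTMLParser):
    """Collects the visible text of an HTML document, one entry per block."""

    def __init__(self):
        super().__init__()  # entities such as &nbsp; are decoded by default
        self.lines = []
        self._current = []
        self._skip_depth = 0  # > 0 while inside <style> or <script>

    def _flush(self):
        # Join the pieces of the current block and collapse all whitespace
        # (including non-breaking spaces) into single spaces.
        text = " ".join("".join(self._current).split())
        if text:
            self.lines.append(text)
        self._current = []

    def handle_starttag(self, tag, attrs):
        if tag in ("style", "script"):
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self._flush()

    def handle_endtag(self, tag):
        if tag in ("style", "script"):
            self._skip_depth = max(0, self._skip_depth - 1)
        elif tag in BLOCK_TAGS:
            self._flush()

    def handle_data(self, data):
        if not self._skip_depth:
            self._current.append(data)


def html_to_lines(body_html):
    parser = TextExtractor()
    parser.feed(body_html)
    parser.close()
    parser._flush()  # keep any trailing text that no block tag closed
    return parser.lines


# Hypothetical mail body, only to show the shape of the result.
sample = (
    "<html><head><style>p{color:red}</style></head><body>"
    "<p>Name:&nbsp;John Smith</p><p>Order: <b>12345</b></p>"
    "</body></html>"
)
lines = html_to_lines(sample)
print(lines)
```

Each entry in `lines` is then plain text with no markup, which is a much easier starting point for picking out the individual values.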
Sure. Basically, I get all the e-mails in scope with a ‘Get Exchange Mail Message’ activity and I store them in a list of Mail Messages.
Then I do a For Each and process each mail on that list. I can read the body with an item.Body.ToString assignment.
So, the last step is to save that information with a ‘Write Text File’ activity. And now I have a plain-text file (which I can save as *.txt or *.html) containing the information I mentioned in my first post.
I need to extract the information from this file, and that’s where I’m stuck.
item.Body is similar to what we did in the Immediate panel, but it returns plain text only.
Please try the same on your end and share a screenshot with us. Thanks.
I can’t see the difference between item.Body and item.Body.ToString. I guess they’re exactly the same and, I insist, that’s not the real issue.
What I need is to extract the information from the plain-text file that contains the HTML code. And I guess that’s a job for Regex.
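For reference, a Regex-based extraction could look roughly like the sketch below. It is written in Python and rests on a big assumption: that each line of the mail reads as `Label: value` once the tags are removed. The field names, the sample HTML and the `extract_fields` helper are all invented, so the real body may well need a different pattern. Stripping tags with a regex is also crude and can break on unusual markup.

```python
import html
import re

# Tags assumed to end a line of the mail body.
LINE_BREAK_RE = re.compile(r"<br\s*/?>|</(?:p|div|tr|li)>", re.IGNORECASE)
# Any remaining tag, including its attributes.
TAG_RE = re.compile(r"<[^>]+>")
# One "Label: value" pair per line.
FIELD_RE = re.compile(
    r"^\s*(?P<label>[^:\n]+?)\s*:\s*(?P<value>.+?)\s*$",
    re.MULTILINE,
)


def extract_fields(body_html):
    """Return a dict of label -> value found in an HTML mail body."""
    text = LINE_BREAK_RE.sub("\n", body_html)  # keep the line structure
    text = TAG_RE.sub("", text)                # drop all other markup
    text = html.unescape(text).replace("\xa0", " ")  # decode &nbsp; etc.
    return {m["label"]: m["value"] for m in FIELD_RE.finditer(text)}


# Hypothetical mail body; the real field names are not known here.
sample = (
    "<p><span style='color:#000'>Customer:</span> ACME Ltd</p>"
    "<p>Invoice:&nbsp;INV-001</p>"
    "<p>Amount: 1,250.00 EUR</p>"
)
fields = extract_fields(sample)
print(fields)
```

The patterns use only basic constructs, so they should carry over to a .NET regex with small changes (for example, .NET writes named groups as `(?<label>...)` rather than `(?P<label>...)`), although that has not been tested in UiPath here.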