Python or C#

Hi, what would you use to extract text from PDFs, Python or C#, and why? I would like to extract the text and use its font and size to recognize it. Thank you!

Hello @anamariavioleta.dinca,

  1. For your specific requirement, extracting text and using its font and size information for recognition, both Python and C# can do the job. However, if you plan to add machine learning models for text recognition, Python’s extensive ecosystem of machine learning libraries may offer more options and flexibility.

  2. If you’re more comfortable with Python, or your project relies heavily on machine learning, Python might be the better choice. On the other hand, if your project is .NET-based or requires tight integration with Windows services, C# could be the preferable option.
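
As a rough illustration of the font-and-size idea on the Python side, here is a minimal sketch. It assumes the PyMuPDF package is installed and reads spans through `page.get_text("dict")`. The `Span` type, the `read_spans` and `label_span` helpers, the 14 pt threshold, and the "bold in the font name" rule are all hypothetical choices for this example, not something taken from your document.

```python
from dataclasses import dataclass
from typing import Iterator


@dataclass
class Span:
    """One run of text that shares a single font and size."""
    text: str
    font: str
    size: float


def read_spans(pdf_path: str) -> Iterator[Span]:
    """Yield every text span in the PDF (assumes PyMuPDF is installed)."""
    import fitz  # PyMuPDF; imported lazily so label_span works without it

    doc = fitz.open(pdf_path)
    try:
        for page in doc:
            for block in page.get_text("dict")["blocks"]:
                # Image blocks carry no "lines" key, so default to empty.
                for line in block.get("lines", []):
                    for span in line["spans"]:
                        yield Span(span["text"], span["font"], span["size"])
    finally:
        doc.close()


def label_span(span: Span, heading_min_size: float = 14.0) -> str:
    """Label a span from its font and size (illustrative rules only)."""
    if span.size >= heading_min_size:
        return "heading"
    if "bold" in span.font.lower():
        return "label"
    return "body"


if __name__ == "__main__":
    # Hand-written spans standing in for what read_spans() would yield.
    demo = [
        Span("Invoice", "Helvetica-Bold", 18.0),
        Span("Total:", "Helvetica-Bold", 10.0),
        Span("42.00", "Helvetica", 10.0),
    ]
    for s in demo:
        print(label_span(s), repr(s.text))
```

If the input layout is always the same, fixed rules like these are usually enough and no machine learning is needed. The same approach carries over to C#: read the text runs with their font name and size from a .NET PDF library, then apply the same kind of rules.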

Regards,
Dheerendra Vishwakarma

Thank you very much. I’ve never worked with Python before, so I’ll probably stick with C# for now, because I don’t have anything very complicated to do and the input is always the same.

@anamariavioleta.dinca,

You’re welcome.

Happy Automation :slightly_smiling_face:
