Hi, what would you use to extract text from pdfs, python or c#? Why? I would like to extract the text and use the font and size to recognize it. Thank you!
hello @anamariavioleta.dinca ,
-
For your specific requirement of extracting text and utilizing font and size information for recognition, both Python and C# can achieve this task effectively. However, if you plan to incorporate machine learning models for text recognition, Python’s extensive ecosystem of machine learning libraries may offer more options and flexibility.
-
if you’re more comfortable with Python or your project involves extensive use of machine learning, Python might be a better choice. On the other hand, if your project is .NET-based or requires tight integration with Windows services, C# could be the preferable option.
Regards,
Dheerendra Vishwakarma
Thank you very much, I’ve never worked with python before, so I’ll probably limit myself to c# for the moment because I don’t have anything very complicated to do, and the input is always the same.
This topic was automatically closed 3 days after the last reply. New replies are no longer allowed.