GM guys,
I am trying to read the .pdf files from filling history with Python to extract financial data, and as its scan images, seems that it does not work properly, it looses data or make it wrong, I am not sure the best approach here, does any of you have a validated method to read the .pdfs with some tool? Or get this financial data using another method? What is the method used by the other database providers? the goal is to use some technology to have it automated, to read the the financial data for companies like Revenue by year, this data seems to be only inside the .pdfs