Extract PDF information via PHP position

Asked

Viewed 6,690 times

2

Is it possible to extract the information according to the PDF position (x,y coordinate) in PHP? I have seen that it is possible to do this in C# using the Pdfreader class.

1 answer

3


  • I’ve already used these functions... They take everything you have in the PDF.. I would like to take only one area of the PDF, that is, pass the coordinates and only get the text of that selected area!

  • If that area that Voce wants to pick up with strings is always fixed then it is a property of the document as CreationDate or Page right?

  • So here’s the thing... There will be several types of PDF’s, for each model I will create a "mask" to extract the information and this information is not like Creationdate or Page.

  • 1

    use Pdfparser and with the generated text use the function strripos PHP to find the word or chunk you want, try to manipulate it this way.

  • If there’s no other way, I’ll do it like this... I am trying to do this so that I do not take irrelevant information, because the PDF’s that I will manipulate do not always understand all letters. My goal is to extract the PDF data and make the insertion in the database so that the insertion of documents in my GED system is automatic. I thank you for your help and your good will!

  • @Did Jordanbragon manage to make it work this way? Tehno the need for something similar.

  • @Did Jordanbragon solve it? I’m in need of something similar.

  • @Williancoqueiro I could not!

Show 3 more comments

Browser other questions tagged

You are not signed in. Login or sign up in order to post.