I am trying to extract information from PDF files to populate a table without having to read each PDF myself, but I can't find any references that explain how to do this.
I need, for example, to discover the authors and date of publication of this article:
I would appreciate suggestions for packages or functions in Python or R.
Note: I am already able to extract the text from the PDF. What I do not know is how to locate the information I need within that text, given that I do not know in advance the exact string to search for.
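To make the goal concrete, here is a minimal sketch of the kind of search I have in mind, assuming the text is already available as a Python string. The function name `find_candidates` and both patterns are hypothetical illustrations, not part of any library. They only cover a DOI and a few common English date formats, and they do not attempt to find authors, which is the harder part.

```python
import re

# Illustrative heuristics only (assumption: English month names, common layouts).
MONTHS = ("January|February|March|April|May|June|July|"
          "August|September|October|November|December")

# Matches e.g. "12 March 2019", "March 12, 2019" or "2019-03-12".
DATE_RE = re.compile(
    rf"\b(?:\d{{1,2}}\s+(?:{MONTHS})\s+\d{{4}}"
    rf"|(?:{MONTHS})\s+\d{{1,2}},\s+\d{{4}}"
    rf"|\d{{4}}-\d{{2}}-\d{{2}})\b"
)

# Loose DOI pattern: "10.", a 4-9 digit prefix, a slash, then non-space characters.
DOI_RE = re.compile(r"\b10\.\d{4,9}/[^\s\"<>]+")


def find_candidates(text):
    """Return candidate dates and the first DOI-like string found in text."""
    doi_match = DOI_RE.search(text)
    # Strip trailing punctuation that usually belongs to the sentence, not the DOI.
    doi = doi_match.group(0).rstrip(".,;") if doi_match else None
    return {"dates": DATE_RE.findall(text), "doi": doi}


if __name__ == "__main__":
    sample = "Some Title\nPublished: 12 March 2019\ndoi: 10.1234/abc.567."
    print(find_candidates(sample))
```

This returns candidates rather than a single answer, since a page can contain several dates (received, accepted, published) and something still has to decide which one is the publication date.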