Is it possible to use Scrapy to crawl an entire site, following every link, in search of PDF files? It would be something like Apache Nutch. I searched, but everyone only uses XPath, and XPath does not work for me because I have to search across several sites, and writing a crawler for each site is humanly impossible.
Notes:
I have to download the PDF(s);
I have to pass several URLs to the crawler.
What language do you work with?
– André Lins
Good afternoon, André. I work with PHP, but I think Scrapy would be faster, since it is a tool built for crawling.
– mell system
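For what it's worth, the per-site XPath problem goes away if the crawler only looks at `<a href>` links, which every site has. In Scrapy that is roughly what `CrawlSpider` with a `LinkExtractor` does. Below is a minimal sketch of the same idea using only the Python standard library, so the logic is visible. The names `LinkCollector`, `split_links`, `crawl`, and the `fetch` callable are hypothetical, and the sketch assumes PDFs can be recognised by a `.pdf` suffix in the URL path, which will not hold for every site.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag, whatever the page layout."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def split_links(page_url, html):
    """Return (pdf_urls, page_urls) found in html, as absolute URLs.

    page_urls only keeps links on the same host as page_url,
    so the crawl does not wander off to other sites.
    """
    collector = LinkCollector()
    collector.feed(html)
    host = urlparse(page_url).netloc
    pdfs, pages = [], []
    for href in collector.hrefs:
        # Resolve relative links and drop any #fragment.
        url = urldefrag(urljoin(page_url, href)).url
        parts = urlparse(url)
        if parts.scheme not in ("http", "https"):
            continue  # skip mailto:, javascript:, etc.
        # Assumption: a PDF is any URL whose path ends in .pdf.
        if parts.path.lower().endswith(".pdf"):
            pdfs.append(url)
        elif parts.netloc == host:
            pages.append(url)
    return pdfs, pages


def crawl(start_urls, fetch, max_pages=100):
    """Breadth-first crawl from several start URLs; returns PDF URLs found.

    fetch is a hypothetical callable: url -> HTML text, or None on failure.
    max_pages caps the number of pages fetched in total.
    """
    seen = set(start_urls)
    queue = deque(start_urls)
    found = []
    visited = 0
    while queue and visited < max_pages:
        url = queue.popleft()
        visited += 1
        html = fetch(url)
        if html is None:
            continue
        pdfs, pages = split_links(url, html)
        for pdf in pdfs:
            if pdf not in seen:
                seen.add(pdf)
                found.append(pdf)
        for page in pages:
            if page not in seen:
                seen.add(page)
                queue.append(page)
    return found
```

`crawl` accepts a list of start URLs, which covers the "several URLs" requirement, and it returns the PDF addresses rather than downloading them. Saving each one is a separate step (for example with `urllib.request.urlretrieve`). A real `fetch` would wrap an HTTP client; it is passed in as a parameter so the link logic can be tried offline.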