Category "nutch2"

Nutch Issue: While crawling PDFs using nutch, PDF fetching properly but not Parsing

I am using nutch-2.3.1 with Hbase-0.98.8-hadoop2 and the crawl runs fine for HTML pages, but when trying to run the crawl for PDF URLs only some of them seems t