I am using nutch-2.3.1 with HBase-0.98.8-hadoop2. The crawl runs fine for HTML pages, but when I run it for PDF URLs, only some of them seem t