java - Search for text in PDF files -
i have list of words (86 words), , pdf files. search words pdf files , return values tell me if exist.
during research solutions in tutorials meet 2 problems:
is i'm forced convert pdf file file ??
what simple bibilotheque allows me realize problem, because i'm stuck there's lot of examples (pdfbox, appach lucense, itext, pdftron ....)
is i'm forced convert pdf file file
pdf file file. so, not have convert it. have able read it. can use 1 of available java pdf parsers (e.g. pdfbox mentioned).
what simple bibilotheque allows me realize problem...
as far have 86 words , 1 document not need indexing tool lucene. if want build application supports different targets , different documents (especially if need real free text search) need lucene (or solr) perform indexing of documents first , performing search using index.
Comments
Post a Comment