java - Search for text in PDF files -


i have list of words (86 words), , pdf files. search words pdf files , return values ​​tell me if exist.

during research solutions in tutorials meet 2 problems:

  1. is i'm forced convert pdf file file ??

  2. what simple bibilotheque allows me realize problem, because i'm stuck there's lot of examples (pdfbox, appach lucense, itext, pdftron ....)

is i'm forced convert pdf file file

pdf file file. so, not have convert it. have able read it. can use 1 of available java pdf parsers (e.g. pdfbox mentioned).

what simple bibilotheque allows me realize problem...

as far have 86 words , 1 document not need indexing tool lucene. if want build application supports different targets , different documents (especially if need real free text search) need lucene (or solr) perform indexing of documents first , performing search using index.


Comments

Popular posts from this blog

php - cannot display multiple markers in google maps v3 from traceroute result -

c# - DetailsView in ASP.Net - How to add another column on the side/add a control in each row? -

javascript - firefox memory leak -