java - Search for text in PDF files -


i have list of words (86 words), , pdf files. search words pdf files , return values ​​tell me if exist.

during research solutions in tutorials meet 2 problems:

  1. is i'm forced convert pdf file file ??

  2. what simple bibilotheque allows me realize problem, because i'm stuck there's lot of examples (pdfbox, appach lucense, itext, pdftron ....)

is i'm forced convert pdf file file

pdf file file. so, not have convert it. have able read it. can use 1 of available java pdf parsers (e.g. pdfbox mentioned).

what simple bibilotheque allows me realize problem...

as far have 86 words , 1 document not need indexing tool lucene. if want build application supports different targets , different documents (especially if need real free text search) need lucene (or solr) perform indexing of documents first , performing search using index.


Comments

Popular posts from this blog

css - Text drops down with smaller window -

c# - DetailsView in ASP.Net - How to add another column on the side/add a control in each row? -

ruby on rails - Authlogic - how to make a registration and don't log in the new account? -