java - Search for text in PDF files -


I have a list of words (86 words) and some PDF files. I need to search for these words in the PDF files and return values that tell me whether each word exists.

While researching solutions in tutorials, I ran into two problems:

  1. Am I forced to convert the PDF file to another file format?

  2. Which simple library would let me solve this problem? I'm stuck because there are a lot of examples (PDFBox, Apache Lucene, iText, PDFTron, ...).

Am I forced to convert the PDF file to another file format?

A PDF file is a file, so you do not have to convert it. You only have to be able to read it. You can use one of the available Java PDF parsers (e.g. PDFBox, which you mentioned).

Which simple library would let me solve this problem...

As long as you have 86 words and 1 document, you do not need an indexing tool like Lucene. If you want to build an application that supports different targets and different documents (especially if you need real free-text search), you need Lucene (or Solr) to index the documents first and then perform the search using the index.
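
For the small case (a short word list and one document), a plain in-memory scan is usually enough. The following is a minimal sketch, not a definitive implementation. It assumes the text has already been extracted from the PDF into a String (for example with PDFBox's PDFTextStripper.getText), and the class name WordSearch, the method findWords, and the sample text are hypothetical names chosen for illustration. Matching is case-insensitive and on whole words only, which is also an assumption about what "exists" should mean.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

class WordSearch {

    // For each search word, reports whether it occurs in the text
    // as a whole word, ignoring case. (Hypothetical helper.)
    static Map<String, Boolean> findWords(String text, List<String> words) {
        // Tokenize once: split on every run of characters that are
        // neither letters nor digits, then keep the distinct tokens.
        Set<String> tokens = new HashSet<>(Arrays.asList(
                text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")));

        // LinkedHashMap keeps the results in the order of the word list.
        Map<String, Boolean> result = new LinkedHashMap<>();
        for (String word : words) {
            result.put(word, tokens.contains(word.toLowerCase(Locale.ROOT)));
        }
        return result;
    }

    public static void main(String[] args) {
        // Sample text standing in for the text extracted from a PDF.
        String text = "Invoice total: 42 EUR. Payment due in 30 days.";
        List<String> words = List.of("invoice", "payment", "refund");

        findWords(text, words)
                .forEach((word, found) -> System.out.println(word + " -> " + found));
    }
}
```

Because the tokens are stored in a set, each of the 86 lookups is a constant-time check. Multi-word phrases would not match with this approach; those would need a substring or regex search over the full text instead.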

