I’ve blogged in the past about indexing entire folders of documents with Solr and Tika using the Data Import Handler. This approach has pros and cons. On the good side, once you understand the basics, getting everything set up and running takes a couple of hours at most; on the bad side, using a DIH […]

Continue reading about Index documents content with Solr and Tika

Previous posts in the series: Import folder of Documents with Apache Solr 4.0 and Tika; Highlight matched text inside documents indexed with Solr And Tika. Everything is up and running, but now the requirements have changed: documents can be in multiple languages (Italian and English in my scenario) and we want to do the simplest thing that could […]

Continue reading about Index a folder of multilanguage documents in Solr with Tika

If you are used to installing Solr in a Windows environment and you install a version later than 4.2.1 for the first time, you may have trouble getting your Solr server to start. The symptom: the service is shown as stopped in Tomcat Application Manager, and if you press start you get a simple error telling you […]

Continue reading about Installing Solr on Tomcat on windows, Error solr SEVERE: Error filterStart

After I configured Solr 4.3 on a virtual machine (side by side with a 4.0), it refused to start, and the only error I had in the Catalina log files was SEVERE: Error filterStart. This left me puzzled, but thanks to Alexandre and the exceptional Solr mailing list I was directed toward the solution. Solr 4.3 […]

Continue reading about Install Solr 4.3, pay attention to log libraries

I’ve already covered how to index documents with Solr and Tika, and in this article I’ll explain how you can not only search for documents that match your query, but also return a text extract showing where each document matches the query. To achieve this, you should store the full content of the […]

Continue reading about Hilight matched text inside documents indexed with Solr plus Tika