Already registered? Log in now to personalize your experience!
You have tons of Open Office, Microsoft Office and PDF documents, or even images... and you would like to be able to search their meta data and the content itself. How can this be done? Above all since the announcement that Google Search Appliance would be phased out.
In this session, David will explain how Apache Tika can provide this service and how to combine this amazing library with Elasticsearch:
* Elasticsearch ingest-attachment plugin: https://www.elastic.co/guide/en/elasticsearch/plugins/current/ingest-attachment.html
* FSCrawler : https://github.com/dadoonet/fscrawler)
* Workplace Search connector for FSCrawler in order to have a powerful rack-based user interface for your documents: https://www.elastic.co/workplace-search
Index your office documents with FSCrawler and the Elastic suite