Hello!
You'll need to send the file contents to Elasticsearch in base64 form,
and Elasticsearch will use Tika to extract data from the file.
However, in a typical case you would store not the whole content
of the binary file (it can be quite big) but rather a path to the
file, so that the application that queries Elasticsearch knows where
to look for the original file itself.
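
As a rough illustration of both options, here is a minimal Python sketch that
builds the JSON document you would send. The field names "file" and "path" and
the helper build_attachment_doc are made up for this example, not anything the
plugin prescribes; the field holding the base64 string would still need to be
mapped to the plugin's attachment type in your index mapping.

```python
import base64
import json
import os


def build_attachment_doc(path, include_content=True):
    """Build a document body for indexing a binary file.

    "path" and "file" are hypothetical field names chosen for this sketch.
    """
    # Always keep the location of the original file, so the application
    # querying Elasticsearch knows where to fetch it from.
    doc = {"path": os.path.abspath(path)}
    if include_content:
        # Base64-encode the raw bytes; this is the form in which the file
        # contents are sent so that Tika can extract the text.
        with open(path, "rb") as fh:
            doc["file"] = base64.b64encode(fh.read()).decode("ascii")
    return doc


def to_json(doc):
    # Serialize the document body; sending it to the index is left to
    # whichever HTTP client or Elasticsearch client you already use.
    return json.dumps(doc)
```

Calling build_attachment_doc("report.pdf", include_content=False) gives the
path-only variant, which keeps the document small when the files are large.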
--
Regards,
Rafał Kuć
Performance Monitoring * Log Analytics * Search Analytics
Solr & Elasticsearch Support *
http://sematext.com/

> Hi,
> If I am not wrong you are talking about
> https://github.com/elasticsearch/elasticsearch-mapper-attachments
> So with this I can index attachments (say, a PDF file), and they will be stored
> with base64 encoding. Does this plugin also make the text inside the PDF file
> searchable?
> If yes, what will the result be if I search for some keyword in an attachment:
> will it return the proper text data or the base64-encoded data?
> ~Prashant
> --
> View this message in context:
> http://elasticsearch-users.115913.n3.nabble.com/Can-we-perform-the-text-search-present-in-the-images-or-pdf-files-through-elasticsearch-tp4054367p4054371.html
> Sent from the ElasticSearch Users mailing list archive at Nabble.com.