You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Rallavagu <ra...@gmail.com> on 2015/10/29 21:47:02 UTC

Solr for Pictures

In general, is there a built-in data handler to index pictures 
(essentially, EXIF and other data embedded in an image)? If not, what is 
the best practice to do so? Thanks.

Re: Solr for Pictures

Posted by Rallavagu <ra...@gmail.com>.
I was playing with exiftool (written in perl) and a custom java class 
built using metadata-extrator project 
(https://github.com/drewnoakes/metadata-extractor) and wondering if 
there is anything built into Solr or are there any best practices 
(general practices) to index pictures.

On 10/29/15 1:56 PM, Daniel Valdivia wrote:
> Some extra googling yield this Wiki from a integration between Tika and a EXIFTool
>
> https://wiki.apache.org/tika/EXIFToolParser <https://wiki.apache.org/tika/EXIFToolParser>
>
>> On Oct 29, 2015, at 1:48 PM, Daniel Valdivia <ho...@danielvaldivia.com> wrote:
>>
>> I think you can look into Tika for this https://tika.apache.org/ <https://tika.apache.org/>
>>
>> There’s handlers to integrate Tika and Solr, some context:
>>
>> https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika <https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika>
>>
>>
>>
>>> On Oct 29, 2015, at 1:47 PM, Rallavagu <rallavagu@gmail.com <ma...@gmail.com>> wrote:
>>>
>>> In general, is there a built-in data handler to index pictures (essentially, EXIF and other data embedded in an image)? If not, what is the best practice to do so? Thanks.
>>
>
>

Re: Solr for Pictures

Posted by Daniel Valdivia <ho...@danielvaldivia.com>.
Some extra googling yield this Wiki from a integration between Tika and a EXIFTool

https://wiki.apache.org/tika/EXIFToolParser <https://wiki.apache.org/tika/EXIFToolParser>

> On Oct 29, 2015, at 1:48 PM, Daniel Valdivia <ho...@danielvaldivia.com> wrote:
> 
> I think you can look into Tika for this https://tika.apache.org/ <https://tika.apache.org/>
> 
> There’s handlers to integrate Tika and Solr, some context:
> 
> https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika <https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika>
> 
> 
> 
>> On Oct 29, 2015, at 1:47 PM, Rallavagu <rallavagu@gmail.com <ma...@gmail.com>> wrote:
>> 
>> In general, is there a built-in data handler to index pictures (essentially, EXIF and other data embedded in an image)? If not, what is the best practice to do so? Thanks.
> 


Re: Solr for Pictures

Posted by Daniel Valdivia <ho...@danielvaldivia.com>.
I think you can look into Tika for this https://tika.apache.org/ <https://tika.apache.org/>

There’s handlers to integrate Tika and Solr, some context:

https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika <https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika>



> On Oct 29, 2015, at 1:47 PM, Rallavagu <ra...@gmail.com> wrote:
> 
> In general, is there a built-in data handler to index pictures (essentially, EXIF and other data embedded in an image)? If not, what is the best practice to do so? Thanks.