Posted to dev@tika.apache.org by "Arky (Jira)" <ji...@apache.org> on 2021/03/30 20:27:00 UTC

[jira] [Commented] (TIKA-1993) Image Recognition with Tika

    [ https://issues.apache.org/jira/browse/TIKA-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17311791#comment-17311791 ] 

Arky commented on TIKA-1993:
----------------------------

We have a downstream use case for this. We'd like to help ICIJ Datashare users classify images and extract information accordingly. This would also be helpful for redacting datasets before public release.

 

https://github.com/ICIJ/datashare/issues/784

> Image Recognition with Tika 
> ----------------------------
>
>                 Key: TIKA-1993
>                 URL: https://issues.apache.org/jira/browse/TIKA-1993
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Thamme Gowda
>            Assignee: Chris A. Mattmann
>            Priority: Major
>             Fix For: 1.14
>
>
> Create "ImageRecognitionParser" which can have pluggable implementation for core recognition logic.
> As the name suggests, this parser should detect objects in the images, and support many implementations + models (similar to what NamedEntityParser did for text).
> Supply a default implementation based on TensorFlow with the current state-of-the-art model [1].
> Links:
> [1] https://www.tensorflow.org/versions/r0.8/tutorials/image_recognition/index.html#usage-with-python-api



--
This message was sent by Atlassian Jira
(v8.3.4#803005)