You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Kenneth William Krugler (Jira)" <ji...@apache.org> on 2020/04/30 14:38:00 UTC

[jira] [Commented] (TIKA-3096) detect image in any document

    [ https://issues.apache.org/jira/browse/TIKA-3096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17096592#comment-17096592 ] 

Kenneth William Krugler commented on TIKA-3096:
-----------------------------------------------

Hi [~suchendra] - please ask usage questions on the Tika user mailing list, thanks! You can sign up using steps at [http://tika.apache.org/mail-lists.html.|http://tika.apache.org/mail-lists.html]

> detect image in any document
> ----------------------------
>
>                 Key: TIKA-3096
>                 URL: https://issues.apache.org/jira/browse/TIKA-3096
>             Project: Tika
>          Issue Type: Bug
>          Components: documentation, example, parser
>    Affects Versions: 1.23
>            Reporter: suchendra
>            Priority: Minor
>
> How do I detect whether a document contains an image or not ?
> val parser = new AutoDetectParser()
>  val handler = new ToXMLContentHandler()
>  parser.parse(tikaIs, handler, new Metadata, new ParseContext)
>  println("File Content:" + handler.toString)
>  
> I tried using HTMLHandler and based on existence of img tag, considered file contains image. Is there any better way to achieve this? 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)