You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Chris A. Mattmann (Commented) (JIRA)" <ji...@apache.org> on 2012/02/16 18:19:01 UTC

[jira] [Commented] (TIKA-862) JPSS HDF5 files not being detected appropriately

    [ https://issues.apache.org/jira/browse/TIKA-862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13209513#comment-13209513 ] 

Chris A. Mattmann commented on TIKA-862:
----------------------------------------

Hi Richard, can you:

* attach that sample HDF 5 file to JIRA here? Or point me to a URL where I can get it?
* let me know what version of Tika you are using -- looks like 1.0 -- can you confirm that?

I'll take the above and then investigate what we're seeing and get right back to you!

                
> JPSS HDF5 files not being detected appropriately
> ------------------------------------------------
>
>                 Key: TIKA-862
>                 URL: https://issues.apache.org/jira/browse/TIKA-862
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Richard Yu
>            Assignee: Chris A. Mattmann
>
> As commented in TIKA-614, JPSS HDF 5 files are not being properly detected by Tika. See this:
> from [~minfing]:
> {quote}
> We were trying to extract metadata from our h5 file (i.e. with JPSS extension). We ran the following command line:
> {noformat}
> [ryu@localhost hdf5extractor]$ java -jar tika-app-1.0.jar -m \
> > /usr/local/staging/products/h5/SVM13_npp_d20120122_t1659139_e1700381_b01225_c20120123000312144174_noaa_ops.h5
> Content-Encoding: windows-1252
> Content-Length: 22187952
> Content-Type: text/plain
> resourceName: SVM13_npp_d20120122_t1659139_e1700381_b01225_c20120123000312144174_noaa_ops.h5
> [ryu@localhost hdf5extractor]$
> {noformat}
> We noticed that the content type in text/plain and only 4 lines of output (i.e. we expected al lots of metadata).
> Let me know if more information is needed. Thanks!
> Richard
> {quote}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira