Posted to dev@tika.apache.org by Nandan Padar Chandrashekar <na...@usc.edu> on 2016/03/02 09:19:42 UTC

Need suggestion on file type .HFA to be added Tika.

Hi All,

I have identified a file format, HFA (Hierarchical File Architecture), which
Tika does not presently detect.

File format details:

Extension: *.hfa
Header tag contains the string EHFA_HEADER_TAG

Links:

1. ftp://ftp.ecn.purdue.edu/jshan/86/help/html/appendices/hfa_object_directory.htm

2. ftp://ftp.ecn.purdue.edu/jshan/86/help/html/appendices/Ehfa_HeaderTag.htm

Should this be treated as a custom MIME type or a standard MIME type?

I would also like a suggestion for the content type (MIME type) of this file format.

Regards
Nandan Padar Chandrashekar

Re: Need suggestion on file type .HFA to be added Tika.

Posted by Nick Burch <ap...@gagravarr.org>.
On Wed, 2 Mar 2016, Nandan Padar Chandrashekar wrote:
> Identified (Hierarchical File Architecture) HFA file format which is not
> presently being identified through Tika.
>
> extension : *.hfa
> Header tag contains string  EHFA_HEADER_TAG

Looks fine for adding to Tika to me

> Should this be considered as custom mime type or standard mime type. ?

As it's a common, well-known file type, it should be a standard one. It'd
really only need to be a custom one if it was only used in your lab /
school / company and nowhere else

> Need suggestion for content type(mime-type type) of this file format.

application/x-erdas-hfa seems to be used in at least some places online, 
so I'd suggest using that, at least for now
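
For illustration only, the detection rule discussed in this thread can be sketched outside Tika in plain Python. This is not Tika's implementation: the function name `detect_hfa` is hypothetical, and placing the tag at the very start of the file is an assumption, since the thread only says the header "contains" the string. The extension fallback is likewise a simplification of how a glob pattern might be used.

```python
from typing import Optional

# Assumption: the tag is at offset 0. The thread only states that the
# header tag contains this string.
HFA_MAGIC = b"EHFA_HEADER_TAG"

# MIME type suggested in the thread.
HFA_MIME = "application/x-erdas-hfa"


def detect_hfa(name: str, head: bytes) -> Optional[str]:
    """Return HFA_MIME if the file looks like HFA, otherwise None.

    name: the file name, used only for the *.hfa extension check.
    head: the first bytes of the file (at least len(HFA_MAGIC) of them).
    """
    # A content match is the stronger signal, so check it first.
    if head.startswith(HFA_MAGIC):
        return HFA_MIME
    # Fall back to the file extension, compared case-insensitively.
    if name.lower().endswith(".hfa"):
        return HFA_MIME
    return None
```

A caller would read the first few bytes of a file in binary mode and pass them in together with the file name.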

Nick

Re: Need suggestion on file type .HFA to be added Tika.

Posted by Nandan Padar Chandrashekar <na...@usc.edu>.
Thanks Nick and Prof Chris.

I will update tika-mimetypes.xml accordingly.

Regards
Nandan

On Wed, Mar 2, 2016 at 8:32 PM, Mattmann, Chris A (3980) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> I agree with Nick’s replies here

Re: Need suggestion on file type .HFA to be added Tika.

Posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov>.
I agree with Nick’s replies here

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++




