Posted to dev@tika.apache.org by "Michael Pisula (JIRA)" <ji...@apache.org> on 2011/05/09 10:10:03 UTC

[jira] [Created] (TIKA-658) Support pcap file format

Support pcap file format
------------------------

                 Key: TIKA-658
                 URL: https://issues.apache.org/jira/browse/TIKA-658
             Project: Tika
          Issue Type: Task
          Components: mime
            Reporter: Michael Pisula
            Priority: Trivial


Currently the MIME checker does not recognize the pcap file format. As I need this recognition for our project, I added it to tika-mimetypes.xml:

  <!-- MIME Type for pcap file format, see http://www.iana.org/assignments/media-types/application/vnd.tcpdump.pcap -->
  <mime-type type="application/vnd.tcpdump.pcap">
    <magic priority="50">
      <match value="0xa1b2c3d4" type="big32" offset="0" />
      <match value="0xd4c3b2a1" type="big32" offset="0" />
    </magic>
    <glob pattern="*.pcap"/>
    <glob pattern="*.cap"/>
    <glob pattern="*.dmp"/>
  </mime-type>
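
As a rough illustration of what the two match rules above check (a standalone Python sketch, not Tika code; the name `looks_like_pcap` is made up for this example): the first four bytes of the file, read as a big-endian 32-bit integer, must equal one of the two listed values. The second value is the same magic number as it appears when the file was written with the opposite byte order.

```python
import struct

# The two values from the <match> rules, compared as big-endian 32-bit integers.
PCAP_MAGICS = {0xA1B2C3D4, 0xD4C3B2A1}

def looks_like_pcap(data: bytes) -> bool:
    """Return True if the first four bytes match either pcap magic number."""
    if len(data) < 4:
        return False  # too short to hold the magic number
    (magic,) = struct.unpack(">I", data[:4])  # ">I" = big-endian unsigned 32-bit
    return magic in PCAP_MAGICS

# A byte-swapped pcap header start, then a PNG signature for contrast.
print(looks_like_pcap(bytes.fromhex("d4c3b2a1") + b"\x02\x00\x04\x00"))  # True
print(looks_like_pcap(b"\x89PNG\r\n\x1a\n"))  # False
```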

*.cap is already used by image/x-raw-phaseone, so I had to comment it out to pass the tests. I'm sure there is a better way to deal with the conflict, but I did not have time to look into it.

With the configuration above, pcap recognition works just fine for me, so I wanted to contribute this in case anyone finds it useful.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (TIKA-658) Support pcap file format

Posted by "Nick Burch (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nick Burch resolved TIKA-658.
-----------------------------

       Resolution: Fixed
    Fix Version/s: 1.0


[jira] [Commented] (TIKA-658) Support pcap file format

Posted by "Nick Burch (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13031204#comment-13031204 ] 

Nick Burch commented on TIKA-658:
---------------------------------

Thanks for the patch, applied in r1101471. I decided that the .cap extension was more likely to be used for tcpdump captures, so I disabled it for the phaseone raw image type (which has another extension anyway).


[jira] [Commented] (TIKA-658) Support pcap file format

Posted by "Nick Burch (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030787#comment-13030787 ] 

Nick Burch commented on TIKA-658:
---------------------------------

We can only have one MIME type registered for a given extension, which is generally the most common one. Magic matching is generally the best way to detect a type, though, since it avoids these issues.
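
To illustrate the point (a hypothetical Python sketch, not how Tika is implemented; `GLOBS`, `MAGICS`, `detect`, and the `application/octet-stream` fallback are all choices made for this example): an extension table can map `.cap` to only one type, so the file name alone cannot tell a packet capture from a raw image, while a check on the leading bytes can.

```python
import os

PCAP = "application/vnd.tcpdump.pcap"

# One type per extension: a dict key cannot have two owners, so ".cap"
# has to be given to a single type.
GLOBS = {".pcap": PCAP, ".cap": PCAP, ".dmp": PCAP}

# Leading four bytes of a pcap file, in both byte orders.
MAGICS = {bytes.fromhex("a1b2c3d4"): PCAP, bytes.fromhex("d4c3b2a1"): PCAP}

def detect(name: str, data: bytes) -> str:
    """Prefer magic bytes; fall back to the file extension."""
    mime = MAGICS.get(data[:4])
    if mime is not None:
        return mime
    ext = os.path.splitext(name)[1].lower()
    return GLOBS.get(ext, "application/octet-stream")
```

With this sketch, `detect("trace.bin", bytes.fromhex("d4c3b2a1") + bytes(20))` is resolved by the magic bytes despite the unrelated extension, whereas `detect("photo.cap", b"not a capture")` falls back to the extension table and is reported as pcap even though the content does not look like one. That second case is the kind of conflict magic matching avoids.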
