Posted to dev@tika.apache.org by "Antoni Mylka (Closed) (JIRA)" <ji...@apache.org> on 2011/12/19 12:29:30 UTC

[jira] [Closed] (TIKA-813) Webarchive detection.

     [ https://issues.apache.org/jira/browse/TIKA-813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Antoni Mylka closed TIKA-813.
-----------------------------

       Resolution: Fixed
    Fix Version/s: 1.1

Committed the magics and the unit tests in r1220696. Thanks for the example file!
                
> Webarchive detection.
> ---------------------
>
>                 Key: TIKA-813
>                 URL: https://issues.apache.org/jira/browse/TIKA-813
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.1
>            Reporter: Antoni Mylka
>             Fix For: 1.1
>
>         Attachments: Apache_Tika.webarchive, testWEBARCHIVE.webarchive, tika-813.patch
>
>
> I'd like to be able to detect .webarchive files. They are a special case of the Apple binary property list format. They are generated by the Safari browser and contain all the files that comprise a web page within a single container file.
> Can anyone supply an example file? All the ones I have are confidential and I don't have a Mac myself.
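
The detection idea described above can be sketched roughly as follows. This is only an illustration in Python, not the magic that was committed to Tika: it assumes that a binary property list starts with the bytes "bplist00" and that a webarchive's top-level dictionary carries a "WebMainResource" key. The function name looks_like_webarchive is hypothetical.

```python
import plistlib

# Assumption: binary property lists begin with this 8-byte signature.
BPLIST_MAGIC = b"bplist00"

# Assumption: Safari webarchives store the main page under this top-level key.
WEBARCHIVE_KEY = "WebMainResource"


def looks_like_webarchive(data: bytes) -> bool:
    """Return True if data parses as a binary plist with a WebMainResource key."""
    # Cheap check first: reject anything without the binary plist signature.
    if not data.startswith(BPLIST_MAGIC):
        return False
    try:
        root = plistlib.loads(data, fmt=plistlib.FMT_BINARY)
    except Exception:
        # Truncated or corrupt plists are treated as "not a webarchive".
        return False
    return isinstance(root, dict) and WEBARCHIVE_KEY in root
```

A magic-based detector would more likely look for the signature and the key name as raw bytes near the start of the file instead of parsing the whole plist; full parsing is used here only to keep the sketch short and unambiguous.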

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira