You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2014/12/23 04:00:24 UTC

[jira] [Commented] (TIKA-1502) Mime magic for database file formats

    [ https://issues.apache.org/jira/browse/TIKA-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14256527#comment-14256527 ] 

Hudson commented on TIKA-1502:
------------------------------

SUCCESS: Integrated in tika-trunk-jdk1.6 #366 (See [https://builds.apache.org/job/tika-trunk-jdk1.6/366/])
Some test database files for TIKA-1502 (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1647473)
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_2.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_3.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_4.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_5.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testMYSQL.MYD
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testMYSQL.MYI
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testMYSQL.frm
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testSQLITE3.db


> Mime magic for database file formats
> ------------------------------------
>
>                 Key: TIKA-1502
>                 URL: https://issues.apache.org/jira/browse/TIKA-1502
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.6
>            Reporter: Nick Burch
>
> I noticed today that Tika can't detect a lot of common database formats, such as sqlite or Berkeley DB or MISAM
> The unix file utility got most of those, which makes me think that there's a sensible-ish header on most we can write some mime magic for
> It'd therefore be good to add mime entries, with magic where possible, for many of these common database file formats



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)