You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2014/12/23 04:00:24 UTC
[jira] [Commented] (TIKA-1502) Mime magic for database file formats
[ https://issues.apache.org/jira/browse/TIKA-1502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14256527#comment-14256527 ]
Hudson commented on TIKA-1502:
------------------------------
SUCCESS: Integrated in tika-trunk-jdk1.6 #366 (See [https://builds.apache.org/job/tika-trunk-jdk1.6/366/])
Some test database files for TIKA-1502 (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1647473)
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_2.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_3.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_4.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testBDB_5.db
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testMYSQL.MYD
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testMYSQL.MYI
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testMYSQL.frm
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testSQLITE3.db
> Mime magic for database file formats
> ------------------------------------
>
> Key: TIKA-1502
> URL: https://issues.apache.org/jira/browse/TIKA-1502
> Project: Tika
> Issue Type: Improvement
> Components: mime
> Affects Versions: 1.6
> Reporter: Nick Burch
>
> I noticed today that Tika can't detect a lot of common database formats, such as sqlite or Berkeley DB or MISAM
> The unix file utility got most of those, which makes me think that there's a sensible-ish header on most we can write some mime magic for
> It'd therefore be good to add mime entries, with magic where possible, for many of these common database file formats
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)