You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Nick Burch (Created) (JIRA)" <ji...@apache.org> on 2012/01/28 19:01:10 UTC

[jira] [Created] (TIKA-852) Quicktime / MP4 Metadata Parser

Quicktime / MP4 Metadata Parser
-------------------------------

                 Key: TIKA-852
                 URL: https://issues.apache.org/jira/browse/TIKA-852
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 1.0
            Reporter: Nick Burch
            Assignee: Nick Burch


>From the investigations done for TIKA-851, it looks like a parser for the Quicktime format, and MP4 (which is an extension to it) shouldn't be too hard to do. This should be able to output some of the media metadata, such duration, dimensions, and MP4 audio tags

Information resources on the format are linked from TIKA-851

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TIKA-852) Quicktime / MP4 Metadata Parser

Posted by "Sebastian Annies (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sebastian Annies updated TIKA-852:
----------------------------------

    Attachment: TIKA-852.patch

Hi, 
sorry for not responding/fixing Issue 30 in mp4parser. You have nagged me more ;) 
Please find attached a patch for this issue. It updates to isoparser-1.0-RC-1 and make the test work. 

Thank you for using my parser!
Regards,
Sebastian

PS.: with RC-1 the parser should be substantially faster

                
> Quicktime / MP4 Metadata Parser
> -------------------------------
>
>                 Key: TIKA-852
>                 URL: https://issues.apache.org/jira/browse/TIKA-852
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.0
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>         Attachments: TIKA-852.patch
>
>
> From the investigations done for TIKA-851, it looks like a parser for the Quicktime format, and MP4 (which is an extension to it) shouldn't be too hard to do. This should be able to output some of the media metadata, such duration, dimensions, and MP4 audio tags
> Information resources on the format are linked from TIKA-851

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-852) Quicktime / MP4 Metadata Parser

Posted by "Nick Burch (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13264349#comment-13264349 ] 

Nick Burch commented on TIKA-852:
---------------------------------

Thanks Sebastian, I've committed your patch in r1331788.

I'll leave the bug open for now, as we're not doing all the XMPDM keys we probably should do, and we're not doing video yet


                
> Quicktime / MP4 Metadata Parser
> -------------------------------
>
>                 Key: TIKA-852
>                 URL: https://issues.apache.org/jira/browse/TIKA-852
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.0
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>         Attachments: TIKA-852.patch
>
>
> From the investigations done for TIKA-851, it looks like a parser for the Quicktime format, and MP4 (which is an extension to it) shouldn't be too hard to do. This should be able to output some of the media metadata, such duration, dimensions, and MP4 audio tags
> Information resources on the format are linked from TIKA-851

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-852) Quicktime / MP4 Metadata Parser

Posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13195628#comment-13195628 ] 

Nick Burch commented on TIKA-852:
---------------------------------

Initial parser added in r1237162. Currently, it largely just handles Apple M4A audio files. There are probably a couple more XMPDM keys we'd want to fully support these, and quite a few more we'd need before we could do video. (We'd also need some test files, and to decide how best to handle video files, see an earlier dev@tika discussion for details of the problems)

For now, the unit test is disabled, as it fails when asserts are turned on. I think this is a mp4parser bug though, so I've raised <http://code.google.com/p/mp4parser/issues/detail?id=30> for it
                
> Quicktime / MP4 Metadata Parser
> -------------------------------
>
>                 Key: TIKA-852
>                 URL: https://issues.apache.org/jira/browse/TIKA-852
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.0
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>
> From the investigations done for TIKA-851, it looks like a parser for the Quicktime format, and MP4 (which is an extension to it) shouldn't be too hard to do. This should be able to output some of the media metadata, such duration, dimensions, and MP4 audio tags
> Information resources on the format are linked from TIKA-851

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Issue Comment Edited] (TIKA-852) Quicktime / MP4 Metadata Parser

Posted by "Sebastian Annies (Issue Comment Edited) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243198#comment-13243198 ] 

Sebastian Annies edited comment on TIKA-852 at 4/4/12 9:11 PM:
---------------------------------------------------------------

Hi, 
sorry for not responding/fixing Issue 30 in mp4parser. You should have nagged me more ;) 
Please find attached a patch for this issue. It updates to isoparser-1.0-RC-1 and make the test work. 

Thank you for using my parser!
Regards,
Sebastian

PS.: with RC-1 the parser should be substantially faster

                
      was (Author: sannies):
    Hi, 
sorry for not responding/fixing Issue 30 in mp4parser. You have nagged me more ;) 
Please find attached a patch for this issue. It updates to isoparser-1.0-RC-1 and make the test work. 

Thank you for using my parser!
Regards,
Sebastian

PS.: with RC-1 the parser should be substantially faster

                  
> Quicktime / MP4 Metadata Parser
> -------------------------------
>
>                 Key: TIKA-852
>                 URL: https://issues.apache.org/jira/browse/TIKA-852
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.0
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>         Attachments: TIKA-852.patch
>
>
> From the investigations done for TIKA-851, it looks like a parser for the Quicktime format, and MP4 (which is an extension to it) shouldn't be too hard to do. This should be able to output some of the media metadata, such duration, dimensions, and MP4 audio tags
> Information resources on the format are linked from TIKA-851

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-852) Quicktime / MP4 Metadata Parser

Posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13195598#comment-13195598 ] 

Nick Burch commented on TIKA-852:
---------------------------------

It looks like the Apache Licensed MP4Parser <http://code.google.com/p/mp4parser/> already provides much of this, so it looks simplest to use that rather than writing our own MP4 reader
                
> Quicktime / MP4 Metadata Parser
> -------------------------------
>
>                 Key: TIKA-852
>                 URL: https://issues.apache.org/jira/browse/TIKA-852
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.0
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>
> From the investigations done for TIKA-851, it looks like a parser for the Quicktime format, and MP4 (which is an extension to it) shouldn't be too hard to do. This should be able to output some of the media metadata, such duration, dimensions, and MP4 audio tags
> Information resources on the format are linked from TIKA-851

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira