You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Nick Burch (Created) (JIRA)" <ji...@apache.org> on 2011/10/06 23:17:30 UTC

[jira] [Created] (TIKA-747) Ogg Vorbis and FLAC Parsers

Ogg Vorbis and FLAC Parsers
---------------------------

                 Key: TIKA-747
                 URL: https://issues.apache.org/jira/browse/TIKA-747
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 0.10
            Reporter: Nick Burch
            Assignee: Nick Burch
            Priority: Minor
             Fix For: 1.0


As mentioned on the list a few weeks back, I've now finished getting an Ogg Vorbis and FLAC parser working. The code is available at https://github.com/Gagravarr/VorbisJava

As I see it, there are three options for getting it into Tika:
* Roll a release of the whole project from GitHub, upload jars to Maven Central, and add a parser pom dependency
* Bring the parser code over to Tika, and release (+post to Central) just the core jar
* Bring the whole lot over to Apache (maybe to Tika, maybe elsewhere?)

This issue is to track the plan, and the inclusion of the parsers into Tika one way or another

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-747) Ogg Vorbis and FLAC Parsers

Posted by "Chris A. Mattmann (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243615#comment-13243615 ] 

Chris A. Mattmann commented on TIKA-747:
----------------------------------------

Probably because some component of it was finished, per Nick's comments above. However, the issue wasn't closed, or resolved, so as part of my normal pruning, I pushed it out to 1.2.
                
> Ogg Vorbis and FLAC Parsers
> ---------------------------
>
>                 Key: TIKA-747
>                 URL: https://issues.apache.org/jira/browse/TIKA-747
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.2
>
>
> As mentioned on the list a few weeks back, I've now finished getting an Ogg Vorbis and FLAC parser working. The code is available at https://github.com/Gagravarr/VorbisJava
> As I see it, there are three options for getting it into Tika:
> * Roll a release of the whole project from GitHub, upload jars to Maven Central, and add a parser pom dependency
> * Bring the parser code over to Tika, and release (+post to Central) just the core jar
> * Bring the whole lot over to Apache (maybe to Tika, maybe elsewhere?)
> This issue is to track the plan, and the inclusion of the parsers into Tika one way or another

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-747) Ogg Vorbis and FLAC Parsers

Posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13204630#comment-13204630 ] 

Nick Burch commented on TIKA-747:
---------------------------------

Getting the central sync to work turned out to be much more painful than expected, not least owing to bugs in both Nexus and the SKS KeyServer, but at last the 0.1 release has made it!

I've enabled the dependency to the Vorbis Java Tika Plugin in r1242387, which pulls in the Tika plugin and dependencies. I've also added an integration test that checks it all ones

The only remaining issue is with tika-app. Unfortunately, the current inlining setup takes the first occurrence of a file, and ignores subsequent ones. This means that the first parsers service file is taken, the Tika one, and the Vorbis Parser one is ignored.

Wel'll need someone who knows how that maven plugin works to fix this, by having the service files appended together, rather than using the "first one wins" approach which is ignoring the additional files.
                
> Ogg Vorbis and FLAC Parsers
> ---------------------------
>
>                 Key: TIKA-747
>                 URL: https://issues.apache.org/jira/browse/TIKA-747
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.1
>
>
> As mentioned on the list a few weeks back, I've now finished getting an Ogg Vorbis and FLAC parser working. The code is available at https://github.com/Gagravarr/VorbisJava
> As I see it, there are three options for getting it into Tika:
> * Roll a release of the whole project from GitHub, upload jars to Maven Central, and add a parser pom dependency
> * Bring the parser code over to Tika, and release (+post to Central) just the core jar
> * Bring the whole lot over to Apache (maybe to Tika, maybe elsewhere?)
> This issue is to track the plan, and the inclusion of the parsers into Tika one way or another

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-747) Ogg Vorbis and FLAC Parsers

Posted by "Jan Høydahl (Commented JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13243602#comment-13243602 ] 

Jan Høydahl commented on TIKA-747:
----------------------------------

Why is this issue listed as included in the Tika1.1 release?
                
> Ogg Vorbis and FLAC Parsers
> ---------------------------
>
>                 Key: TIKA-747
>                 URL: https://issues.apache.org/jira/browse/TIKA-747
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.2
>
>
> As mentioned on the list a few weeks back, I've now finished getting an Ogg Vorbis and FLAC parser working. The code is available at https://github.com/Gagravarr/VorbisJava
> As I see it, there are three options for getting it into Tika:
> * Roll a release of the whole project from GitHub, upload jars to Maven Central, and add a parser pom dependency
> * Bring the parser code over to Tika, and release (+post to Central) just the core jar
> * Bring the whole lot over to Apache (maybe to Tika, maybe elsewhere?)
> This issue is to track the plan, and the inclusion of the parsers into Tika one way or another

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

       

[jira] [Resolved] (TIKA-747) Ogg Vorbis and FLAC Parsers

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-747.
--------------------------------

    Resolution: Fixed

I fixed the tika-app issue in revision 1355741 by switching to the shade plugin for merging all dependencies into the single runnable jar. The shade plugin supports automatic merging of META-INF/services/ files.
                
> Ogg Vorbis and FLAC Parsers
> ---------------------------
>
>                 Key: TIKA-747
>                 URL: https://issues.apache.org/jira/browse/TIKA-747
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.2
>
>
> As mentioned on the list a few weeks back, I've now finished getting an Ogg Vorbis and FLAC parser working. The code is available at https://github.com/Gagravarr/VorbisJava
> As I see it, there are three options for getting it into Tika:
> * Roll a release of the whole project from GitHub, upload jars to Maven Central, and add a parser pom dependency
> * Bring the parser code over to Tika, and release (+post to Central) just the core jar
> * Bring the whole lot over to Apache (maybe to Tika, maybe elsewhere?)
> This issue is to track the plan, and the inclusion of the parsers into Tika one way or another

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TIKA-747) Ogg Vorbis and FLAC Parsers

Posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated TIKA-747:
-----------------------------------

    Fix Version/s:     (was: 1.1)
                   1.2

- push out to 1.2
                
> Ogg Vorbis and FLAC Parsers
> ---------------------------
>
>                 Key: TIKA-747
>                 URL: https://issues.apache.org/jira/browse/TIKA-747
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.2
>
>
> As mentioned on the list a few weeks back, I've now finished getting an Ogg Vorbis and FLAC parser working. The code is available at https://github.com/Gagravarr/VorbisJava
> As I see it, there are three options for getting it into Tika:
> * Roll a release of the whole project from GitHub, upload jars to Maven Central, and add a parser pom dependency
> * Bring the parser code over to Tika, and release (+post to Central) just the core jar
> * Bring the whole lot over to Apache (maybe to Tika, maybe elsewhere?)
> This issue is to track the plan, and the inclusion of the parsers into Tika one way or another

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TIKA-747) Ogg Vorbis and FLAC Parsers

Posted by "Chris A. Mattmann (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated TIKA-747:
-----------------------------------

    Fix Version/s:     (was: 1.0)
                   1.1

- push out to 1.1: prep for 1.0.
                
> Ogg Vorbis and FLAC Parsers
> ---------------------------
>
>                 Key: TIKA-747
>                 URL: https://issues.apache.org/jira/browse/TIKA-747
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.1
>
>
> As mentioned on the list a few weeks back, I've now finished getting an Ogg Vorbis and FLAC parser working. The code is available at https://github.com/Gagravarr/VorbisJava
> As I see it, there are three options for getting it into Tika:
> * Roll a release of the whole project from GitHub, upload jars to Maven Central, and add a parser pom dependency
> * Bring the parser code over to Tika, and release (+post to Central) just the core jar
> * Bring the whole lot over to Apache (maybe to Tika, maybe elsewhere?)
> This issue is to track the plan, and the inclusion of the parsers into Tika one way or another

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-747) Ogg Vorbis and FLAC Parsers

Posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13193046#comment-13193046 ] 

Nick Burch commented on TIKA-747:
---------------------------------

Following discussions on the list, I've decided to release everything from Github.

I've started the release process for a 0.1 release, currently pending on <https://issues.sonatype.org/browse/OSSRH-2793> for syncing to Central. Once that's sorted, I can add the parser dependency to Tika, along with some quick integration tests
                
> Ogg Vorbis and FLAC Parsers
> ---------------------------
>
>                 Key: TIKA-747
>                 URL: https://issues.apache.org/jira/browse/TIKA-747
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.1
>
>
> As mentioned on the list a few weeks back, I've now finished getting an Ogg Vorbis and FLAC parser working. The code is available at https://github.com/Gagravarr/VorbisJava
> As I see it, there are three options for getting it into Tika:
> * Roll a release of the whole project from GitHub, upload jars to Maven Central, and add a parser pom dependency
> * Bring the parser code over to Tika, and release (+post to Central) just the core jar
> * Bring the whole lot over to Apache (maybe to Tika, maybe elsewhere?)
> This issue is to track the plan, and the inclusion of the parsers into Tika one way or another

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira