You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/03/12 18:37:34 UTC

[DISCUSS] Apache Tika as TLP

Hey Guys,

If you're on general@lucene then you've probably seen it mentioned it's
worth discussing Tika as an Apache TLP. I'd like to get the community view
on this since I think it likely makes sense:

* Nothing in Tika depends on any of the other Lucene project jars
* Tika provides functionality (mime type, language detection, parsing) that
are useable not just in the context of search
* We are moving forward as a community (0.7 release hopefully soon, I'll try
and cut a release within the next 2 weeks of that)

What does the community think? I can take the outcome of the discussions and
call a VOTE (if it makes sense), then take that to the PMC (in a couple
weeks after some of the other discussions have died down) and see what they
think too.

Cheers,
Chris


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++



Re: [DISCUSS] Apache Tika as TLP

Posted by Jérôme Charron <je...@gmail.com>.
Hi Chris,

I really think it makes sense since Tika is not a search component, but
really a toolkit for documents parsing and analysis.
I know it's not the vote time, but it's +1 for me.

Best regards

Jérôme


  [image: Webpulse] *Jérôme CHARRON*
Directeur Technique
0675742890
staragora.com <http://www.staragora.com.fr> -
shopreflex.fr<http://www.shopreflex.fr>


On Fri, Mar 12, 2010 at 18:37, Mattmann, Chris A (388J) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Hey Guys,
>
> If you're on general@lucene then you've probably seen it mentioned it's
> worth discussing Tika as an Apache TLP. I'd like to get the community view
> on this since I think it likely makes sense:
>
> * Nothing in Tika depends on any of the other Lucene project jars
> * Tika provides functionality (mime type, language detection, parsing) that
> are useable not just in the context of search
> * We are moving forward as a community (0.7 release hopefully soon, I'll
> try
> and cut a release within the next 2 weeks of that)
>
> What does the community think? I can take the outcome of the discussions
> and
> call a VOTE (if it makes sense), then take that to the PMC (in a couple
> weeks after some of the other discussions have died down) and see what they
> think too.
>
> Cheers,
> Chris
>
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: Chris.Mattmann@jpl.nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>


-- 
Jérôme Charron
Directeur Technique @ WebPulse
Tel: +33675742890 <= ** NEW **
eMail : jerome.charron@webpulse.fr
http://www.webpulse.fr/
http://www.shopreflex.com/
http://www.staragora.com/

Re: [DISCUSS] Apache Tika as TLP

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Jukka,

Thanks for your feedback! I would be honored to be the PMC chair (if no one
objects in the community). I'm also +1 for your proposal to include the
current committer set on the PMC, and to come up with our own lower entry
barriers for committership.

I'll work on the Board Resolution this evening and try and get something
posted for a DISCUSS thread by tomorrow...

Cheers,
Chris



On 3/16/10 7:09 AM, "Jukka Zitting" <ju...@gmail.com> wrote:

> Hi,
> 
> On Fri, Mar 12, 2010 at 6:37 PM, Mattmann, Chris A (388J)
> <ch...@jpl.nasa.gov> wrote:
>> If you're on general@lucene then you've probably seen it mentioned it's
>> worth discussing Tika as an Apache TLP. I'd like to get the community view
>> on this since I think it likely makes sense:
> 
> Agreed. The Tika dev community is (just) big enough for a TLP and I
> think having our own PMC would make a few things easier. For example
> I'd like to see us grant commitership to new contributors with lower
> entry barriers than what the Lucene PMC has traditionally maintained.
> It would also make easier to conduct release votes, as all Tika
> committers would have binding votes on the releases.
> 
> The main tasks in going for a TLP are deciding who'll be included in
> the new PMC and who'll be the PMC chair. My proposal is to make all
> Tika committers PMC members (now and also in the future when we add
> new committers) and to appoint Chris (if he wants) as the PMC chair
> since he's already been taking care of many of our board reports. I'm
> already chairing Jackrabbit, and would rather avoid another officer
> post.
> 
> BR,
> 
> Jukkka Zitting
> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++



Re: [DISCUSS] Apache Tika as TLP

Posted by Jukka Zitting <ju...@gmail.com>.
Hi,

On Fri, Mar 12, 2010 at 6:37 PM, Mattmann, Chris A (388J)
<ch...@jpl.nasa.gov> wrote:
> If you're on general@lucene then you've probably seen it mentioned it's
> worth discussing Tika as an Apache TLP. I'd like to get the community view
> on this since I think it likely makes sense:

Agreed. The Tika dev community is (just) big enough for a TLP and I
think having our own PMC would make a few things easier. For example
I'd like to see us grant commitership to new contributors with lower
entry barriers than what the Lucene PMC has traditionally maintained.
It would also make easier to conduct release votes, as all Tika
committers would have binding votes on the releases.

The main tasks in going for a TLP are deciding who'll be included in
the new PMC and who'll be the PMC chair. My proposal is to make all
Tika committers PMC members (now and also in the future when we add
new committers) and to appoint Chris (if he wants) as the PMC chair
since he's already been taking care of many of our board reports. I'm
already chairing Jackrabbit, and would rather avoid another officer
post.

BR,

Jukkka Zitting

Re: [DISCUSS] Apache Tika as TLP

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Grant,

Thanks. I'll get started on drafting a board resolution and putting together one more DISCUSS email thread once it is drafted before calling a community VOTE, and then bringing it to the PMC based on the actual tallies.

Appreciate the pointer!

Cheers,
Chris

On 3/16/10 6:29 AM, "Grant Ingersoll" <gs...@apache.org> wrote:

FYI, Mahout has a proposal in the works too, including a Board Resolution if you feel like copying one at the appropriate point in time.  See the Mahout as TLP thread over on Mahout Dev: http://www.lucidimagination.com/search/document/e5ca8dabd6f79a9d/discuss_mahout_tlp_board_resolution

-Grant

On Mar 13, 2010, at 7:10 AM, Grant Ingersoll wrote:

> Makes sense to me.  Tika is cranking these days and while it's great to have in Lucene, it makes more sense on it's own from an ASF organizational standpoint.
>
> -Grant
>
> On Mar 12, 2010, at 12:37 PM, Mattmann, Chris A (388J) wrote:
>
>> Hey Guys,
>>
>> If you're on general@lucene then you've probably seen it mentioned it's
>> worth discussing Tika as an Apache TLP. I'd like to get the community view
>> on this since I think it likely makes sense:
>>
>> * Nothing in Tika depends on any of the other Lucene project jars
>> * Tika provides functionality (mime type, language detection, parsing) that
>> are useable not just in the context of search
>> * We are moving forward as a community (0.7 release hopefully soon, I'll try
>> and cut a release within the next 2 weeks of that)
>>
>> What does the community think? I can take the outcome of the discussions and
>> call a VOTE (if it makes sense), then take that to the PMC (in a couple
>> weeks after some of the other discussions have died down) and see what they
>> think too.
>>
>> Cheers,
>> Chris
>>
>>
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: Chris.Mattmann@jpl.nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>
>





++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: Chris.Mattmann@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [DISCUSS] Apache Tika as TLP

Posted by Grant Ingersoll <gs...@apache.org>.
FYI, Mahout has a proposal in the works too, including a Board Resolution if you feel like copying one at the appropriate point in time.  See the Mahout as TLP thread over on Mahout Dev: http://www.lucidimagination.com/search/document/e5ca8dabd6f79a9d/discuss_mahout_tlp_board_resolution

-Grant

On Mar 13, 2010, at 7:10 AM, Grant Ingersoll wrote:

> Makes sense to me.  Tika is cranking these days and while it's great to have in Lucene, it makes more sense on it's own from an ASF organizational standpoint.
> 
> -Grant 
> 
> On Mar 12, 2010, at 12:37 PM, Mattmann, Chris A (388J) wrote:
> 
>> Hey Guys,
>> 
>> If you're on general@lucene then you've probably seen it mentioned it's
>> worth discussing Tika as an Apache TLP. I'd like to get the community view
>> on this since I think it likely makes sense:
>> 
>> * Nothing in Tika depends on any of the other Lucene project jars
>> * Tika provides functionality (mime type, language detection, parsing) that
>> are useable not just in the context of search
>> * We are moving forward as a community (0.7 release hopefully soon, I'll try
>> and cut a release within the next 2 weeks of that)
>> 
>> What does the community think? I can take the outcome of the discussions and
>> call a VOTE (if it makes sense), then take that to the PMC (in a couple
>> weeks after some of the other discussions have died down) and see what they
>> think too.
>> 
>> Cheers,
>> Chris
>> 
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: Chris.Mattmann@jpl.nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
>> 
> 
> 



Re: [DISCUSS] Apache Tika as TLP

Posted by Grant Ingersoll <gs...@apache.org>.
Makes sense to me.  Tika is cranking these days and while it's great to have in Lucene, it makes more sense on it's own from an ASF organizational standpoint.

-Grant 

On Mar 12, 2010, at 12:37 PM, Mattmann, Chris A (388J) wrote:

> Hey Guys,
> 
> If you're on general@lucene then you've probably seen it mentioned it's
> worth discussing Tika as an Apache TLP. I'd like to get the community view
> on this since I think it likely makes sense:
> 
> * Nothing in Tika depends on any of the other Lucene project jars
> * Tika provides functionality (mime type, language detection, parsing) that
> are useable not just in the context of search
> * We are moving forward as a community (0.7 release hopefully soon, I'll try
> and cut a release within the next 2 weeks of that)
> 
> What does the community think? I can take the outcome of the discussions and
> call a VOTE (if it makes sense), then take that to the PMC (in a couple
> weeks after some of the other discussions have died down) and see what they
> think too.
> 
> Cheers,
> Chris
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: Chris.Mattmann@jpl.nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 
> 



Re: [DISCUSS] Apache Tika as TLP

Posted by Julien Nioche <li...@gmail.com>.
I think it makes perfect sense. +1 from me too

Julien
-- 
DigitalPebble Ltd
http://www.digitalpebble.com

On 12 March 2010 19:16, Uwe Schindler <uw...@thetaphi.de> wrote:

> +1
>
> -----
> Uwe Schindler
> H.-H.-Meier-Allee 63, D-28213 Bremen
> http://www.thetaphi.de
> eMail: uwe@thetaphi.de
>
>
> > -----Original Message-----
> > From: Mattmann, Chris A (388J) [mailto:chris.a.mattmann@jpl.nasa.gov]
> > Sent: Friday, March 12, 2010 6:38 PM
> > To: tika-dev@lucene.apache.org
> > Subject: [DISCUSS] Apache Tika as TLP
> >
> > Hey Guys,
> >
> > If you're on general@lucene then you've probably seen it mentioned it's
> > worth discussing Tika as an Apache TLP. I'd like to get the community
> > view
> > on this since I think it likely makes sense:
> >
> > * Nothing in Tika depends on any of the other Lucene project jars
> > * Tika provides functionality (mime type, language detection, parsing)
> > that
> > are useable not just in the context of search
> > * We are moving forward as a community (0.7 release hopefully soon,
> > I'll try
> > and cut a release within the next 2 weeks of that)
> >
> > What does the community think? I can take the outcome of the
> > discussions and
> > call a VOTE (if it makes sense), then take that to the PMC (in a couple
> > weeks after some of the other discussions have died down) and see what
> > they
> > think too.
> >
> > Cheers,
> > Chris
> >
> >
> > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> > Chris Mattmann, Ph.D.
> > Senior Computer Scientist
> > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> > Office: 171-266B, Mailstop: 171-246
> > Email: Chris.Mattmann@jpl.nasa.gov
> > WWW:   http://sunset.usc.edu/~mattmann/
> > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> > Adjunct Assistant Professor, Computer Science Department
> > University of Southern California, Los Angeles, CA 90089 USA
> > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >
>
>
>

RE: [DISCUSS] Apache Tika as TLP

Posted by Uwe Schindler <uw...@thetaphi.de>.
+1

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: uwe@thetaphi.de


> -----Original Message-----
> From: Mattmann, Chris A (388J) [mailto:chris.a.mattmann@jpl.nasa.gov]
> Sent: Friday, March 12, 2010 6:38 PM
> To: tika-dev@lucene.apache.org
> Subject: [DISCUSS] Apache Tika as TLP
> 
> Hey Guys,
> 
> If you're on general@lucene then you've probably seen it mentioned it's
> worth discussing Tika as an Apache TLP. I'd like to get the community
> view
> on this since I think it likely makes sense:
> 
> * Nothing in Tika depends on any of the other Lucene project jars
> * Tika provides functionality (mime type, language detection, parsing)
> that
> are useable not just in the context of search
> * We are moving forward as a community (0.7 release hopefully soon,
> I'll try
> and cut a release within the next 2 weeks of that)
> 
> What does the community think? I can take the outcome of the
> discussions and
> call a VOTE (if it makes sense), then take that to the PMC (in a couple
> weeks after some of the other discussions have died down) and see what
> they
> think too.
> 
> Cheers,
> Chris
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: Chris.Mattmann@jpl.nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 



Re: [DISCUSS] Apache Tika as TLP

Posted by Benson Margulies <bi...@gmail.com>.
+1 from a sporadic patcher.

On Fri, Mar 12, 2010 at 3:57 PM, Michael McCandless <
lucene@mikemccandless.com> wrote:

> +1
>
> Mike
>
> On Fri, Mar 12, 2010 at 3:51 PM, Ken Krugler
> <kk...@transpac.com> wrote:
> > +1
> >
> > On Mar 12, 2010, at 9:37am, Mattmann, Chris A (388J) wrote:
> >
> >> Hey Guys,
> >>
> >> If you're on general@lucene then you've probably seen it mentioned it's
> >> worth discussing Tika as an Apache TLP. I'd like to get the community
> view
> >> on this since I think it likely makes sense:
> >>
> >> * Nothing in Tika depends on any of the other Lucene project jars
> >> * Tika provides functionality (mime type, language detection, parsing)
> >> that
> >> are useable not just in the context of search
> >> * We are moving forward as a community (0.7 release hopefully soon, I'll
> >> try
> >> and cut a release within the next 2 weeks of that)
> >>
> >> What does the community think? I can take the outcome of the discussions
> >> and
> >> call a VOTE (if it makes sense), then take that to the PMC (in a couple
> >> weeks after some of the other discussions have died down) and see what
> >> they
> >> think too.
> >>
> >> Cheers,
> >> Chris
> >>
> >>
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >> Chris Mattmann, Ph.D.
> >> Senior Computer Scientist
> >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> >> Office: 171-266B, Mailstop: 171-246
> >> Email: Chris.Mattmann@jpl.nasa.gov
> >> WWW:   http://sunset.usc.edu/~mattmann/
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >> Adjunct Assistant Professor, Computer Science Department
> >> University of Southern California, Los Angeles, CA 90089 USA
> >> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> >>
> >>
> >
> > --------------------------------------------
> > Ken Krugler
> > +1 530-210-6378
> > http://bixolabs.com
> > e l a s t i c   w e b   m i n i n g
> >
> >
> >
> >
> >
>

Re: [DISCUSS] Apache Tika as TLP

Posted by Michael McCandless <lu...@mikemccandless.com>.
+1

Mike

On Fri, Mar 12, 2010 at 3:51 PM, Ken Krugler
<kk...@transpac.com> wrote:
> +1
>
> On Mar 12, 2010, at 9:37am, Mattmann, Chris A (388J) wrote:
>
>> Hey Guys,
>>
>> If you're on general@lucene then you've probably seen it mentioned it's
>> worth discussing Tika as an Apache TLP. I'd like to get the community view
>> on this since I think it likely makes sense:
>>
>> * Nothing in Tika depends on any of the other Lucene project jars
>> * Tika provides functionality (mime type, language detection, parsing)
>> that
>> are useable not just in the context of search
>> * We are moving forward as a community (0.7 release hopefully soon, I'll
>> try
>> and cut a release within the next 2 weeks of that)
>>
>> What does the community think? I can take the outcome of the discussions
>> and
>> call a VOTE (if it makes sense), then take that to the PMC (in a couple
>> weeks after some of the other discussions have died down) and see what
>> they
>> think too.
>>
>> Cheers,
>> Chris
>>
>>
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: Chris.Mattmann@jpl.nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>
>>
>
> --------------------------------------------
> Ken Krugler
> +1 530-210-6378
> http://bixolabs.com
> e l a s t i c   w e b   m i n i n g
>
>
>
>
>

Re: [DISCUSS] Apache Tika as TLP

Posted by Ken Krugler <kk...@transpac.com>.
+1

On Mar 12, 2010, at 9:37am, Mattmann, Chris A (388J) wrote:

> Hey Guys,
>
> If you're on general@lucene then you've probably seen it mentioned  
> it's
> worth discussing Tika as an Apache TLP. I'd like to get the  
> community view
> on this since I think it likely makes sense:
>
> * Nothing in Tika depends on any of the other Lucene project jars
> * Tika provides functionality (mime type, language detection,  
> parsing) that
> are useable not just in the context of search
> * We are moving forward as a community (0.7 release hopefully soon,  
> I'll try
> and cut a release within the next 2 weeks of that)
>
> What does the community think? I can take the outcome of the  
> discussions and
> call a VOTE (if it makes sense), then take that to the PMC (in a  
> couple
> weeks after some of the other discussions have died down) and see  
> what they
> think too.
>
> Cheers,
> Chris
>
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: Chris.Mattmann@jpl.nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>

--------------------------------------------
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c   w e b   m i n i n g





Re: [DISCUSS] Apache Tika as TLP

Posted by Felix Meschberger <fm...@gmail.com>.
Sounds reasonable to me as an outsider, TIKA user and reader on this list.

Regards
Felix

On 12.03.2010 18:37, Mattmann, Chris A (388J) wrote:
> Hey Guys,
> 
> If you're on general@lucene then you've probably seen it mentioned it's
> worth discussing Tika as an Apache TLP. I'd like to get the community view
> on this since I think it likely makes sense:
> 
> * Nothing in Tika depends on any of the other Lucene project jars
> * Tika provides functionality (mime type, language detection, parsing) that
> are useable not just in the context of search
> * We are moving forward as a community (0.7 release hopefully soon, I'll try
> and cut a release within the next 2 weeks of that)
> 
> What does the community think? I can take the outcome of the discussions and
> call a VOTE (if it makes sense), then take that to the PMC (in a couple
> weeks after some of the other discussions have died down) and see what they
> think too.
> 
> Cheers,
> Chris
> 
> 
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: Chris.Mattmann@jpl.nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> 
> 
> 

Re: [DISCUSS] Apache Tika as TLP

Posted by Andrzej Bialecki <ab...@getopt.org>.
On 2010-03-12 18:37, Mattmann, Chris A (388J) wrote:
> Hey Guys,
>
> If you're on general@lucene then you've probably seen it mentioned it's
> worth discussing Tika as an Apache TLP. I'd like to get the community view
> on this since I think it likely makes sense:
>
> * Nothing in Tika depends on any of the other Lucene project jars
> * Tika provides functionality (mime type, language detection, parsing) that
> are useable not just in the context of search
> * We are moving forward as a community (0.7 release hopefully soon, I'll try
> and cut a release within the next 2 weeks of that)
>
> What does the community think? I can take the outcome of the discussions and
> call a VOTE (if it makes sense), then take that to the PMC (in a couple
> weeks after some of the other discussions have died down) and see what they
> think too.

Makes sense, +1.


-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Re: [DISCUSS] Apache Tika as TLP

Posted by Dave Meikle <lo...@gmail.com>.
Hi,

On 12 March 2010 17:37, Mattmann, Chris A (388J) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Hey Guys,
>
> If you're on general@lucene then you've probably seen it mentioned it's
> worth discussing Tika as an Apache TLP. I'd like to get the community view
> on this since I think it likely makes sense:
>
> * Nothing in Tika depends on any of the other Lucene project jars
> * Tika provides functionality (mime type, language detection, parsing) that
> are useable not just in the context of search
> * We are moving forward as a community (0.7 release hopefully soon, I'll
> try
> and cut a release within the next 2 weeks of that)
>
> What does the community think? I can take the outcome of the discussions
> and
> call a VOTE (if it makes sense), then take that to the PMC (in a couple
> weeks after some of the other discussions have died down) and see what they
> think too.
>
> Cheers,
> Chris
>

+1 - I know it is belated, but wanted to share my opinion.

Cheers,
Dave