Posted to dev@nutch.apache.org by Piotr Kosiorowski <pk...@gmail.com> on 2009/01/20 11:56:04 UTC

Re: [jira] Created: (NUTCH-680) Update external jars to latest versions

pmd-ext contains the PMD (http://pmd.sourceforge.net/) libraries. I
committed them a long time ago in an attempt to bring some static
analysis tools to the nutch sources. There was a short discussion about
it and we all thought it was worth doing, but it never gained enough
momentum. There is a pmd target in the build.xml file that uses them;
they are not needed at runtime or for standard builds.
As nutch is now built using hudson, I think it would be worth
integrating pmd (checkstyle/findbugs/cobertura might also be
interesting), since hudson has very nice plugins for such tools. I use
it in my daily job and have found it valuable.
But as I am not an active committer now (I only try to follow the mailing
lists), I do not think it is my call. If everyone is
interested, I can try to look at the integration (but it will move forward
slowly: my youngest kid was born just 2 months ago and takes a lot
of attention).
Piotr

On Mon, Jan 19, 2009 at 3:02 PM, Doğacan Güney (JIRA) <ji...@apache.org> wrote:
> Update external jars to latest versions
> ---------------------------------------
>
>                 Key: NUTCH-680
>                 URL: https://issues.apache.org/jira/browse/NUTCH-680
>             Project: Nutch
>          Issue Type: Improvement
>            Reporter: Doğacan Güney
>            Assignee: Doğacan Güney
>            Priority: Minor
>             Fix For: 1.0.0
>
>
> This issue will be used to update external libraries nutch uses.
>
> These are the libraries that are outdated (upon a quick glance):
>
> nekohtml (1.9.9)
> lucene-highlighter (2.4.0)
> jdom (1.1)
> carrot2 - as mentioned in another issue
> jets3t - above
> icu4j (4.0.1)
> jakarta-oro (2.0.8)
>
> We should probably update tika to whatever the latest is as well before 1.0.
>
>
> Please add ones  I missed in comments.
>
> Also what exactly is pmd-ext? There is an extra jakarta-oro and jaxen there.....
>
> --
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
>
>

Re: [jira] Created: (NUTCH-680) Update external jars to latest versions

Posted by Piotr Kosiorowski <pk...@gmail.com>.
From what I know (the way we use hudson), hudson has plugins
for presenting tool results only; the tools need to be executed
during the build, so the libraries need to be included so that they
are available to ant.
Piotr

On Tue, Jan 20, 2009 at 9:40 PM, Doğacan Güney <do...@gmail.com> wrote:
> On Tue, Jan 20, 2009 at 10:35 PM, Otis Gospodnetic
> <og...@yahoo.com> wrote:
>> That I don't know...
>>
>> I don't see the jars here: http://svn.apache.org/viewvc/hadoop/core/trunk/lib/
>>
>> But who knows, maybe maven/ivy fetch them on demand.  I don't know.
>>
>
> Hmm, does 0.19 use ivy (0.19 also doesn't have pmd)?
>
> http://svn.apache.org/viewvc/hadoop/core/branches/branch-0.19/lib/
>

Re: [jira] Created: (NUTCH-680) Update external jars to latest versions

Posted by Doğacan Güney <do...@gmail.com>.
So, is it OK to remove the pmd-ext directory for now? It is not clear whether
we will need it once we have the infrastructure, but we don't have the
infrastructure now anyway :D. So I suggest that we remove it for now
(which trims 2.2MB), and add it back after 1.0 when we actually use it.

Is everyone OK with this?

On Wed, Jan 21, 2009 at 12:01 AM, Piotr Kosiorowski
<pk...@gmail.com> wrote:
> I have configured hudson for 10 or more projects and always used the pmd
> plugin only to display the pmd results; the actual pmd task that
> generates the report was run from the ant script. Maybe it is
> possible to run pmd reports directly in hudson (not through the project
> build scripts), but I have never come across it.
> Piotr
>



-- 
Doğacan Güney

Re: [jira] Created: (NUTCH-680) Update external jars to latest versions

Posted by Piotr Kosiorowski <pk...@gmail.com>.
I have configured hudson for 10 or more projects and always used the pmd
plugin only to display the pmd results; the actual pmd task that
generates the report was run from the ant script. Maybe it is
possible to run pmd reports directly in hudson (not through the project
build scripts), but I have never come across it.
Piotr

On Tue, Jan 20, 2009 at 10:39 PM, Otis Gospodnetic
<og...@yahoo.com> wrote:
> They've had pmd integrated with Hudson for many months now, I believe.  I've seen patches in JIRA that were the result of fixes for problems reported by pmd.  Or maybe they run pmd by hand?

Re: [jira] Created: (NUTCH-680) Update external jars to latest versions

Posted by Otis Gospodnetic <og...@yahoo.com>.
They've had pmd integrated with Hudson for many months now, I believe.  I've seen patches in JIRA that were the result of fixes for problems reported by pmd.  Or maybe they run pmd by hand?

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: Doğacan Güney <do...@gmail.com>
> To: nutch-dev@lucene.apache.org
> Sent: Tuesday, January 20, 2009 3:40:20 PM
> Subject: Re: [jira] Created: (NUTCH-680) Update external jars to latest versions
> 
> On Tue, Jan 20, 2009 at 10:35 PM, Otis Gospodnetic
> wrote:
> > That I don't know...
> >
> > I don't see the jars here: http://svn.apache.org/viewvc/hadoop/core/trunk/lib/
> >
> > But who knows, maybe maven/ivy fetch them on demand.  I don't know.
> >
> 
> Hmm, does 0.19 use ivy (0.19 also doesn't have pmd)?
> 
> http://svn.apache.org/viewvc/hadoop/core/branches/branch-0.19/lib/
> 


Re: [jira] Created: (NUTCH-680) Update external jars to latest versions

Posted by Doğacan Güney <do...@gmail.com>.
On Tue, Jan 20, 2009 at 10:35 PM, Otis Gospodnetic
<og...@yahoo.com> wrote:
> That I don't know...
>
> I don't see the jars here: http://svn.apache.org/viewvc/hadoop/core/trunk/lib/
>
> But who knows, maybe maven/ivy fetch them on demand.  I don't know.
>

Hmm, does 0.19 use ivy (0.19 also doesn't have pmd)?

http://svn.apache.org/viewvc/hadoop/core/branches/branch-0.19/lib/




-- 
Doğacan Güney

Re: [jira] Created: (NUTCH-680) Update external jars to latest versions

Posted by Otis Gospodnetic <og...@yahoo.com>.
That I don't know...

I don't see the jars here: http://svn.apache.org/viewvc/hadoop/core/trunk/lib/

But who knows, maybe maven/ivy fetch them on demand.  I don't know.

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: Doğacan Güney <do...@gmail.com>
> To: nutch-dev@lucene.apache.org
> Sent: Tuesday, January 20, 2009 1:13:20 PM
> Subject: Re: [jira] Created: (NUTCH-680) Update external jars to latest versions
> 
> On Tue, Jan 20, 2009 at 7:48 PM, Otis Gospodnetic
> wrote:
> > Lucene doesn't use anything.
> > Hadoop uses pmd integrated in Hudson.
> >
> 
> Does this mean we do not need the pmd jars in nutch (are they provided by hudson)?
> 


Re: [jira] Created: (NUTCH-680) Update external jars to latest versions

Posted by Doğacan Güney <do...@gmail.com>.
On Tue, Jan 20, 2009 at 7:48 PM, Otis Gospodnetic
<og...@yahoo.com> wrote:
> Lucene doesn't use anything.
> Hadoop uses pmd integrated in Hudson.
>

Does this mean we do not need the pmd jars in nutch (are they provided by hudson)?

> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
> ----- Original Message ----
>> From: Doğacan Güney <do...@gmail.com>
>> To: nutch-dev@lucene.apache.org
>> Sent: Tuesday, January 20, 2009 10:49:44 AM
>> Subject: Re: [jira] Created: (NUTCH-680) Update external jars to latest versions
>>
>> 2009/1/20 Piotr Kosiorowski :
>> > pmd-ext contains PMD (http://pmd.sourceforge.net/) libraries. I have
>> > committed them a long time ago in an attempt to bring some static
>> > analysis tools to nutch sources. There was a short discussion around
>> > it and we all thought it was worth doing but it never gained enough
>> > momentum.   There is a pmd target in build.xml file that uses it -
>> > they are not needed in runtime nor for standard builds.
>> > As nutch is built using hudson now I think it would be worth to
>> > integrate pmd (and checkstyle/findbugs/cobertura might be also
>> > interesting) - hudson has very nice plugins for such tools. I am using
>> > it in my daily job and I found it valuable.
>>
>> Thanks for the explanation. I am definitely +1 on having some sort of
>> static analysis tools for nutch.
>>
>> Does anyone know what hadoop/hbase/lucene use for this? Or do
>> they use anything at all?
>>
>> > But as I am not active committer now (I only try to follow mailing
>> > lists) I do not think it is my call.  But if everyone will be
>> > interested I can try to look at integration (but it will move forward
>> > slowly - my youngest kid was born just 2 months ago and it takes a lot
>> > of attention).
>>
>> Congratulations!
>>
>> > Piotr
>> >
>> > On Mon, Jan 19, 2009 at 3:02 PM, Doğacan Güney (JIRA) wrote:
>> >> Update external jars to latest versions
>> >> ---------------------------------------
>> >>
>> >>                 Key: NUTCH-680
>> >>                 URL: https://issues.apache.org/jira/browse/NUTCH-680
>> >>             Project: Nutch
>> >>          Issue Type: Improvement
>> >>            Reporter: Doğacan Güney
>> >>            Assignee: Doğacan Güney
>> >>            Priority: Minor
>> >>             Fix For: 1.0.0
>> >>
>> >>
>> >> This issue will be used to update external libraries nutch uses.
>> >>
>> >> These are the libraries that are outdated (upon a quick glance):
>> >>
>> >> nekohtml (1.9.9)
>> >> lucene-highlighter (2.4.0)
>> >> jdom (1.1)
>> >> carrot2 - as mentioned in another issue
>> >> jets3t - above
>> >> icu4j (4.0.1)
>> >> jakarta-oro (2.0.8)
>> >>
>> >> We should probably update tika to whatever the latest is as well before 1.0.
>> >>
>> >>
>> >> Please add ones  I missed in comments.
>> >>
>> >> Also what exactly is pmd-ext? There is an extra jakarta-oro and jaxen there.....
>> >>
>> >> --
>> >> This message is automatically generated by JIRA.
>> >> -
>> >> You can reply to this email to add a comment to the issue online.
>> >>
>> >>
>> >
>>
>>
>>
>> --
>> Doğacan Güney
>
>



-- 
Doğacan Güney

Re: [jira] Created: (NUTCH-680) Update external jars to latest versions

Posted by Otis Gospodnetic <og...@yahoo.com>.
Lucene doesn't use anything.
Hadoop uses pmd integrated in Hudson.

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: Doğacan Güney <do...@gmail.com>
> To: nutch-dev@lucene.apache.org
> Sent: Tuesday, January 20, 2009 10:49:44 AM
> Subject: Re: [jira] Created: (NUTCH-680) Update external jars to latest versions
> 
> 2009/1/20 Piotr Kosiorowski :
> > pmd-ext contains PMD (http://pmd.sourceforge.net/) libraries. I have
> > committed them a long time ago in an attempt to bring some static
> > analysis tools to nutch sources. There was a short discussion around
> > it and we all thought it was worth doing but it never gained enough
> > momentum.   There is a pmd target in build.xml file that uses it -
> > they are not needed in runtime nor for standard builds.
> > As nutch is built using hudson now I think it would be worth to
> > integrate pmd (and checkstyle/findbugs/cobertura might be also
> > interesting) - hudson has very nice plugins for such tools. I am using
> > it in my daily job and I found it valuable.
> 
> Thanks for the explanation. I am definitely +1 on having some sort of
> static analysis tools for nutch.
> 
> Does anyone know what hadoop/hbase/lucene use for this? Or do
> they use anything at all?
> 
> > But as I am not active committer now (I only try to follow mailing
> > lists) I do not think it is my call.  But if everyone will be
> > interested I can try to look at integration (but it will move forward
> > slowly - my youngest kid was born just 2 months ago and it takes a lot
> > of attention).
> 
> Congratulations!
> 
> > Piotr
> >
> > On Mon, Jan 19, 2009 at 3:02 PM, Doğacan Güney (JIRA) wrote:
> >> Update external jars to latest versions
> >> ---------------------------------------
> >>
> >>                 Key: NUTCH-680
> >>                 URL: https://issues.apache.org/jira/browse/NUTCH-680
> >>             Project: Nutch
> >>          Issue Type: Improvement
> >>            Reporter: Doğacan Güney
> >>            Assignee: Doğacan Güney
> >>            Priority: Minor
> >>             Fix For: 1.0.0
> >>
> >>
> >> This issue will be used to update external libraries nutch uses.
> >>
> >> These are the libraries that are outdated (upon a quick glance):
> >>
> >> nekohtml (1.9.9)
> >> lucene-highlighter (2.4.0)
> >> jdom (1.1)
> >> carrot2 - as mentioned in another issue
> >> jets3t - above
> >> icu4j (4.0.1)
> >> jakarta-oro (2.0.8)
> >>
> >> We should probably update tika to whatever the latest is as well before 1.0.
> >>
> >>
> >> Please add ones  I missed in comments.
> >>
> >> Also what exactly is pmd-ext? There is an extra jakarta-oro and jaxen there.....
> >>
> >> --
> >> This message is automatically generated by JIRA.
> >> -
> >> You can reply to this email to add a comment to the issue online.
> >>
> >>
> >
> 
> 
> 
> -- 
> Doğacan Güney


Re: [jira] Created: (NUTCH-680) Update external jars to latest versions

Posted by Doğacan Güney <do...@gmail.com>.
2009/1/20 Piotr Kosiorowski <pk...@gmail.com>:
> pmd-ext contains PMD (http://pmd.sourceforge.net/) libraries. I have
> committed them a long time ago in an attempt to bring some static
> analysis tools to nutch sources. There was a short discussion around
> it and we all thought it was worth doing but it never gained enough
> momentum.   There is a pmd target in build.xml file that uses it -
> they are not needed in runtime nor for standard builds.
> As nutch is built using hudson now I think it would be worth to
> integrate pmd (and checkstyle/findbugs/cobertura might be also
> interesting) - hudson has very nice plugins for such tools. I am using
> it in my daily job and I found it valuable.

Thanks for the explanation. I am definitely +1 on having some sort of
static analysis tools for nutch.

Does anyone know what hadoop/hbase/lucene use for this? Or do
they use anything at all?

> But as I am not active committer now (I only try to follow mailing
> lists) I do not think it is my call.  But if everyone will be
> interested I can try to look at integration (but it will move forward
> slowly - my youngest kid was born just 2 months ago and it takes a lot
> of attention).

Congratulations!

> Piotr
>
> On Mon, Jan 19, 2009 at 3:02 PM, Doğacan Güney (JIRA) <ji...@apache.org> wrote:
>> Update external jars to latest versions
>> ---------------------------------------
>>
>>                 Key: NUTCH-680
>>                 URL: https://issues.apache.org/jira/browse/NUTCH-680
>>             Project: Nutch
>>          Issue Type: Improvement
>>            Reporter: Doğacan Güney
>>            Assignee: Doğacan Güney
>>            Priority: Minor
>>             Fix For: 1.0.0
>>
>>
>> This issue will be used to update external libraries nutch uses.
>>
>> These are the libraries that are outdated (upon a quick glance):
>>
>> nekohtml (1.9.9)
>> lucene-highlighter (2.4.0)
>> jdom (1.1)
>> carrot2 - as mentioned in another issue
>> jets3t - above
>> icu4j (4.0.1)
>> jakarta-oro (2.0.8)
>>
>> We should probably update tika to whatever the latest is as well before 1.0.
>>
>>
>> Please add ones  I missed in comments.
>>
>> Also what exactly is pmd-ext? There is an extra jakarta-oro and jaxen there.....
>>
>> --
>> This message is automatically generated by JIRA.
>> -
>> You can reply to this email to add a comment to the issue online.
>>
>>
>



-- 
Doğacan Güney