You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Sebastian Nagel <wa...@googlemail.com> on 2017/11/02 11:06:22 UTC

Re: Fwd: Maven configuration

Hi Raffaele,

please also have a look on
  https://issues.apache.org/jira/browse/NUTCH-2292
(there is already work done)

@Lewis: can you summarize the status of NUTCH-2292?

Thanks,
Sebastian



On 10/31/2017 07:04 PM, Raffaele Palmieri wrote:
> Hi Sebastian, effectively you're right. Initially I forked repo, then there have been many updates,
> and preferred to start from scratch. For me it is important to know that this could a feasible
> alternative to current configuration, eventually well accepted from the community.
> In the next days, I'll try to get back to history, let you know,
> best regards,
> RP.
> 
> 2017-10-31 18:23 GMT+01:00 Sebastian Nagel <wastl.nagel@googlemail.com
> <ma...@googlemail.com>>:
> 
>     Hi Raffaele,
> 
>     thanks for your work. If time I'll have a look at it.
> 
>     One questions: the repository does not contain any of the existing commits,
>     the entire history is lost. That makes it impossible to track the origin of the code.
>     Is it possible for you to do the work starting from a fresh checkout
>     of https://github.com/apache/nutch/ <https://github.com/apache/nutch/> (master branch)?
> 
>     Best,
>     Sebastian
> 
> 
>     On 10/27/2017 04:02 PM, Raffaele Palmieri wrote:
>     > Dear developers,
>     > as follow-up of proposal, I share the experimetal project with mavenized
>     > Nutch: https://github.com/zirafel/apache_nutch <https://github.com/zirafel/apache_nutch>
>     > Comments and feedbacks are well accepted. For the deployment process, assembly plugin could be used.
>     > Do you think it is feasible?
>     > Best,
>     > RP.
>     >
>     >
>     > ---------- Forwarded message ----------
>     > From: *Raffaele Palmieri* <raffaele.palmieri@gmail.com <ma...@gmail.com>
>     <mailto:raffaele.palmieri@gmail.com <ma...@gmail.com>>>
>     > Date: 2017-09-21 18:33 GMT+02:00
>     > Subject: Maven configuration
>     > To: dev@nutch.apache.org <ma...@nutch.apache.org> <mailto:dev@nutch.apache.org
>     <ma...@nutch.apache.org>>
>     >
>     >
>     > Hi devs,
>     > I've seen that maven configuration argument had been discussed and there is also an issue on Jira
>     > (https://issues.apache.org/jira/browse/NUTCH-1371
>     <https://issues.apache.org/jira/browse/NUTCH-1371>
>     <https://issues.apache.org/jira/browse/NUTCH-1371
>     <https://issues.apache.org/jira/browse/NUTCH-1371>>).
>     > Our need is developing/integrating some plugins for Nutch and I realize that using current
>     > configuration Ant/Ivy is quite difficult to use for the development and unit testing.
>     > Is there any updates to migrate to Maven?
>     > Our proposal is:
>     > pom.xml (aggregator)
>     > - main_module (actually src)
>     > - nutch_plugins (pom plugin aggregator)
>     > - creativecommons (actually /src/plugin/creativecommons)
>     > - index-anchor
>     >                 - ...
>     > It's true that's a problem for who runs current configuration, but Maven could be a better
>     solution
>     > also for them,
>     > Best,
>     > RP.
>     >
> 
> 


Re: Fwd: Maven configuration

Posted by Sebastian Nagel <wa...@googlemail.com>.
Hi Raffaele,

sorry for the late reply.

I also do not know about the status of NUTCH-2292 (except for what is visible on Jira and github).

> I haven't verified the tests.

Unit tests should run and succeed. That's a requirement for pull request to be merged.

Best,
Sebastian

On 11/16/2017 06:46 PM, Raffaele Palmieri wrote:
> Hi Sebastian and Lewis,
> in the meantime you can find the commit with some stuff for mavenization
> here: https://github.com/zirafel/nutch/tree/NUTCH-2292
> <https://github.com/zirafel/nutch/tree/NUTCH-2292>
> I haven't verified the tests. 
> @Lewis let me know if the state of 2292 hasn't changed, so that I make a pull request.
> Best regards,
> RP.
> Best,
> RP.
> 
> 
> Il 02 Nov 2017 12:06, "Sebastian Nagel" <wastl.nagel@googlemail.com
> <ma...@googlemail.com>> ha scritto:
> 
>     Hi Raffaele,
> 
>     please also have a look on
>       https://issues.apache.org/jira/browse/NUTCH-2292
>     <https://issues.apache.org/jira/browse/NUTCH-2292>
>     (there is already work done)
> 
>     @Lewis: can you summarize the status of NUTCH-2292?
> 
>     Thanks,
>     Sebastian
> 
> 
> 
>     On 10/31/2017 07:04 PM, Raffaele Palmieri wrote:
>     > Hi Sebastian, effectively you're right. Initially I forked repo, then there have been many
>     updates,
>     > and preferred to start from scratch. For me it is important to know that this could a feasible
>     > alternative to current configuration, eventually well accepted from the community.
>     > In the next days, I'll try to get back to history, let you know,
>     > best regards,
>     > RP.
>     >
>     > 2017-10-31 18:23 GMT+01:00 Sebastian Nagel <wastl.nagel@googlemail.com
>     <ma...@googlemail.com>
>     > <mailto:wastl.nagel@googlemail.com <ma...@googlemail.com>>>:
>     >
>     >     Hi Raffaele,
>     >
>     >     thanks for your work. If time I'll have a look at it.
>     >
>     >     One questions: the repository does not contain any of the existing commits,
>     >     the entire history is lost. That makes it impossible to track the origin of the code.
>     >     Is it possible for you to do the work starting from a fresh checkout
>     >     of https://github.com/apache/nutch/ <https://github.com/apache/nutch/>
>     <https://github.com/apache/nutch/ <https://github.com/apache/nutch/>> (master branch)?
>     >
>     >     Best,
>     >     Sebastian
>     >
>     >
>     >     On 10/27/2017 04:02 PM, Raffaele Palmieri wrote:
>     >     > Dear developers,
>     >     > as follow-up of proposal, I share the experimetal project with mavenized
>     >     > Nutch: https://github.com/zirafel/apache_nutch <https://github.com/zirafel/apache_nutch>
>     <https://github.com/zirafel/apache_nutch <https://github.com/zirafel/apache_nutch>>
>     >     > Comments and feedbacks are well accepted. For the deployment process, assembly plugin
>     could be used.
>     >     > Do you think it is feasible?
>     >     > Best,
>     >     > RP.
>     >     >
>     >     >
>     >     > ---------- Forwarded message ----------
>     >     > From: *Raffaele Palmieri* <raffaele.palmieri@gmail.com
>     <ma...@gmail.com> <mailto:raffaele.palmieri@gmail.com
>     <ma...@gmail.com>>
>     >     <mailto:raffaele.palmieri@gmail.com <ma...@gmail.com>
>     <mailto:raffaele.palmieri@gmail.com <ma...@gmail.com>>>>
>     >     > Date: 2017-09-21 18:33 GMT+02:00
>     >     > Subject: Maven configuration
>     >     > To: dev@nutch.apache.org <ma...@nutch.apache.org> <mailto:dev@nutch.apache.org
>     <ma...@nutch.apache.org>> <mailto:dev@nutch.apache.org <ma...@nutch.apache.org>
>     >     <mailto:dev@nutch.apache.org <ma...@nutch.apache.org>>>
>     >     >
>     >     >
>     >     > Hi devs,
>     >     > I've seen that maven configuration argument had been discussed and there is also an
>     issue on Jira
>     >     > (https://issues.apache.org/jira/browse/NUTCH-1371
>     <https://issues.apache.org/jira/browse/NUTCH-1371>
>     >     <https://issues.apache.org/jira/browse/NUTCH-1371
>     <https://issues.apache.org/jira/browse/NUTCH-1371>>
>     >     <https://issues.apache.org/jira/browse/NUTCH-1371
>     <https://issues.apache.org/jira/browse/NUTCH-1371>
>     >     <https://issues.apache.org/jira/browse/NUTCH-1371
>     <https://issues.apache.org/jira/browse/NUTCH-1371>>>).
>     >     > Our need is developing/integrating some plugins for Nutch and I realize that using current
>     >     > configuration Ant/Ivy is quite difficult to use for the development and unit testing.
>     >     > Is there any updates to migrate to Maven?
>     >     > Our proposal is:
>     >     > pom.xml (aggregator)
>     >     > - main_module (actually src)
>     >     > - nutch_plugins (pom plugin aggregator)
>     >     > - creativecommons (actually /src/plugin/creativecommons)
>     >     > - index-anchor
>     >     >                 - ...
>     >     > It's true that's a problem for who runs current configuration, but Maven could be a better
>     >     solution
>     >     > also for them,
>     >     > Best,
>     >     > RP.
>     >     >
>     >
>     >
> 
> 


Re: Fwd: Maven configuration

Posted by Raffaele Palmieri <ra...@gmail.com>.
Hi Sebastian and Lewis,
in the meantime you can find the commit with some stuff for mavenization
here: https://github.com/zirafel/nutch/tree/NUTCH-2292
I haven't verified the tests.
@Lewis let me know if the state of 2292 hasn't changed, so that I make a
pull request.
Best regards,
RP.
Best,
RP.


Il 02 Nov 2017 12:06, "Sebastian Nagel" <wa...@googlemail.com> ha
scritto:

Hi Raffaele,

please also have a look on
  https://issues.apache.org/jira/browse/NUTCH-2292
(there is already work done)

@Lewis: can you summarize the status of NUTCH-2292?

Thanks,
Sebastian



On 10/31/2017 07:04 PM, Raffaele Palmieri wrote:
> Hi Sebastian, effectively you're right. Initially I forked repo, then
there have been many updates,
> and preferred to start from scratch. For me it is important to know that
this could a feasible
> alternative to current configuration, eventually well accepted from the
community.
> In the next days, I'll try to get back to history, let you know,
> best regards,
> RP.
>
> 2017-10-31 18:23 GMT+01:00 Sebastian Nagel <wastl.nagel@googlemail.com
> <ma...@googlemail.com>>:
>
>     Hi Raffaele,
>
>     thanks for your work. If time I'll have a look at it.
>
>     One questions: the repository does not contain any of the existing
commits,
>     the entire history is lost. That makes it impossible to track the
origin of the code.
>     Is it possible for you to do the work starting from a fresh checkout
>     of https://github.com/apache/nutch/ <https://github.com/apache/nutch/>
(master branch)?
>
>     Best,
>     Sebastian
>
>
>     On 10/27/2017 04:02 PM, Raffaele Palmieri wrote:
>     > Dear developers,
>     > as follow-up of proposal, I share the experimetal project with
mavenized
>     > Nutch: https://github.com/zirafel/apache_nutch <
https://github.com/zirafel/apache_nutch>
>     > Comments and feedbacks are well accepted. For the deployment
process, assembly plugin could be used.
>     > Do you think it is feasible?
>     > Best,
>     > RP.
>     >
>     >
>     > ---------- Forwarded message ----------
>     > From: *Raffaele Palmieri* <raffaele.palmieri@gmail.com <mailto:
raffaele.palmieri@gmail.com>
>     <mailto:raffaele.palmieri@gmail.com <mailto:raffaele.palmieri@
gmail.com>>>
>     > Date: 2017-09-21 18:33 GMT+02:00
>     > Subject: Maven configuration
>     > To: dev@nutch.apache.org <ma...@nutch.apache.org> <mailto:
dev@nutch.apache.org
>     <ma...@nutch.apache.org>>
>     >
>     >
>     > Hi devs,
>     > I've seen that maven configuration argument had been discussed and
there is also an issue on Jira
>     > (https://issues.apache.org/jira/browse/NUTCH-1371
>     <https://issues.apache.org/jira/browse/NUTCH-1371>
>     <https://issues.apache.org/jira/browse/NUTCH-1371
>     <https://issues.apache.org/jira/browse/NUTCH-1371>>).
>     > Our need is developing/integrating some plugins for Nutch and I
realize that using current
>     > configuration Ant/Ivy is quite difficult to use for the development
and unit testing.
>     > Is there any updates to migrate to Maven?
>     > Our proposal is:
>     > pom.xml (aggregator)
>     > - main_module (actually src)
>     > - nutch_plugins (pom plugin aggregator)
>     > - creativecommons (actually /src/plugin/creativecommons)
>     > - index-anchor
>     >                 - ...
>     > It's true that's a problem for who runs current configuration, but
Maven could be a better
>     solution
>     > also for them,
>     > Best,
>     > RP.
>     >
>
>