Posted to user@nutch.apache.org by Doğacan Güney <do...@agmlab.com> on 2006/09/07 16:34:59 UTC

clustering

Hi,

I have lately been looking at the clustering-carrot2 plugin. I want to
change how the plugin behaves by modifying the carrot2 code itself. To do
this, I pulled carrot2's code from svn, made some changes, and then
compiled carrot2-nutch-plugin. The problem is that the carrot2 jars
shipped with Nutch (under nutch/src/plugin/clustering-carrot2/lib) use a
com.dawidweiss namespace, while the jars produced by carrot2-nutch-plugin
use an org.carrot2 namespace. They also seem to have different public APIs.

So, how can I change the carrot2 code used in Nutch? Alternatively, where
can I find the com.dawidweiss source code?

Good day,
Doğacan Güney

Re: clustering

Posted by Dawid Weiss <da...@cs.put.poznan.pl>.
> It seems at least several classes (like ClustersConsumerOutputComponent)
> are missing and some classes changed their API (RawDocumentBase no longer
> has an empty constructor). Anyway, if the changes are only minor, I will
> try to find the right mapping between the old and new API.

Ah, right -- ClustersConsumerOutputComponent has been renamed to
ArrayOutputComponent. If you take a look at the API example:

http://svn.sourceforge.net/viewvc/carrot2/trunk/carrot2/applications/carrot2-demo-api-example/src/org/carrot2/apiexample/Example.java?revision=1426&view=markup

it contains verbose JavaDoc and the code should be very similar to what
you can find in Nutch. I'm sorry for the inconvenience. I will port the
plugin code in Nutch, but I wanted to wait until we have made a release
so that everything goes in a single patch.

Dawid