You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Doğacan Güney <do...@agmlab.com> on 2006/09/07 16:34:59 UTC
clustering
Hi,
I have lately been looking at the clustering-carrot2 plugin. What I want
to do is to make some changes in carrot2 code to change the way
clustering-carrot2 plugin behaves. For this, I pulled carrot's code from
svn, made some changes to the code, then compiled carrot2-nutch-plugin.
The problem is that, the carrot jars (under
nutch/src/plugin/clustering-carrot2/lib) have a com.dawidweiss
namespace, while jars produced by carrot2-nutch-plugin have org.carrot2
namespace. They also seem to have different public APIs.
So, how can I change the carrot code in nutch? Or, where can I find the
com.dawidweiss source code?
Good day,
Doğacan Güney
Re: clustering
Posted by Dawid Weiss <da...@cs.put.poznan.pl>.
> It seems at least several classes(like ClustersConsumerOutputComponent)
> are missing and some classes changed their API(RawDocumentBase no longer
> has a empty constructor). Anyway, if changes are only minor, I will try
> to find the right mapping between old and new API.
Ah, right -- this class has been renamed to ArrayOutputComponent. If you
take a look at the API example:
http://svn.sourceforge.net/viewvc/carrot2/trunk/carrot2/applications/carrot2-demo-api-example/src/org/carrot2/apiexample/Example.java?revision=1426&view=markup
it contains verbose JavaDoc and the code should be very similar to what
you can find in Nutch. I'm sorry for this inconvenience, I will port the
plugin code in Nutch but I wanted to wait until we have made a release
so that everything goes in a single patch.
Dawid