Posted to dev@nutch.apache.org by "Kris K (JIRA)" <ji...@apache.org> on 2006/05/08 14:45:20 UTC

[jira] Created: (NUTCH-265) Getting Clustered results in better form.

Getting Clustered results in better form.
-----------------------------------------

         Key: NUTCH-265
         URL: http://issues.apache.org/jira/browse/NUTCH-265
     Project: Nutch
        Type: Improvement

  Components: searcher  
    Versions: 0.7.2    
    Reporter: Kris K


Clustered results currently come back with only a title and a link to the URL. As an improvement, the clusters should be labeled with keyword phrases (as Vivisimo does). Anyone is welcome to share their views on this.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


[jira] Commented: (NUTCH-265) Getting Clustered results in better form.

Posted by "Kris K (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/NUTCH-265?page=comments#action_12413214 ] 

Kris K commented on NUTCH-265:
------------------------------

Dear Dawid, yes, I want the same interface you showed me before, but I have not been able to get it working. My concern is this: if I search for the keyword "Java", the clustered results should come back in the following format:
Java SourceCode
Java Book
Java Compiler
Java Programming
Java Server
Java Technology
Java Servlets
Java Applets
JavaScript .....

I really appreciate your help in pointing me in the right direction.





[jira] Updated: (NUTCH-265) Getting Clustered results in better form.

Posted by "Kris K (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-265?page=all ]

Kris K updated NUTCH-265:
-------------------------


Are there any updates on this issue? Please help me solve this problem. I want the clustered results to be labeled with keywords or phrases, not with titles.



[jira] Commented: (NUTCH-265) Getting Clustered results in better form.

Posted by "Dawid Weiss (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/NUTCH-265?page=comments#action_12413072 ] 

Dawid Weiss commented on NUTCH-265:
-----------------------------------

Kris, the current clusterer in Nutch _does_ discover phrases for clusters, so I don't know what you really mean. Did you take a look at my previous post? Would that kind of user interface work for you?



[jira] Commented: (NUTCH-265) Getting Clustered results in better form.

Posted by "Dawid Weiss (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/NUTCH-265?page=comments#action_12378425 ] 

Dawid Weiss commented on NUTCH-265:
-----------------------------------

The clustering interface in Nutch is very simple because it usually needs to be adjusted to the needs of a particular application. Maintaining a complex user interface is not among Nutch's objectives, so I doubt it's possible. Carrot2, which Nutch uses internally, has a JavaScript-powered interface that could be added to Nutch if there are people who really think it is worth the effort.

See this one:
http://carrot.cs.put.poznan.pl/carrot2-remote-controller/newsearch.do?query=nutch&processingChain=carrot2.process.lingo-yahooapi&resultsRequested=100



[jira] Commented: (NUTCH-265) Getting Clustered results in better form.

Posted by "Dawid Weiss (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/NUTCH-265?page=comments#action_12413220 ] 

Dawid Weiss commented on NUTCH-265:
-----------------------------------

If you just mean the user interface, then you can simply take the XSLT stylesheet from Carrot2 and reuse it in Nutch with the OpenSearch XML -- I believe there is even an example in Carrot2 of using OpenSearch, so you shouldn't have much trouble.
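
As a rough illustration of the stylesheet approach, the sketch below applies an XSLT stylesheet to an RSS-style XML response using the standard javax.xml.transform API. The XML and the stylesheet here are hypothetical, heavily trimmed stand-ins: they are neither the actual Nutch OpenSearch output nor the stylesheet shipped with Carrot2, and the class name is made up for this example.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

public class OpenSearchXsltSketch {

    // Hypothetical, trimmed-down RSS-style response. A real OpenSearch
    // response carries more elements and namespaces than shown here.
    static final String SAMPLE_XML =
        "<rss version=\"2.0\"><channel>"
        + "<item><title>Java Servlets</title><link>http://example.com/a</link></item>"
        + "<item><title>Java Applets</title><link>http://example.com/b</link></item>"
        + "</channel></rss>";

    // Hypothetical stylesheet standing in for a real one: it renders
    // each item as a linked entry in an HTML list.
    static final String SAMPLE_XSLT =
        "<xsl:stylesheet version=\"1.0\""
        + " xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\">"
        + "<xsl:output method=\"html\"/>"
        + "<xsl:template match=\"/\">"
        + "<ul><xsl:for-each select=\"rss/channel/item\">"
        + "<li><a href=\"{link}\"><xsl:value-of select=\"title\"/></a></li>"
        + "</xsl:for-each></ul>"
        + "</xsl:template></xsl:stylesheet>";

    // Applies the given stylesheet to the given XML and returns the result as text.
    static String transform(String xml, String xslt) {
        try {
            TransformerFactory factory = TransformerFactory.newInstance();
            Transformer transformer =
                factory.newTransformer(new StreamSource(new StringReader(xslt)));
            StringWriter out = new StringWriter();
            transformer.transform(
                new StreamSource(new StringReader(xml)), new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            throw new IllegalStateException("XSLT transformation failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(transform(SAMPLE_XML, SAMPLE_XSLT));
    }
}
```

In a real setup the XML would come from the search front end's OpenSearch output and the stylesheet from the Carrot2 distribution; both would be read from streams or files rather than from string constants.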

Now, the phrases you wish to see on your screen won't always be so clean, because search results clustering works on snippets extracted from the search results. If you want clean and accurate labels, you'd need to use something like a predefined ontology -- I can't help you with that.
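
To make the point about snippet-derived labels concrete, here is a deliberately naive sketch that collects two-word phrases shared by several snippets. It is not the algorithm Carrot2 uses; the snippets, class name, and method name are all invented for illustration. It only shows that phrases mined from snippet text mix useful labels with noise.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

public class SnippetPhraseSketch {

    // Hypothetical snippets, as a search engine might return for "java".
    static final List<String> SAMPLE_SNIPPETS = Arrays.asList(
        "Download Java source code and Java compiler tools here",
        "Click here to read more about the Java compiler",
        "Java source code examples. Click here to read more");

    // Returns every two-word phrase that occurs in at least minSnippets
    // distinct snippets, mapped to the number of snippets containing it.
    static Map<String, Integer> frequentBigrams(List<String> snippets, int minSnippets) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String snippet : snippets) {
            String[] words = snippet.toLowerCase(Locale.ROOT).split("[^a-z0-9]+");
            // Count each phrase at most once per snippet.
            Set<String> seen = new HashSet<>();
            for (int i = 0; i + 1 < words.length; i++) {
                if (words[i].isEmpty()) {
                    continue; // split() may yield a leading empty token
                }
                seen.add(words[i] + " " + words[i + 1]);
            }
            for (String phrase : seen) {
                counts.merge(phrase, 1, Integer::sum);
            }
        }
        counts.values().removeIf(count -> count < minSnippets);
        return counts;
    }

    public static void main(String[] args) {
        frequentBigrams(SAMPLE_SNIPPETS, 2)
            .forEach((phrase, count) -> System.out.println(phrase + " : " + count));
    }
}
```

On these sample snippets, phrases such as "java compiler" and "java source" survive the threshold, but so do "click here" and "read more". A real clusterer filters far more carefully, yet its labels are still limited by what the snippets contain.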

Try playing around with the Carrot2 demo and see whether the results satisfy your needs. If so, rewriting Nutch's user interface to suit them shouldn't be a problem. If your expectations are more demanding, you'll need to think of some other solution.

