You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Hoss Man (JIRA)" <ji...@apache.org> on 2010/12/22 19:12:01 UTC

[jira] Reopened: (SOLR-2282) Distributed Support for Search Result Clustering

     [ https://issues.apache.org/jira/browse/SOLR-2282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hoss Man reopened SOLR-2282:
----------------------------


Reopening issue.

The new test added by this issue...

org.apache.solr.handler.clustering.DistributedClusteringComponentTest.testDistribSearch

...was failing consistently on both hudson, and robert muir's machine, so rmuir disabled it with @Ignore.

we should get to the bottom of this before resolving

error from hudson...

{quote}
Error Message

Some threads threw uncaught exceptions!

Stacktrace

junit.framework.AssertionFailedError: Some threads threw uncaught exceptions!
	at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:950)
	at org.apache.lucene.util.LuceneTestCase$LuceneTestCaseRunner.runChild(LuceneTestCase.java:888)
	at org.apache.lucene.util.LuceneTestCase.tearDown(LuceneTestCase.java:371)
	at org.apache.solr.SolrTestCaseJ4.tearDown(SolrTestCaseJ4.java:78)
	at org.apache.solr.BaseDistributedSearchTestCase.tearDown(BaseDistributedSearchTestCase.java:130)

Standard Error

22-Dec-2010 6:27:38 AM org.apache.solr.common.SolrException log
SEVERE: java.lang.Error: Error: could not match input
	at org.carrot2.text.analysis.ExtendedWhitespaceTokenizerImpl.zzScanError(ExtendedWhitespaceTokenizerImpl.java:687)
	at org.carrot2.text.analysis.ExtendedWhitespaceTokenizerImpl.getNextToken(ExtendedWhitespaceTokenizerImpl.java:836)
	at org.carrot2.text.analysis.ExtendedWhitespaceTokenizer.nextToken(ExtendedWhitespaceTokenizer.java:46)
	at org.carrot2.text.preprocessing.Tokenizer.tokenize(Tokenizer.java:147)
	at org.carrot2.text.preprocessing.pipeline.CompletePreprocessingPipeline.preprocess(CompletePreprocessingPipeline.java:54)
	at org.carrot2.text.preprocessing.pipeline.BasicPreprocessingPipeline.preprocess(BasicPreprocessingPipeline.java:92)
	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm.cluster(LingoClusteringAlgorithm.java:199)
	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm.access$000(LingoClusteringAlgorithm.java:44)
	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm$1.process(LingoClusteringAlgorithm.java:178)
	at org.carrot2.text.clustering.MultilingualClustering.clusterByLanguage(MultilingualClustering.java:222)
	at org.carrot2.text.clustering.MultilingualClustering.process(MultilingualClustering.java:110)
	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm.process(LingoClusteringAlgorithm.java:171)
	at org.carrot2.core.ControllerUtils.performProcessing(ControllerUtils.java:101)
	at org.carrot2.core.Controller.process(Controller.java:287)
	at org.carrot2.core.Controller.process(Controller.java:180)
	at org.apache.solr.handler.clustering.carrot2.CarrotClusteringEngine.cluster(CarrotClusteringEngine.java:105)
	at org.apache.solr.handler.clustering.ClusteringComponent.finishStage(ClusteringComponent.java:171)
	at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:296)
	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1358)
	at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:341)
	at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:244)
	at org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1212)
	at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:399)
	at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
	at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
	at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
	at org.mortbay.jetty.Server.handle(Server.java:326)
	at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542)
	at org.mortbay.jetty.HttpConnection$RequestHandler.headerComplete(HttpConnection.java:928)
	at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:549)
	at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:212)
	at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404)
	at org.mortbay.jetty.bio.SocketConnector$Connection.run(SocketConnector.java:228)
	at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582)

NOTE: reproduce with: ant test -Dtestcase=DistributedClusteringComponentTest -Dtestmethod=testDistribSearch -Dtests.seed=412049972111174180:6405396687385598457 -Dtests.multiplier=3
The following exceptions were thrown by threads:
*** Thread: Thread-13 ***
junit.framework.AssertionFailedError: .clusters.length:4!=5
	at junit.framework.Assert.fail(Assert.java:47)
	at org.apache.solr.BaseDistributedSearchTestCase.compareResponses(BaseDistributedSearchTestCase.java:494)
	at org.apache.solr.BaseDistributedSearchTestCase$5.run(BaseDistributedSearchTestCase.java:262)
*** Thread: Thread-14 ***
java.lang.RuntimeException: org.apache.solr.client.solrj.SolrServerException: Error executing query
	at org.apache.solr.BaseDistributedSearchTestCase$5.run(BaseDistributedSearchTestCase.java:265)
Caused by: org.apache.solr.client.solrj.SolrServerException: Error executing query
	at org.apache.solr.client.solrj.request.QueryRequest.process(QueryRequest.java:95)
	at org.apache.solr.client.solrj.SolrServer.query(SolrServer.java:118)
	at org.apache.solr.BaseDistributedSearchTestCase$5.run(BaseDistributedSearchTestCase.java:260)
Caused by: org.apache.solr.common.SolrException: Error: could not match input  java.lang.Error: Error: could not match input 	at org.carrot2.text.analysis.ExtendedWhitespaceTokenizerImpl.zzScanError(ExtendedWhitespaceTokenizerImpl.java:687) 	at org.carrot2.text.analysis.ExtendedWhitespaceTokenizerImpl.getNextToken(ExtendedWhitespaceTokenizerImpl.java:836) 	at org.carrot2.text.analysis.ExtendedWhitespaceTokenizer.nextToken(ExtendedWhitespaceTokenizer.java:46) 	at org.carrot2.text.preprocessing.Tokenizer.tokenize(Tokenizer.java:147) 	at org.carrot2.text.preprocessing.pipeline.CompletePreprocessingPipeline.preprocess(CompletePreprocessingPipeline.java:54) 	at org.carrot2.text.preprocessing.pipeline.BasicPreprocessingPipeline.preprocess(BasicPreprocessingPipeline.java:92) 	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm.cluster(LingoClusteringAlgorithm.java:199) 	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm.access$000(LingoClusteringAlgorithm.java:44) 	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm$1.process(LingoClusteringAlgorithm.java:178) 	at org.carrot2.text.clustering.MultilingualClustering.clusterByLanguage(MultilingualClustering.java:222) 	at org.carrot2.text.clustering.MultilingualClustering.process(MultilingualClustering.java:110) 	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm.process(LingoClusteringAlgorithm.java:171) 	at org.carrot2.core.ControllerUtils.performProcessing(ControllerUtils.java:101) 	at org.carrot2.core.Controller.process(Controller.java:287) 	at org.carrot2.core.Controller.process(Controller.java:180) 	at org.apache.solr.handler.clustering.carrot2.CarrotClusteringEngine.cluster(CarrotClusteringEngine.java:105) 	at org.apache.solr.handler.clustering.ClusteringComponent.finishStage(ClusteringComponent.java:171) 	at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:296) 	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131) 	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1358) 	at org.apache.solr.servlet.SolrDispatchFilter.

Error: could not match input  java.lang.Error: Error: could not match input 	at org.carrot2.text.analysis.ExtendedWhitespaceTokenizerImpl.zzScanError(ExtendedWhitespaceTokenizerImpl.java:687) 	at org.carrot2.text.analysis.ExtendedWhitespaceTokenizerImpl.getNextToken(ExtendedWhitespaceTokenizerImpl.java:836) 	at org.carrot2.text.analysis.ExtendedWhitespaceTokenizer.nextToken(ExtendedWhitespaceTokenizer.java:46) 	at org.carrot2.text.preprocessing.Tokenizer.tokenize(Tokenizer.java:147) 	at org.carrot2.text.preprocessing.pipeline.CompletePreprocessingPipeline.preprocess(CompletePreprocessingPipeline.java:54) 	at org.carrot2.text.preprocessing.pipeline.BasicPreprocessingPipeline.preprocess(BasicPreprocessingPipeline.java:92) 	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm.cluster(LingoClusteringAlgorithm.java:199) 	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm.access$000(LingoClusteringAlgorithm.java:44) 	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm$1.process(LingoClusteringAlgorithm.java:178) 	at org.carrot2.text.clustering.MultilingualClustering.clusterByLanguage(MultilingualClustering.java:222) 	at org.carrot2.text.clustering.MultilingualClustering.process(MultilingualClustering.java:110) 	at org.carrot2.clustering.lingo.LingoClusteringAlgorithm.process(LingoClusteringAlgorithm.java:171) 	at org.carrot2.core.ControllerUtils.performProcessing(ControllerUtils.java:101) 	at org.carrot2.core.Controller.process(Controller.java:287) 	at org.carrot2.core.Controller.process(Controller.java:180) 	at org.apache.solr.handler.clustering.carrot2.CarrotClusteringEngine.cluster(CarrotClusteringEngine.java:105) 	at org.apache.solr.handler.clustering.ClusteringComponent.finishStage(ClusteringComponent.java:171) 	at org.apache.solr.handler.component.SearchHandler.handleRequestBody(SearchHandler.java:296) 	at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131) 	at org.apache.solr.core.SolrCore.execute(SolrCore.java:1358) 	at org.apache.solr.servlet.SolrDispatchFilter.

request: http://localhost:14333/solr/select?clustering=true&q=*:*&sort=id desc&clustering.results=true&shards=localhost:14333/solr&wt=javabin&version=2
	at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:435)
	at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
	at org.apache.solr.client.solrj.request.QueryRequest.process(QueryRequest.java:89)
	... 2 more
WARNING: test class left thread running: Thread[MultiThreadedHttpConnectionManager cleanup,5,main]
WARNING: test class left thread running: Thread[pool-7-thread-1,5,main]
WARNING: test class left thread running: Thread[pool-7-thread-3,5,main]
WARNING: test class left thread running: Thread[pool-7-thread-2,5,main]
WARNING: test class left thread running: Thread[pool-7-thread-4,5,main]
WARNING: test class left thread running: Thread[pool-7-thread-5,5,main]
WARNING: test class left thread running: Thread[pool-7-thread-6,5,main]
RESOURCE LEAK: test class left 7 thread(s) running
NOTE: test params are: locale=en_CA, timezone=Asia/Ashgabat
NOTE: all tests run in this JVM:
[ClusteringComponentTest, DistributedClusteringComponentTest]
{quote}

> Distributed Support for Search Result Clustering
> ------------------------------------------------
>
>                 Key: SOLR-2282
>                 URL: https://issues.apache.org/jira/browse/SOLR-2282
>             Project: Solr
>          Issue Type: New Feature
>          Components: contrib - Clustering
>    Affects Versions: 1.4, 1.4.1
>            Reporter: Koji Sekiguchi
>            Assignee: Koji Sekiguchi
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch, SOLR-2282.patch
>
>
> Brad Giaccio contributed a patch for this in SOLR-769. I'd like to incorporate it.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org