You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by "tamanjit.bindra@yahoo.co.in" <ta...@yahoo.co.in> on 2011/06/15 08:46:20 UTC

Index not getting cleaned up

Hi,
When I run the cleaner script i.e.

for f in $FILES
do
  echo "Runing $f .......";
  bin/nutch solrclean crawl/$f/crawldb/ http://solrip
  echo "Finished $f ......";
done

Though the log says:

2011-06-15 12:06:02,007 INFO  solr.SolrClean - SolrClean: starting at
2011-06-15 12:06:02
2011-06-15 12:06:03,050 INFO  solr.SolrClean - SolrClean: deleting 34
documents
2011-06-15 12:06:03,907 INFO  solr.SolrClean - SolrClean: deleted a total of
34 documents
2011-06-15 12:06:04,752 INFO  solr.SolrClean - SolrClean: finished at
2011-06-15 12:06:04, elapsed: 00:00:02
2011-06-15 12:06:05,343 INFO  solr.SolrClean - SolrClean: starting at
2011-06-15 12:06:05
2011-06-15 12:06:06,527 INFO  solr.SolrClean - SolrClean: deleting 23
documents
2011-06-15 12:06:07,720 INFO  solr.SolrClean - SolrClean: deleted a total of
23 documents
2011-06-15 12:06:08,265 INFO  solr.SolrClean - SolrClean: finished at
2011-06-15 12:06:08, elapsed: 00:00:02


There is no actual decrement in count of the index. Moreover, on repeating
the solrclean command it again says that it has deleted 34 + 23 documents.
Don't know if I am doing something wrong or is this behaviour a bug. Has
anyone faced similar issue?

The solrclean command seems to work occasionally.

--
View this message in context: http://lucene.472066.n3.nabble.com/Index-not-getting-cleaned-up-tp3066254p3066254.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Re: Index not getting cleaned up

Posted by "tamanjit.bindra@yahoo.co.in" <ta...@yahoo.co.in>.
That explains it. Thanks.

--
View this message in context: http://lucene.472066.n3.nabble.com/Index-not-getting-cleaned-up-tp3066254p3066366.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Re: Index not getting cleaned up

Posted by Markus Jelsma <ma...@openindex.io>.
It sends delete commands even when the documents are not in the index anyway. 

> Hi,
> When I run the cleaner script i.e.
> 
> for f in $FILES
> do
>   echo "Runing $f .......";
>   bin/nutch solrclean crawl/$f/crawldb/ http://solrip
>   echo "Finished $f ......";
> done
> 
> Though the log says:
> 
> 2011-06-15 12:06:02,007 INFO  solr.SolrClean - SolrClean: starting at
> 2011-06-15 12:06:02
> 2011-06-15 12:06:03,050 INFO  solr.SolrClean - SolrClean: deleting 34
> documents
> 2011-06-15 12:06:03,907 INFO  solr.SolrClean - SolrClean: deleted a total
> of 34 documents
> 2011-06-15 12:06:04,752 INFO  solr.SolrClean - SolrClean: finished at
> 2011-06-15 12:06:04, elapsed: 00:00:02
> 2011-06-15 12:06:05,343 INFO  solr.SolrClean - SolrClean: starting at
> 2011-06-15 12:06:05
> 2011-06-15 12:06:06,527 INFO  solr.SolrClean - SolrClean: deleting 23
> documents
> 2011-06-15 12:06:07,720 INFO  solr.SolrClean - SolrClean: deleted a total
> of 23 documents
> 2011-06-15 12:06:08,265 INFO  solr.SolrClean - SolrClean: finished at
> 2011-06-15 12:06:08, elapsed: 00:00:02
> 
> 
> There is no actual decrement in count of the index. Moreover, on repeating
> the solrclean command it again says that it has deleted 34 + 23 documents.
> Don't know if I am doing something wrong or is this behaviour a bug. Has
> anyone faced similar issue?
> 
> The solrclean command seems to work occasionally.
> 
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Index-not-getting-cleaned-up-tp3066254p
> 3066254.html Sent from the Nutch - User mailing list archive at Nabble.com.