You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by "tamanjit.bindra@yahoo.co.in" <ta...@yahoo.co.in> on 2011/06/15 08:46:20 UTC
Index not getting cleaned up
Hi,
When I run the cleaner script i.e.
for f in $FILES
do
echo "Runing $f .......";
bin/nutch solrclean crawl/$f/crawldb/ http://solrip
echo "Finished $f ......";
done
Though the log says:
2011-06-15 12:06:02,007 INFO solr.SolrClean - SolrClean: starting at
2011-06-15 12:06:02
2011-06-15 12:06:03,050 INFO solr.SolrClean - SolrClean: deleting 34
documents
2011-06-15 12:06:03,907 INFO solr.SolrClean - SolrClean: deleted a total of
34 documents
2011-06-15 12:06:04,752 INFO solr.SolrClean - SolrClean: finished at
2011-06-15 12:06:04, elapsed: 00:00:02
2011-06-15 12:06:05,343 INFO solr.SolrClean - SolrClean: starting at
2011-06-15 12:06:05
2011-06-15 12:06:06,527 INFO solr.SolrClean - SolrClean: deleting 23
documents
2011-06-15 12:06:07,720 INFO solr.SolrClean - SolrClean: deleted a total of
23 documents
2011-06-15 12:06:08,265 INFO solr.SolrClean - SolrClean: finished at
2011-06-15 12:06:08, elapsed: 00:00:02
There is no actual decrement in count of the index. Moreover, on repeating
the solrclean command it again says that it has deleted 34 + 23 documents.
Don't know if I am doing something wrong or is this behaviour a bug. Has
anyone faced similar issue?
The solrclean command seems to work occasionally.
--
View this message in context: http://lucene.472066.n3.nabble.com/Index-not-getting-cleaned-up-tp3066254p3066254.html
Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Index not getting cleaned up
Posted by "tamanjit.bindra@yahoo.co.in" <ta...@yahoo.co.in>.
That explains it. Thanks.
--
View this message in context: http://lucene.472066.n3.nabble.com/Index-not-getting-cleaned-up-tp3066254p3066366.html
Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Index not getting cleaned up
Posted by Markus Jelsma <ma...@openindex.io>.
It sends delete commands even when the documents are not in the index anyway.
> Hi,
> When I run the cleaner script i.e.
>
> for f in $FILES
> do
> echo "Runing $f .......";
> bin/nutch solrclean crawl/$f/crawldb/ http://solrip
> echo "Finished $f ......";
> done
>
> Though the log says:
>
> 2011-06-15 12:06:02,007 INFO solr.SolrClean - SolrClean: starting at
> 2011-06-15 12:06:02
> 2011-06-15 12:06:03,050 INFO solr.SolrClean - SolrClean: deleting 34
> documents
> 2011-06-15 12:06:03,907 INFO solr.SolrClean - SolrClean: deleted a total
> of 34 documents
> 2011-06-15 12:06:04,752 INFO solr.SolrClean - SolrClean: finished at
> 2011-06-15 12:06:04, elapsed: 00:00:02
> 2011-06-15 12:06:05,343 INFO solr.SolrClean - SolrClean: starting at
> 2011-06-15 12:06:05
> 2011-06-15 12:06:06,527 INFO solr.SolrClean - SolrClean: deleting 23
> documents
> 2011-06-15 12:06:07,720 INFO solr.SolrClean - SolrClean: deleted a total
> of 23 documents
> 2011-06-15 12:06:08,265 INFO solr.SolrClean - SolrClean: finished at
> 2011-06-15 12:06:08, elapsed: 00:00:02
>
>
> There is no actual decrement in count of the index. Moreover, on repeating
> the solrclean command it again says that it has deleted 34 + 23 documents.
> Don't know if I am doing something wrong or is this behaviour a bug. Has
> anyone faced similar issue?
>
> The solrclean command seems to work occasionally.
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Index-not-getting-cleaned-up-tp3066254p
> 3066254.html Sent from the Nutch - User mailing list archive at Nabble.com.