You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Armin Schleicher <Ar...@uibk.ac.at> on 2011/11/14 16:26:06 UTC

Solr index is not being updated when using nutch solrindex

Hello,

i have a problem using the nutch solrindex tool. everything seems to 
work fine, i do not get any error messages, but the solr index doesn't 
seem to get updated. i can see in the solr log, that the updates were 
commited, but when i search against solr, it doesn't return any results.
what could be the problem here? i'm using solr 3.4 and nutch 1.3.
any help will be highly appreciated, i'm kind of stuck here...

bests, Armin

Re: Solr index is not being updated when using nutch solrindex

Posted by jc <jv...@gmail.com>.
Hi,

I was mistaken, docs were indexing just fine, the error was i was sending
q=*:* to a request handlar that was using dismax by default, not standard

Regards
El 06/08/2012 21:19, "veryblues_cn [via Lucene]" <
ml-node+s472066n3999503h20@n3.nabble.com> escribió:

> Hi ,I got that problem too.But have not solved it yet, do you have any
> solution or now ? Or any new understanding?
> thanks
>
> jc wrote
> please, I have the same exact issue
> I'm using nutch 1.3 and solr 3.5, everything was working fine until
> recently, it all changed when I moved my solr installation to version
> control
>
> Any news about how to resolve this?
>
>
>
> ------------------------------
>  If you reply to this email, your message will be added to the discussion
> below:
>
> http://lucene.472066.n3.nabble.com/Solr-index-is-not-being-updated-when-using-nutch-solrindex-tp3507122p3999503.html
>  To unsubscribe from Solr index is not being updated when using nutch
> solrindex, click here<http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_code&node=3507122&code=anZpenVldGFAZ21haWwuY29tfDM1MDcxMjJ8MTI1MzY5Njk0NQ==>
> .
> NAML<http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml>
>




--
View this message in context: http://lucene.472066.n3.nabble.com/Solr-index-is-not-being-updated-when-using-nutch-solrindex-tp3507122p3999626.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Re: Solr index is not being updated when using nutch solrindex

Posted by veryblues_cn <lh...@gmail.com>.
Hi ,I got that problem too.But have not solved it yet, do you have any
solution or now ? Or any new understanding?
thanks

jc wrote
> 
> please, I have the same exact issue
> I'm using nutch 1.3 and solr 3.5, everything was working fine until
> recently, it all changed when I moved my solr installation to version
> control
> 
> Any news about how to resolve this?
> 




--
View this message in context: http://lucene.472066.n3.nabble.com/Solr-index-is-not-being-updated-when-using-nutch-solrindex-tp3507122p3999503.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Re: Solr index is not being updated when using nutch solrindex

Posted by jc <jv...@gmail.com>.
please, I have the same exact issue
I'm using nutch 1.3 and solr 3.5, everything was working fine until
recently, it all changed when I moved my solr installation to version
control

Any news about how to resolve this?

--
View this message in context: http://lucene.472066.n3.nabble.com/Solr-index-is-not-being-updated-when-using-nutch-solrindex-tp3507122p3994077.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Re: Solr index is not being updated when using nutch solrindex

Posted by Armin Schleicher <Ar...@uibk.ac.at>.
thanks  for your reply!

q=*:* returns nothing. I double checked the nutch scheme.xml is in the 
right place and i don't get any errors at all and when i dump the 
segments into a textfile, all the data is there...
i create the segments using nutchwax import command from *.warc.gz files 
created by archive-it! (heritrix) and then create crawldb and linkdb 
using nutch updatedb and invertlinks commands.
here is my procedure:

*create solrindex*

    /sh /nutch-1.3/runtime/local/bin/nutch solrindex
    http://127.0.0.1:8983/solr/ /crawldb /linkdb /segments_test//


*nutch output:

*

    /SolrIndexer: starting at 2011-11-15 08:45:53
    SolrIndexer: finished at 2011-11-15 08:45:57, elapsed: 00:00:03/

*
*
*this is the resulting solr/jetty output:*

    /15.11.2011 08:45:57 org.apache.solr.update.DirectUpdateHandler2 commit
    INFO: start
    commit(optimize=false,waitFlush=true,waitSearcher=true,expungeDeletes=false)
    15.11.2011 08:45:57 org.apache.solr.search.SolrIndexSearcher <init>
    INFO: Opening Searcher@3d015a9e main
    15.11.2011 08:45:57 org.apache.solr.update.DirectUpdateHandler2 commit
    INFO: end_commit_flush
    15.11.2011 08:45:57 org.apache.solr.search.SolrIndexSearcher warm
    INFO: autowarming Searcher@3d015a9e main from Searcher@4743bf3d main
        
    fieldValueCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
    15.11.2011 08:45:57 org.apache.solr.search.SolrIndexSearcher warm
    INFO: autowarming result for Searcher@3d015a9e main
        
    fieldValueCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
    15.11.2011 08:45:57 org.apache.solr.search.SolrIndexSearcher warm
    INFO: autowarming Searcher@3d015a9e main from Searcher@4743bf3d main
        
    filterCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
    15.11.2011 08:45:57 org.apache.solr.search.SolrIndexSearcher warm
    INFO: autowarming result for Searcher@3d015a9e main
        
    filterCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
    15.11.2011 08:45:57 org.apache.solr.search.SolrIndexSearcher warm
    INFO: autowarming Searcher@3d015a9e main from Searcher@4743bf3d main
        
    queryResultCache{lookups=1,hits=0,hitratio=0.00,inserts=2,evictions=0,size=2,warmupTime=0,cumulative_lookups=1,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=1,cumulative_evictions=0}
    15.11.2011 08:45:57 org.apache.solr.search.SolrIndexSearcher warm
    INFO: autowarming result for Searcher@3d015a9e main
        
    queryResultCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=1,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=1,cumulative_evictions=0}
    15.11.2011 08:45:57 org.apache.solr.search.SolrIndexSearcher warm
    INFO: autowarming Searcher@3d015a9e main from Searcher@4743bf3d main
        
    documentCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
    15.11.2011 08:45:57 org.apache.solr.search.SolrIndexSearcher warm
    INFO: autowarming result for Searcher@3d015a9e main
        
    documentCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
    15.11.2011 08:45:57 org.apache.solr.core.QuerySenderListener newSearcher
    INFO: QuerySenderListener sending requests to Searcher@3d015a9e main
    15.11.2011 08:45:57 org.apache.solr.core.QuerySenderListener newSearcher
    INFO: QuerySenderListener done.
    15.11.2011 08:45:57 org.apache.solr.core.SolrCore registerSearcher
    INFO: [] Registered new searcher Searcher@3d015a9e main
    15.11.2011 08:45:57 org.apache.solr.search.SolrIndexSearcher close
    INFO: Closing Searcher@4743bf3d main
        
    fieldValueCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
        
    filterCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
        
    queryResultCache{lookups=1,hits=0,hitratio=0.00,inserts=2,evictions=0,size=2,warmupTime=0,cumulative_lookups=1,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=1,cumulative_evictions=0}
        
    documentCache{lookups=0,hits=0,hitratio=0.00,inserts=0,evictions=0,size=0,warmupTime=0,cumulative_lookups=0,cumulative_hits=0,cumulative_hitratio=0.00,cumulative_inserts=0,cumulative_evictions=0}
    15.11.2011 08:45:57
    org.apache.solr.update.processor.LogUpdateProcessor finish
    INFO: {commit=} 0 48
    15.11.2011 08:45:57 org.apache.solr.core.SolrCore execute
    INFO: [] webapp=/solr path=/update
    params={waitSearcher=true&waitFlush=true&wt=javabin&commit=true&version=2}
    status=0 QTime=48



    /



On 2011-11-14 16:29, Markus Jelsma wrote:
> If Solr's log tells you data is being indexed then there's no Nutch issue
> here. Do you commit? Are you using bad analyzers that won't return any
> results? What does q=*:* return?
>
> On Monday 14 November 2011 16:26:06 Armin Schleicher wrote:
>> Hello,
>>
>> i have a problem using the nutch solrindex tool. everything seems to
>> work fine, i do not get any error messages, but the solr index doesn't
>> seem to get updated. i can see in the solr log, that the updates were
>> commited, but when i search against solr, it doesn't return any results.
>> what could be the problem here? i'm using solr 3.4 and nutch 1.3.
>> any help will be highly appreciated, i'm kind of stuck here...
>>
>> bests, Armin


Re: Solr index is not being updated when using nutch solrindex

Posted by Markus Jelsma <ma...@openindex.io>.
If Solr's log tells you data is being indexed then there's no Nutch issue 
here. Do you commit? Are you using bad analyzers that won't return any 
results? What does q=*:* return?

On Monday 14 November 2011 16:26:06 Armin Schleicher wrote:
> Hello,
> 
> i have a problem using the nutch solrindex tool. everything seems to
> work fine, i do not get any error messages, but the solr index doesn't
> seem to get updated. i can see in the solr log, that the updates were
> commited, but when i search against solr, it doesn't return any results.
> what could be the problem here? i'm using solr 3.4 and nutch 1.3.
> any help will be highly appreciated, i'm kind of stuck here...
> 
> bests, Armin

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350