You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by SEZNEC Bruno <br...@finances.gouv.fr> on 2009/09/03 10:24:06 UTC

Solr question

Hi,
 
Following solr tuto,
I send doc to solr by request :
curl
'http://localhost:8983/solr/update/extract?literal.id=doc1&uprefix=attr_&map.
content=attr_content&commit=true' --F "myfile=@Oxiane.pdf"
<response>
<lst name="responseHeader"><int name="status">0</int><int
name="QTime">23717</int></lst>
</response>

Reply seems OK, content is in the index,
but after no query match the doc...
 
TIA
Regards
Bruno
 

RE: Solr question

Posted by SEZNEC Bruno <br...@finances.gouv.fr>.
 Thanks
My idea was that is I have 
<dynamicField name="attr_*" type="textgen" indexed="true" stored="true"
multiValued="true"/>
in schema.xml
Eveything was stored in the index.
The query "solr" or other stuff works well only with text given in the sample
files
Rgds
Bruno


> -----Message d'origine-----
> De : Erik Hatcher [mailto:erik.hatcher@gmail.com] 
> Envoyé : jeudi 3 septembre 2009 13:40
> À : solr-user@lucene.apache.org
> Objet : Re: Solr question
> 
> 
> On Sep 3, 2009, at 1:24 AM, SEZNEC Bruno wrote:
> 
> > Hi,
> >
> > Following solr tuto,
> > I send doc to solr by request :
> > curl
> > 
> 'http://localhost:8983/solr/update/extract?literal.id=doc1&uprefix=att
> > r_&map
> > .
> > content=attr_content&commit=true' --F "myfile=@Oxiane.pdf"
> > <response>
> > <lst name="responseHeader"><int name="status">0</int><int 
> > name="QTime">23717</int></lst> </response>
> >
> > Reply seems OK, content is in the index, but after no query 
> match the 
> > doc...
> 
> Not even a *:* query?  What queries are you trying?  What's 
> your default search field?  What does the query parse to, as 
> seen in the  
> response using &debugQuery=true ?   Likely the problem is that you  
> aren't searching on the field the content was indexed into, 
> or that it was not analyzed as you need.
> 
> 	Erik
> 
> 

RE: Solr question

Posted by SEZNEC Bruno <br...@finances.gouv.fr>.
Response with id:doc4 is OK

<response>
−
<lst name="responseHeader">
<int name="status">0</int>
<int name="QTime">3</int>
−
<lst name="params">
<str name="indent">on</str>
<str name="start">0</str>
<str name="q">id:doc4</str>
<str name="version">2.2</str>
<str name="rows">10</str>
</lst>
</lst>
−
<result name="response" numFound="1" start="0">
−
<doc>
−
<arr name="attr_Author">
<str>Sami Siren</str>
</arr>
−
<arr name="attr_Content-Type">
<str>application/pdf</str>
</arr>
−
<arr name="attr_content">
−
<str>
   Example PDF document     Tika Solr Cell
This is a sample piece of content for Tika Solr Cell article.    
</str>
</arr>
−
<arr name="attr_created">
<str>Wed Dec 31 10:17:13 CET 2008</str>
</arr>
−
<arr name="attr_creator">
<str>Writer</str>
</arr>
−
<arr name="attr_producer">
<str>OpenOffice.org 3.0</str>
</arr>
−
<arr name="attr_stream_content_type">
<str>application/octet-stream</str>
</arr>
−
<arr name="attr_stream_name">
<str>SampleDocument.pdf</str>
</arr>
−
<arr name="attr_stream_size">
<str>18408</str>
</arr>
−
<arr name="attr_stream_source_info">
<str>myfile</str>
</arr>
<str name="id">doc4</str>
<str name="title">Example PDF document</str>
</doc>
</result>
</response>

What I don't understand is why a simple search on title or content
Doesn't works
:
<response>
−
<lst name="responseHeader">
<int name="status">0</int>
<int name="QTime">3</int>
−
<lst name="params">
<str name="indent">on</str>
<str name="start">0</str>
<str name="q">PDF</str>
<str name="version">2.2</str>
<str name="rows">10</str>
</lst>
</lst>
<result name="response" numFound="0" start="0"/>
</response>

Thanks 

> -----Message d'origine-----
> De : Erik Hatcher [mailto:erik.hatcher@gmail.com] 
> Envoyé : jeudi 3 septembre 2009 13:40
> À : solr-user@lucene.apache.org
> Objet : Re: Solr question
> 
> 
> On Sep 3, 2009, at 1:24 AM, SEZNEC Bruno wrote:
> 
> > Hi,
> >
> > Following solr tuto,
> > I send doc to solr by request :
> > curl
> > 
> 'http://localhost:8983/solr/update/extract?literal.id=doc1&uprefix=att
> > r_&map
> > .
> > content=attr_content&commit=true' --F "myfile=@Oxiane.pdf"
> > <response>
> > <lst name="responseHeader"><int name="status">0</int><int 
> > name="QTime">23717</int></lst> </response>
> >
> > Reply seems OK, content is in the index, but after no query 
> match the 
> > doc...
> 
> Not even a *:* query?  What queries are you trying?  What's 
> your default search field?  What does the query parse to, as 
> seen in the  
> response using &debugQuery=true ?   Likely the problem is that you  
> aren't searching on the field the content was indexed into, 
> or that it was not analyzed as you need.
> 
> 	Erik
> 
> 

Re: Solr question

Posted by Erik Hatcher <er...@gmail.com>.
On Sep 3, 2009, at 1:24 AM, SEZNEC Bruno wrote:

> Hi,
>
> Following solr tuto,
> I send doc to solr by request :
> curl
> 'http://localhost:8983/solr/update/extract?literal.id=doc1&uprefix=attr_&map 
> .
> content=attr_content&commit=true' --F "myfile=@Oxiane.pdf"
> <response>
> <lst name="responseHeader"><int name="status">0</int><int
> name="QTime">23717</int></lst>
> </response>
>
> Reply seems OK, content is in the index,
> but after no query match the doc...

Not even a *:* query?  What queries are you trying?  What's your  
default search field?  What does the query parse to, as seen in the  
response using &debugQuery=true ?   Likely the problem is that you  
aren't searching on the field the content was indexed into, or that it  
was not analyzed as you need.

	Erik