You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by SEZNEC Bruno <br...@finances.gouv.fr> on 2009/09/03 10:24:06 UTC
Solr question
Hi,
Following solr tuto,
I send doc to solr by request :
curl
'http://localhost:8983/solr/update/extract?literal.id=doc1&uprefix=attr_&map.
content=attr_content&commit=true' --F "myfile=@Oxiane.pdf"
<response>
<lst name="responseHeader"><int name="status">0</int><int
name="QTime">23717</int></lst>
</response>
Reply seems OK, content is in the index,
but after no query match the doc...
TIA
Regards
Bruno
RE: Solr question
Posted by SEZNEC Bruno <br...@finances.gouv.fr>.
Thanks
My idea was that is I have
<dynamicField name="attr_*" type="textgen" indexed="true" stored="true"
multiValued="true"/>
in schema.xml
Eveything was stored in the index.
The query "solr" or other stuff works well only with text given in the sample
files
Rgds
Bruno
> -----Message d'origine-----
> De : Erik Hatcher [mailto:erik.hatcher@gmail.com]
> Envoyé : jeudi 3 septembre 2009 13:40
> À : solr-user@lucene.apache.org
> Objet : Re: Solr question
>
>
> On Sep 3, 2009, at 1:24 AM, SEZNEC Bruno wrote:
>
> > Hi,
> >
> > Following solr tuto,
> > I send doc to solr by request :
> > curl
> >
> 'http://localhost:8983/solr/update/extract?literal.id=doc1&uprefix=att
> > r_&map
> > .
> > content=attr_content&commit=true' --F "myfile=@Oxiane.pdf"
> > <response>
> > <lst name="responseHeader"><int name="status">0</int><int
> > name="QTime">23717</int></lst> </response>
> >
> > Reply seems OK, content is in the index, but after no query
> match the
> > doc...
>
> Not even a *:* query? What queries are you trying? What's
> your default search field? What does the query parse to, as
> seen in the
> response using &debugQuery=true ? Likely the problem is that you
> aren't searching on the field the content was indexed into,
> or that it was not analyzed as you need.
>
> Erik
>
>
RE: Solr question
Posted by SEZNEC Bruno <br...@finances.gouv.fr>.
Response with id:doc4 is OK
<response>
−
<lst name="responseHeader">
<int name="status">0</int>
<int name="QTime">3</int>
−
<lst name="params">
<str name="indent">on</str>
<str name="start">0</str>
<str name="q">id:doc4</str>
<str name="version">2.2</str>
<str name="rows">10</str>
</lst>
</lst>
−
<result name="response" numFound="1" start="0">
−
<doc>
−
<arr name="attr_Author">
<str>Sami Siren</str>
</arr>
−
<arr name="attr_Content-Type">
<str>application/pdf</str>
</arr>
−
<arr name="attr_content">
−
<str>
Example PDF document Tika Solr Cell
This is a sample piece of content for Tika Solr Cell article.
</str>
</arr>
−
<arr name="attr_created">
<str>Wed Dec 31 10:17:13 CET 2008</str>
</arr>
−
<arr name="attr_creator">
<str>Writer</str>
</arr>
−
<arr name="attr_producer">
<str>OpenOffice.org 3.0</str>
</arr>
−
<arr name="attr_stream_content_type">
<str>application/octet-stream</str>
</arr>
−
<arr name="attr_stream_name">
<str>SampleDocument.pdf</str>
</arr>
−
<arr name="attr_stream_size">
<str>18408</str>
</arr>
−
<arr name="attr_stream_source_info">
<str>myfile</str>
</arr>
<str name="id">doc4</str>
<str name="title">Example PDF document</str>
</doc>
</result>
</response>
What I don't understand is why a simple search on title or content
Doesn't works
:
<response>
−
<lst name="responseHeader">
<int name="status">0</int>
<int name="QTime">3</int>
−
<lst name="params">
<str name="indent">on</str>
<str name="start">0</str>
<str name="q">PDF</str>
<str name="version">2.2</str>
<str name="rows">10</str>
</lst>
</lst>
<result name="response" numFound="0" start="0"/>
</response>
Thanks
> -----Message d'origine-----
> De : Erik Hatcher [mailto:erik.hatcher@gmail.com]
> Envoyé : jeudi 3 septembre 2009 13:40
> À : solr-user@lucene.apache.org
> Objet : Re: Solr question
>
>
> On Sep 3, 2009, at 1:24 AM, SEZNEC Bruno wrote:
>
> > Hi,
> >
> > Following solr tuto,
> > I send doc to solr by request :
> > curl
> >
> 'http://localhost:8983/solr/update/extract?literal.id=doc1&uprefix=att
> > r_&map
> > .
> > content=attr_content&commit=true' --F "myfile=@Oxiane.pdf"
> > <response>
> > <lst name="responseHeader"><int name="status">0</int><int
> > name="QTime">23717</int></lst> </response>
> >
> > Reply seems OK, content is in the index, but after no query
> match the
> > doc...
>
> Not even a *:* query? What queries are you trying? What's
> your default search field? What does the query parse to, as
> seen in the
> response using &debugQuery=true ? Likely the problem is that you
> aren't searching on the field the content was indexed into,
> or that it was not analyzed as you need.
>
> Erik
>
>
Re: Solr question
Posted by Erik Hatcher <er...@gmail.com>.
On Sep 3, 2009, at 1:24 AM, SEZNEC Bruno wrote:
> Hi,
>
> Following solr tuto,
> I send doc to solr by request :
> curl
> 'http://localhost:8983/solr/update/extract?literal.id=doc1&uprefix=attr_&map
> .
> content=attr_content&commit=true' --F "myfile=@Oxiane.pdf"
> <response>
> <lst name="responseHeader"><int name="status">0</int><int
> name="QTime">23717</int></lst>
> </response>
>
> Reply seems OK, content is in the index,
> but after no query match the doc...
Not even a *:* query? What queries are you trying? What's your
default search field? What does the query parse to, as seen in the
response using &debugQuery=true ? Likely the problem is that you
aren't searching on the field the content was indexed into, or that it
was not analyzed as you need.
Erik