You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Erick Erickson <er...@gmail.com> on 2009/11/06 15:56:33 UTC

Re: DataImportHandler showing 167000 documents, search only every returns 1

Sure. Get a copy of Luke (google Lucene Luke) and point it at your index
files (it's easier than it sounds). That'll let you examine what's actually
in your index...

That'll help you figure out what your queries *should* return. After that,
try adding debug=true to your query in SOLR to see how the search
is actually performed.

And if that doesn't help, you might post more information on your schema,
query, etc.

Best
Erick

On Fri, Nov 6, 2009 at 9:25 AM, Mark Ellul <ma...@catalystic.com> wrote:

> Hi,
>
> I am using Solr 1.3 on Tomcat (on Ubuntu) importing data from postgres!
>
> I have successfully imported over 167000 documents using the
> DataImportHandler, doing a full-import
>
> I have set a field with a template using the TemplateTransformer to have 1
> value for all documents.
>
> Thus giving me the ability to search for all of the documents.
>
> However it only every brings back 1 document when I use this value. If I
> try
> to queries with anything not in that 1 document it returns no documents.
>
> Is there a step or anything I have to do after a command=full-import (
> http://localhost:8080/solr/dataimport/?command=full-import) ?
>
> Or is there any other way I can make sure the documents are actually
> indexed?
>
> Below is the result from calling  http://localhost:8080/solr/dataimport
>
> <response> <lst name="responseHeader"><int name="status">0</int><int name="
> QTime">0</int></lst><lst name="initArgs"><lst name="defaults"><str name="
> config">/opt/tomcat/solr/conf/data-config.xml</str></lst></lst><str name="
> status">idle</str><str name="importResponse"/><lst
> name="statusMessages"><str
> name="Total Requests made to DataSource">16142</str><str name="Total Rows
> Fetched">167447</str><str name="Total Documents Skipped">0</str><str
> name="Full
> Dump Started">2009-11-06 13:44:18</str><str name="">Indexing completed.
> Added/Updated: 167447 documents. Deleted 0 documents.</str><str name="
> Committed">2009-11-06 13:50:21</str><str name="Optimized">2009-11-06
> 13:50:21</str><str name="Time taken ">0:19:49.223</str></lst><str name="
> WARNING">This response format is experimental. It is likely to change in
> the
> future.</str> </response>
>
> Any Help would be really appreciated!
>
> Regards
>
> Mark
>

Re: DataImportHandler showing 167000 documents, search only every returns 1

Posted by Mark Ellul <ma...@catalystic.com>.
Chantal,

Thanks for the response...

Actually yes... its storing all the documents into one...

I am a total newb... and I copied config from a live server, but thats not
my issue... am doing a select in my DataImportHandler, maybe its not
populating properly...

Is it ok for the unique identifier to be a string in the schema?

PS: http://www.getopt.org/luke/ AWESOME Site for LUKE! Should maybe be on
wiki!

Regards

Mark


On Fri, Nov 6, 2009 at 4:10 PM, Chantal Ackermann <
chantal.ackermann@btelligent.de> wrote:

>
>  Sure. Get a copy of Luke (google Lucene Luke) and point it at your index
>>> files (it's easier than it sounds). That'll let you examine what's
>>> actually
>>> in your index...
>>>
>>
> Most of Luke is already available in the admin. Just point your browser to
> http://my-solr-server:????/solr/[core?]/admin/schema.jsp
>
> If there is really only one document in your index, maybe you did not
> define the correct unique identifier? So that SOLR stored everything into
> one document only.
>
> Cheers,
> Chantal
>

Re: DataImportHandler showing 167000 documents, search only every returns 1

Posted by Chantal Ackermann <ch...@btelligent.de>.
>> Sure. Get a copy of Luke (google Lucene Luke) and point it at your index
>> files (it's easier than it sounds). That'll let you examine what's actually
>> in your index...

Most of Luke is already available in the admin. Just point your browser to
http://my-solr-server:????/solr/[core?]/admin/schema.jsp

If there is really only one document in your index, maybe you did not 
define the correct unique identifier? So that SOLR stored everything 
into one document only.

Cheers,
Chantal

Re: DataImportHandler showing 167000 documents, search only every returns 1

Posted by Mark Ellul <ma...@catalystic.com>.
Hi Erick,

Thanks! Will give that a shot!

REgards

Mark

On Fri, Nov 6, 2009 at 3:56 PM, Erick Erickson <er...@gmail.com>wrote:

> Sure. Get a copy of Luke (google Lucene Luke) and point it at your index
> files (it's easier than it sounds). That'll let you examine what's actually
> in your index...
>
> That'll help you figure out what your queries *should* return. After that,
> try adding debug=true to your query in SOLR to see how the search
> is actually performed.
>
> And if that doesn't help, you might post more information on your schema,
> query, etc.
>
> Best
> Erick
>
> On Fri, Nov 6, 2009 at 9:25 AM, Mark Ellul <ma...@catalystic.com> wrote:
>
> > Hi,
> >
> > I am using Solr 1.3 on Tomcat (on Ubuntu) importing data from postgres!
> >
> > I have successfully imported over 167000 documents using the
> > DataImportHandler, doing a full-import
> >
> > I have set a field with a template using the TemplateTransformer to have
> 1
> > value for all documents.
> >
> > Thus giving me the ability to search for all of the documents.
> >
> > However it only every brings back 1 document when I use this value. If I
> > try
> > to queries with anything not in that 1 document it returns no documents.
> >
> > Is there a step or anything I have to do after a command=full-import (
> > http://localhost:8080/solr/dataimport/?command=full-import) ?
> >
> > Or is there any other way I can make sure the documents are actually
> > indexed?
> >
> > Below is the result from calling  http://localhost:8080/solr/dataimport
> >
> > <response> <lst name="responseHeader"><int name="status">0</int><int
> name="
> > QTime">0</int></lst><lst name="initArgs"><lst name="defaults"><str name="
> > config">/opt/tomcat/solr/conf/data-config.xml</str></lst></lst><str
> name="
> > status">idle</str><str name="importResponse"/><lst
> > name="statusMessages"><str
> > name="Total Requests made to DataSource">16142</str><str name="Total Rows
> > Fetched">167447</str><str name="Total Documents Skipped">0</str><str
> > name="Full
> > Dump Started">2009-11-06 13:44:18</str><str name="">Indexing completed.
> > Added/Updated: 167447 documents. Deleted 0 documents.</str><str name="
> > Committed">2009-11-06 13:50:21</str><str name="Optimized">2009-11-06
> > 13:50:21</str><str name="Time taken ">0:19:49.223</str></lst><str name="
> > WARNING">This response format is experimental. It is likely to change in
> > the
> > future.</str> </response>
> >
> > Any Help would be really appreciated!
> >
> > Regards
> >
> > Mark
> >
>