You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Kate Kas <ka...@gmail.com> on 2015/12/05 00:23:10 UTC

Indexing Wikipedia

Hi,

i tried to index .xml files from wikipedia articles (
https://dumps.wikimedia.org/enwiki/20150702/) using the method, which is
proposed by solr tutorial (
https://wiki.apache.org/solr/DataImportHandler#Example:_Indexing_wikipedia).

I think that some fields are not indexed, because when i use q equal to *:*
and fl equal to * (
http://localhost:8983/solr/wikipedia/select?q=*%3A*&fl=*&wt=json&indent=true)
, i receive results only for "id" and "_version_"  .

Any idea which could be the problem?

Thank you in advance.

Best,
Kate

Re: Indexing Wikipedia

Posted by Paul Libbrecht <pa...@hoplahup.net>.
SImply... some fields are not stored so they are only searched through
(being indexed) but not given back?
(title and text in the tutorial you refer to). Are these the missing fields?

Paul
> Kate Kas <ma...@gmail.com>
> 5 décembre 2015 00:23
> Hi,
>
> i tried to index .xml files from wikipedia articles (
> https://dumps.wikimedia.org/enwiki/20150702/) using the method, which is
> proposed by solr tutorial (
> https://wiki.apache.org/solr/DataImportHandler#Example:_Indexing_wikipedia).
>
> I think that some fields are not indexed, because when i use q equal
> to *:*
> and fl equal to * (
> http://localhost:8983/solr/wikipedia/select?q=*%3A*&fl=*&wt=json&indent=true)
> , i receive results only for "id" and "_version_" .
>
> Any idea which could be the problem?
>
> Thank you in advance.
>
> Best,
> Kate
>