You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by MilleBii <mi...@gmail.com> on 2009/12/11 22:21:59 UTC

Luke reading index in hdfs

Guys is there a way you can get Luke to read the index from hdfs:// ???
Or you have to copy it out to the local filesystem?

-- 
-MilleBii-

Re: Luke reading index in hdfs

Posted by MilleBii <mi...@gmail.com>.
Great thx I can open it will help,
 but I don't get the summary page to be populated is this normal ???

2009/12/11 Andrzej Bialecki <ab...@getopt.org>

> On 2009-12-11 22:21, MilleBii wrote:
>
>> Guys is there a way you can get Luke to read the index from hdfs:// ???
>> Or you have to copy it out to the local filesystem?
>>
>>
> Luke 0.9.9 can open indexes directly from HDFS hosted on Hadoop 0.19.x.
> Luke 0.9.9.1 can do the same, but uses Hadoop 0.20.1.
>
> Start Luke, dismiss the open dialog, and then go to Plugins / Hadoop, and
> enter the full URL of the index directory (including the hdfs:// part). You
> can also open multiple parts of the index (e.g. if you follow the Nutch
> naming convention, you can directly open the "indexes/" directory that
> contains part-NNNNN partial indexes).
>
>
> --
> Best regards,
> Andrzej Bialecki     <><
>  ___. ___ ___ ___ _ _   __________________________________
> [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
> ___|||__||  \|  ||  |  Embedded Unix, System Integration
> http://www.sigram.com  Contact: info at sigram dot com
>
>


-- 
-MilleBii-

Re: Luke reading index in hdfs

Posted by Andrzej Bialecki <ab...@getopt.org>.
On 2009-12-11 22:21, MilleBii wrote:
> Guys is there a way you can get Luke to read the index from hdfs:// ???
> Or you have to copy it out to the local filesystem?
>

Luke 0.9.9 can open indexes directly from HDFS hosted on Hadoop 0.19.x.
Luke 0.9.9.1 can do the same, but uses Hadoop 0.20.1.

Start Luke, dismiss the open dialog, and then go to Plugins / Hadoop, 
and enter the full URL of the index directory (including the hdfs:// 
part). You can also open multiple parts of the index (e.g. if you follow 
the Nutch naming convention, you can directly open the "indexes/" 
directory that contains part-NNNNN partial indexes).


-- 
Best regards,
Andrzej Bialecki     <><
  ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com