You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by howard chen <ho...@gmail.com> on 2006/12/03 19:22:57 UTC
Converting Lucene Demo to search index in DFS
Hi,
I have been playing with Lucene & Hadoop.
The Lucene demo (Web) is a great tutorial for people to understand Lucene,
e.g. http://lucene.apache.org/java/docs/demo3.html
I want to ask
1. If I put the index in the DFS of Hadoop, is it easy to modify the
codes to search in the DFS, rather than local FS? (ignore abt
mapreduce first, I mean just search index in the DFS from web server)
2. More than (1), now if I want to search the index from serveral
running nodes using mapreduce, is the wordcount example a good
starting point?
Thanks for any comments and suggestions.
howa.
Re: Converting Lucene Demo to search index in DFS
Posted by howard chen <ho...@gmail.com>.
On 12/4/06, howard chen <ho...@gmail.com> wrote:
> Hi,
>
> I have been playing with Lucene & Hadoop.
>
> The Lucene demo (Web) is a great tutorial for people to understand Lucene,
>
> e.g. http://lucene.apache.org/java/docs/demo3.html
>
> I want to ask
>
> 1. If I put the index in the DFS of Hadoop, is it easy to modify the
> codes to search in the DFS, rather than local FS? (ignore abt
> mapreduce first, I mean just search index in the DFS from web server)
>
> 2. More than (1), now if I want to search the index from serveral
> running nodes using mapreduce, is the wordcount example a good
> starting point?
>
> Thanks for any comments and suggestions.
>
> howa.
>
in fact, the problem maybe just i don't know how to split the lucene
index, if i can split the index, the flow i suppose is similar to the
word count example.
can anyone experience in nutch can tell me how to deal with index splitting?
thanks.