You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Sam Lee <vi...@yahoo.com> on 2005/10/26 21:34:53 UTC

Would Someone Give Me Pointer On How to Index Database?

Hi,
  I want to use Lucene/Nutch to index my mysql
database.  I think of using JDBC, is it a good idea? 
I searched all over the web, but all the examples are
non-lucene/Nutch related.  Would you guys give me
pointers or websites or examples on how to use JDBC on
Lucene/Nutch to index mysql database?  

Many thanks.


		
__________________________________ 
Start your day with Yahoo! - Make it your home page! 
http://www.yahoo.com/r/hs

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


RE: Would Someone Give Me Pointer On How to Index Database?

Posted by Nick Temple <ni...@nicktemple.com>.
It sounds to me like this is not a Nutch / lucene issue at all.

Assuming you are "following" the AdWords model, I see this as two really seperate problems that "come together" in the presentation layer.

I believe what you will want to do is modify the JSP's ... one thread retrieves the search results from Nutch,  the second would query your database for the advertising, then you'd stitch it together at the presentation layer.

As far as indexing the DB, I don't believe you need to - you simply need to query it.

Nick

*********** REPLY SEPARATOR  ***********

On 10/26/2005 at 8:56 PM Sam Lee wrote:

>The data I need to index are webpages which will be
>retrieved by the Nutch's crawler, but it doesn't
>retrieve all webpages.  I will give it a list of
>webpages to crawl.  
>
>In my mysql database, it has all the data similar to
>the ones the advertisers enter in Google Adwords, e.g.
>ad content, keywords, etc.  So I need to index these
>data in mysql db.
> 
>
>--- Fuad Efendi <fu...@efendi.ca> wrote:
>
>> Question: what do you need to index?
>> 
>> Simple answer "I need to index my MySQL" is not
>> enough... MySQL has own
>> indexes...
>> 
>> Nutch is an Internet Search Engine; Lucene is a
>> framework for indexing and
>> searching of any text information... Does your
>> database contain huge text
>> fields, "Documents"? 
>> 
>> 
>> -----Original Message-----
>> From: Sam Lee [mailto:victor_emailbox@yahoo.com] 
>> Sent: Wednesday, October 26, 2005 3:35 PM
>> To: java-user@lucene.apache.org
>> Subject: Would Someone Give Me Pointer On How to
>> Index Database?
>> 
>> 
>> Hi,
>>   I want to use Lucene/Nutch to index my mysql
>> database.  I think of using JDBC, is it a good idea?
>> 
>> I searched all over the web, but all the examples
>> are
>> non-lucene/Nutch related.  Would you guys give me
>> pointers or websites or examples on how to use JDBC
>> on
>> Lucene/Nutch to index mysql database?  
>> 
>> Many thanks.
>> 
>> 
>> 		
>> __________________________________ 
>> Start your day with Yahoo! - Make it your home page!
>> 
>> http://www.yahoo.com/r/hs
>> 
>> 
>> 
>
>
>
>	
>		
>__________________________________ 
>Yahoo! Mail - PC Magazine Editors' Choice 2005 
>http://mail.yahoo.com




Re: Would Someone Give Me Pointer On How to Index Database?

Posted by Stefan Groschupf <sg...@media-style.com>.
Just use lucene to index your mysql data.
There is a great book from Erik that gives you all you need to  
understand lucene.
BTW it has a chapter over nutch as well.

Stefan

Am 27.10.2005 um 05:56 schrieb Sam Lee:

> The data I need to index are webpages which will be
> retrieved by the Nutch's crawler, but it doesn't
> retrieve all webpages.  I will give it a list of
> webpages to crawl.
>
> In my mysql database, it has all the data similar to
> the ones the advertisers enter in Google Adwords, e.g.
> ad content, keywords, etc.  So I need to index these
> data in mysql db.
>
>
> --- Fuad Efendi <fu...@efendi.ca> wrote:
>
>
>> Question: what do you need to index?
>>
>> Simple answer "I need to index my MySQL" is not
>> enough... MySQL has own
>> indexes...
>>
>> Nutch is an Internet Search Engine; Lucene is a
>> framework for indexing and
>> searching of any text information... Does your
>> database contain huge text
>> fields, "Documents"?
>>
>>
>> -----Original Message-----
>> From: Sam Lee [mailto:victor_emailbox@yahoo.com]
>> Sent: Wednesday, October 26, 2005 3:35 PM
>> To: java-user@lucene.apache.org
>> Subject: Would Someone Give Me Pointer On How to
>> Index Database?
>>
>>
>> Hi,
>>   I want to use Lucene/Nutch to index my mysql
>> database.  I think of using JDBC, is it a good idea?
>>
>> I searched all over the web, but all the examples
>> are
>> non-lucene/Nutch related.  Would you guys give me
>> pointers or websites or examples on how to use JDBC
>> on
>> Lucene/Nutch to index mysql database?
>>
>> Many thanks.
>>
>>
>>
>> __________________________________
>> Start your day with Yahoo! - Make it your home page!
>>
>> http://www.yahoo.com/r/hs
>>
>>
>>
>>
>
>
>
>
>
> __________________________________
> Yahoo! Mail - PC Magazine Editors' Choice 2005
> http://mail.yahoo.com
>
>


RE: Would Someone Give Me Pointer On How to Index Database?

Posted by Sam Lee <vi...@yahoo.com>.
The data I need to index are webpages which will be
retrieved by the Nutch's crawler, but it doesn't
retrieve all webpages.  I will give it a list of
webpages to crawl.  

In my mysql database, it has all the data similar to
the ones the advertisers enter in Google Adwords, e.g.
ad content, keywords, etc.  So I need to index these
data in mysql db.
 

--- Fuad Efendi <fu...@efendi.ca> wrote:

> Question: what do you need to index?
> 
> Simple answer "I need to index my MySQL" is not
> enough... MySQL has own
> indexes...
> 
> Nutch is an Internet Search Engine; Lucene is a
> framework for indexing and
> searching of any text information... Does your
> database contain huge text
> fields, "Documents"? 
> 
> 
> -----Original Message-----
> From: Sam Lee [mailto:victor_emailbox@yahoo.com] 
> Sent: Wednesday, October 26, 2005 3:35 PM
> To: java-user@lucene.apache.org
> Subject: Would Someone Give Me Pointer On How to
> Index Database?
> 
> 
> Hi,
>   I want to use Lucene/Nutch to index my mysql
> database.  I think of using JDBC, is it a good idea?
> 
> I searched all over the web, but all the examples
> are
> non-lucene/Nutch related.  Would you guys give me
> pointers or websites or examples on how to use JDBC
> on
> Lucene/Nutch to index mysql database?  
> 
> Many thanks.
> 
> 
> 		
> __________________________________ 
> Start your day with Yahoo! - Make it your home page!
> 
> http://www.yahoo.com/r/hs
> 
> 
> 



	
		
__________________________________ 
Yahoo! Mail - PC Magazine Editors' Choice 2005 
http://mail.yahoo.com

RE: Would Someone Give Me Pointer On How to Index Database?

Posted by Fuad Efendi <fu...@efendi.ca>.
Question: what do you need to index?

Simple answer "I need to index my MySQL" is not enough... MySQL has own
indexes...

Nutch is an Internet Search Engine; Lucene is a framework for indexing and
searching of any text information... Does your database contain huge text
fields, "Documents"? 


-----Original Message-----
From: Sam Lee [mailto:victor_emailbox@yahoo.com] 
Sent: Wednesday, October 26, 2005 3:35 PM
To: java-user@lucene.apache.org
Subject: Would Someone Give Me Pointer On How to Index Database?


Hi,
  I want to use Lucene/Nutch to index my mysql
database.  I think of using JDBC, is it a good idea? 
I searched all over the web, but all the examples are
non-lucene/Nutch related.  Would you guys give me
pointers or websites or examples on how to use JDBC on
Lucene/Nutch to index mysql database?  

Many thanks.


		
__________________________________ 
Start your day with Yahoo! - Make it your home page! 
http://www.yahoo.com/r/hs