You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Ben <ne...@gmail.com> on 2005/03/06 13:20:52 UTC
Alerts for search results
Hi
I would like to allow users on my site be able to create alerts base
on the search keywords for every newly posted article. It would be
good to send out the alert emails hourly, daily or weekly, etc.
Anyone has any experience in this area? What are the best practices
when implement such feature? I would imagine it's going to take a lot
of resources to do a search for each keyword.
Any guidance is greatly appreciated. Thanks!
Ben
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: Alerts for search results
Posted by Ben <ne...@gmail.com>.
Thanks for your help, something for me to start with.
PS: Sorry about the double posts
-Ben
On Sun, 06 Mar 2005 16:35:56 +0400, Nader Henein <ns...@bayt.net> wrote:
> Well since you're doing it by keyword, it's a little tricky coz if you
> want to batch like searches with each other there won't be much
> similarity, especially if you allow the user to select Any Word | All
> the Words | Exact phrase, we run 45 000 search agents which dispatch an
> email to users daily to inform them of new jobs added to the site, we
> tor the date of the run and the Lucene search string and then the
> scheduler runs hourly to fetch results/ or result count and send the
> mailer to each person who's email is due. Since we allow for 19 separate
> search criteria batching has not helped us much because the probability
> of enough users having the same search criteria is lower. But you should
> try it though, allow your users to start adding alerts, and do your
> calculations whether batching would help or not.
>
> That was the pull scenario, there is also the push scenario using a
> hashing function, so when a new article is added it's words are compared
> to the words that your users have flagged in their alerts and an email
> is dispatched to interested users, be aware that this only makes sense
> if the total number of distinct words in the document is less than the
> number of distinct keywords your users have flagged, which becomes the
> case when you accumulate a lot of agents.
>
> Here's a quick and dirty calculation:
> Pull Scenario: you will run as many searches as there are distinct alerts
> Push scenario : you will run as many searches as there are distinct
> words per document.
>
> Hope this helps.
>
> Nader Henein
>
>
> Ben wrote:
>
> >Hi
> >
> >I would like to allow users on my site be able to create alerts base
> >on the search keywords for every newly posted article. It would be
> >good to send out the alert emails hourly, daily or weekly, etc.
> >
> >Anyone has any experience in this area? What are the best practices
> >when implement such feature? I would imagine it's going to take a lot
> >of resources to do a search for each keyword.
> >
> >Any guidance is greatly appreciated. Thanks!
> >
> >Ben
> >
> >---------------------------------------------------------------------
> >To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> >For additional commands, e-mail: java-user-help@lucene.apache.org
> >
> >
> >
> >
> >
> >
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: Alerts for search results
Posted by Nader Henein <ns...@bayt.net>.
Well since you're doing it by keyword, it's a little tricky coz if you
want to batch like searches with each other there won't be much
similarity, especially if you allow the user to select Any Word | All
the Words | Exact phrase, we run 45 000 search agents which dispatch an
email to users daily to inform them of new jobs added to the site, we
tor the date of the run and the Lucene search string and then the
scheduler runs hourly to fetch results/ or result count and send the
mailer to each person who's email is due. Since we allow for 19 separate
search criteria batching has not helped us much because the probability
of enough users having the same search criteria is lower. But you should
try it though, allow your users to start adding alerts, and do your
calculations whether batching would help or not.
That was the pull scenario, there is also the push scenario using a
hashing function, so when a new article is added it's words are compared
to the words that your users have flagged in their alerts and an email
is dispatched to interested users, be aware that this only makes sense
if the total number of distinct words in the document is less than the
number of distinct keywords your users have flagged, which becomes the
case when you accumulate a lot of agents.
Here's a quick and dirty calculation:
Pull Scenario: you will run as many searches as there are distinct alerts
Push scenario : you will run as many searches as there are distinct
words per document.
Hope this helps.
Nader Henein
Ben wrote:
>Hi
>
>I would like to allow users on my site be able to create alerts base
>on the search keywords for every newly posted article. It would be
>good to send out the alert emails hourly, daily or weekly, etc.
>
>Anyone has any experience in this area? What are the best practices
>when implement such feature? I would imagine it's going to take a lot
>of resources to do a search for each keyword.
>
>Any guidance is greatly appreciated. Thanks!
>
>Ben
>
>---------------------------------------------------------------------
>To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
>
>
>
>
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org