You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@manifoldcf.apache.org by pa...@wipro.com on 2013/04/12 08:19:14 UTC

Create custom cleanup job for deleting index older than given time

Hi,

We have a requirement to create a custom job for Solr Output connector which will delete the index information older than a given time(say 1year). Additionally, I need to send an email about the cleanup indexes for documents, to the Admin group. If anyone has come across similar use case or have any idea on how this can be achieved, then please assist?


Thanks!

Regards,
Pankaj

Please do not print this email unless it is absolutely necessary. 

The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments. 

WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email. 

www.wipro.com

Re: Create custom cleanup job for deleting index older than given time

Posted by Karl Wright <da...@gmail.com>.
Expiration of documents goes through the output connection, just like
document deletion.

Karl



On Fri, Apr 12, 2013 at 7:28 AM, <pa...@wipro.com> wrote:

> Hi Karl,
>
> That was really helpful, it's exactly what I have been looking for. Also,
> can you please let me know where I can find the logic which checks and runs
> the expiration over index. I would like to customize that to generate the
> report of documents being deleted from current run and send it to admin.
>
> Thanks!
>
> Regards,
> Pankaj
>
> -----Original Message-----
> From: Karl Wright [mailto:daddywri@gmail.com]
> Sent: Friday, April 12, 2013 1:14 PM
> To: dev
> Subject: Re: Create custom cleanup job for deleting index older than given
> time
>
> For jobs that run continuously, you can use the expiration functionality
> to remove things from the index.  But if the documents still exist in the
> repository, they will be rediscovered.  It sounds to me, then, like you
> want to remove things from the repository, which is clearly beyond the
> scope of ManifoldCF.
>
>
>
> On Fri, Apr 12, 2013 at 2:19 AM, <pa...@wipro.com> wrote:
>
> > Hi,
> >
> > We have a requirement to create a custom job for Solr Output connector
> > which will delete the index information older than a given time(say
> 1year).
> > Additionally, I need to send an email about the cleanup indexes for
> > documents, to the Admin group. If anyone has come across similar use
> > case or have any idea on how this can be achieved, then please assist?
> >
> >
> > Thanks!
> >
> > Regards,
> > Pankaj
> >
> > Please do not print this email unless it is absolutely necessary.
> >
> > The information contained in this electronic message and any
> > attachments to this message are intended for the exclusive use of the
> > addressee(s) and may contain proprietary, confidential or privileged
> > information. If you are not the intended recipient, you should not
> > disseminate, distribute or copy this e-mail. Please notify the sender
> > immediately and destroy all copies of this message and any attachments.
> >
> > WARNING: Computer viruses can be transmitted via email. The recipient
> > should check this email and any attachments for the presence of viruses.
> > The company accepts no liability for any damage caused by any virus
> > transmitted by this email.
> >
> > www.wipro.com
> >
>
> Please do not print this email unless it is absolutely necessary.
>
> The information contained in this electronic message and any attachments
> to this message are intended for the exclusive use of the addressee(s) and
> may contain proprietary, confidential or privileged information. If you are
> not the intended recipient, you should not disseminate, distribute or copy
> this e-mail. Please notify the sender immediately and destroy all copies of
> this message and any attachments.
>
> WARNING: Computer viruses can be transmitted via email. The recipient
> should check this email and any attachments for the presence of viruses.
> The company accepts no liability for any damage caused by any virus
> transmitted by this email.
>
> www.wipro.com
>

RE: Create custom cleanup job for deleting index older than given time

Posted by pa...@wipro.com.
Hi Karl,

That was really helpful, it's exactly what I have been looking for. Also, can you please let me know where I can find the logic which checks and runs the expiration over index. I would like to customize that to generate the report of documents being deleted from current run and send it to admin.

Thanks!

Regards,
Pankaj

-----Original Message-----
From: Karl Wright [mailto:daddywri@gmail.com] 
Sent: Friday, April 12, 2013 1:14 PM
To: dev
Subject: Re: Create custom cleanup job for deleting index older than given time

For jobs that run continuously, you can use the expiration functionality to remove things from the index.  But if the documents still exist in the repository, they will be rediscovered.  It sounds to me, then, like you want to remove things from the repository, which is clearly beyond the scope of ManifoldCF.



On Fri, Apr 12, 2013 at 2:19 AM, <pa...@wipro.com> wrote:

> Hi,
>
> We have a requirement to create a custom job for Solr Output connector 
> which will delete the index information older than a given time(say 1year).
> Additionally, I need to send an email about the cleanup indexes for 
> documents, to the Admin group. If anyone has come across similar use 
> case or have any idea on how this can be achieved, then please assist?
>
>
> Thanks!
>
> Regards,
> Pankaj
>
> Please do not print this email unless it is absolutely necessary.
>
> The information contained in this electronic message and any 
> attachments to this message are intended for the exclusive use of the 
> addressee(s) and may contain proprietary, confidential or privileged 
> information. If you are not the intended recipient, you should not 
> disseminate, distribute or copy this e-mail. Please notify the sender 
> immediately and destroy all copies of this message and any attachments.
>
> WARNING: Computer viruses can be transmitted via email. The recipient 
> should check this email and any attachments for the presence of viruses.
> The company accepts no liability for any damage caused by any virus 
> transmitted by this email.
>
> www.wipro.com
>

Please do not print this email unless it is absolutely necessary. 

The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments. 

WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email. 

www.wipro.com

Re: Create custom cleanup job for deleting index older than given time

Posted by Karl Wright <da...@gmail.com>.
For jobs that run continuously, you can use the expiration functionality to
remove things from the index.  But if the documents still exist in the
repository, they will be rediscovered.  It sounds to me, then, like you
want to remove things from the repository, which is clearly beyond the
scope of ManifoldCF.



On Fri, Apr 12, 2013 at 2:19 AM, <pa...@wipro.com> wrote:

> Hi,
>
> We have a requirement to create a custom job for Solr Output connector
> which will delete the index information older than a given time(say 1year).
> Additionally, I need to send an email about the cleanup indexes for
> documents, to the Admin group. If anyone has come across similar use case
> or have any idea on how this can be achieved, then please assist?
>
>
> Thanks!
>
> Regards,
> Pankaj
>
> Please do not print this email unless it is absolutely necessary.
>
> The information contained in this electronic message and any attachments
> to this message are intended for the exclusive use of the addressee(s) and
> may contain proprietary, confidential or privileged information. If you are
> not the intended recipient, you should not disseminate, distribute or copy
> this e-mail. Please notify the sender immediately and destroy all copies of
> this message and any attachments.
>
> WARNING: Computer viruses can be transmitted via email. The recipient
> should check this email and any attachments for the presence of viruses.
> The company accepts no liability for any damage caused by any virus
> transmitted by this email.
>
> www.wipro.com
>