Posted to droids-dev@incubator.apache.org by "Thorsten Scherler (JIRA)" <ji...@apache.org> on 2010/04/30 18:22:53 UTC

[jira] Updated: (DROIDS-77) Be able to modify URL rules while crawler is running

     [ https://issues.apache.org/jira/browse/DROIDS-77?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thorsten Scherler updated DROIDS-77:
------------------------------------

    Affects Version/s: 0.0.2
                           (was: 0.0.1)

> Be able to modify URL rules while crawler is running
> ----------------------------------------------------
>
>                 Key: DROIDS-77
>                 URL: https://issues.apache.org/jira/browse/DROIDS-77
>             Project: Droids
>          Issue Type: New Feature
>          Components: core
>            Reporter: Richard Frovarp
>            Priority: Minor
>
> It would be nice to be able to modify the URL rules while a crawler is running. This would allow me to dynamically exclude areas from being crawled based on the results being returned. Basically, I want to look for certain markers inside a page, then not crawl those pages, without having to update a robots file. Different paths of our site are going to enter the index through a different method than the main crawl, so I can skip them once I find them.
> Having a modifiable filter would allow people to load their rules from places other than a file without having to write their own implementation or extension. I'll try to work up a patch sometime this week.
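
For illustration only, here is a minimal sketch of what such a modifiable filter might look like. It is not the patch mentioned above and not part of the Droids API: the class name ModifiableUrlFilter and the methods addExclude, removeExclude, and filter are hypothetical. It assumes exclude rules are regular expressions and that a null return means "do not crawl".

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.regex.Pattern;

// Hypothetical sketch; the class and method names are assumptions,
// not taken from the Droids code base.
class ModifiableUrlFilter {
    // Copy-on-write list: crawler threads iterate over a stable snapshot
    // while another thread adds or removes rules.
    private final List<Pattern> excludes = new CopyOnWriteArrayList<>();

    // Add an exclude rule at runtime.
    void addExclude(String regex) {
        excludes.add(Pattern.compile(regex));
    }

    // Remove a previously added rule; returns true if one was removed.
    boolean removeExclude(String regex) {
        return excludes.removeIf(p -> p.pattern().equals(regex));
    }

    // Returns the URL if it may be crawled, or null if any exclude rule matches.
    String filter(String url) {
        for (Pattern p : excludes) {
            if (p.matcher(url).find()) {
                return null;
            }
        }
        return url;
    }
}
```

With something along these lines, a handler that spots a marker in a fetched page could call addExclude for that path, and later URLs under it would be rejected without touching a robots file. The copy-on-write list is one possible choice: it favours cheap reads by many crawler threads over cheap rule updates, which seems to fit the use case described, but other designs would work as well.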

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.