Posted to dev@manifoldcf.apache.org by "Karl Wright (JIRA)" <ji...@apache.org> on 2010/09/08 15:12:37 UTC

[jira] Resolved: (CONNECTORS-104) Make it easier to limit a web crawl to a single site

     [ https://issues.apache.org/jira/browse/CONNECTORS-104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Karl Wright resolved CONNECTORS-104.
------------------------------------

         Assignee: Karl Wright
    Fix Version/s: LCF Release 0.5
       Resolution: Fixed

r995042.


> Make it easier to limit a web crawl to a single site
> ----------------------------------------------------
>
>                 Key: CONNECTORS-104
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-104
>             Project: Apache Connectors Framework
>          Issue Type: Improvement
>          Components: Web connector
>            Reporter: Jack Krupansky
>            Assignee: Karl Wright
>            Priority: Minor
>             Fix For: LCF Release 0.5
>
>
> Unless the user carefully enters an explicit include regex, a web crawl can quickly get out of control and start crawling the entire web, when all the user may really want is to crawl a single web site or a portion of it. It would therefore be preferable if the crawl could be limited to the seed web site(s), either by default or with a simple button.
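
To illustrate the idea in the description, here is a minimal sketch of how include regexes might be derived from the seed URLs so that a crawl stays on the seed site(s). It is not the change committed for this issue and it is not the web connector's API; the function names seed_include_patterns and is_in_scope are hypothetical, and the sketch assumes the seeds are absolute http or https URLs.

```python
import re
from urllib.parse import urlsplit


def seed_include_patterns(seeds):
    """Build one anchored include regex per distinct seed host.

    Hypothetical helper: assumes each seed is an absolute http(s) URL.
    """
    patterns = []
    seen = set()
    for seed in seeds:
        parts = urlsplit(seed.strip())
        if parts.scheme not in ("http", "https") or not parts.hostname:
            continue  # skip anything that is not an absolute http(s) URL
        try:
            port = parts.port
        except ValueError:
            continue  # skip seeds with a malformed port
        # hostname is already lower-cased by urlsplit; keep an explicit port
        host = parts.hostname if port is None else "%s:%d" % (parts.hostname, port)
        if host in seen:
            continue
        seen.add(host)
        # Anchor the host so "www.example.com.evil.org" does not match
        patterns.append(r"^https?://" + re.escape(host) + r"(?:[/?#]|$)")
    return patterns


def is_in_scope(url, patterns):
    """Return True if the URL matches any of the include patterns."""
    return any(re.match(p, url, re.IGNORECASE) for p in patterns)


if __name__ == "__main__":
    pats = seed_include_patterns(["http://www.example.com/docs/index.html"])
    for candidate in ("http://www.example.com/docs/page2.html",
                      "http://other.example.org/"):
        print(candidate, is_in_scope(candidate, pats))
```

This sketch limits the crawl by host only. Restricting it to a portion of a site would additionally need the seed's path prefix in the pattern, which is left out here.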

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.