Posted to dev@manifoldcf.apache.org by "Karl Wright (JIRA)" <ji...@apache.org> on 2010/09/08 15:12:37 UTC
[jira] Resolved: (CONNECTORS-104) Make it easier to limit a web crawl to a single site
[ https://issues.apache.org/jira/browse/CONNECTORS-104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Karl Wright resolved CONNECTORS-104.
------------------------------------
Assignee: Karl Wright
Fix Version/s: LCF Release 0.5
Resolution: Fixed
r995042.
> Make it easier to limit a web crawl to a single site
> ----------------------------------------------------
>
> Key: CONNECTORS-104
> URL: https://issues.apache.org/jira/browse/CONNECTORS-104
> Project: Apache Connectors Framework
> Issue Type: Improvement
> Components: Web connector
> Reporter: Jack Krupansky
> Assignee: Karl Wright
> Priority: Minor
> Fix For: LCF Release 0.5
>
>
> Unless the user carefully enters an explicit include regex, a web crawl can quickly get out of control and start crawling the entire web, when all the user may really want is to crawl a single web site or a portion of one. It would therefore be preferable if the crawl could be limited to the seed web site(s), either by default or with a simple button.
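
For illustration only, the idea in the description above (restricting a crawl to the hosts of its seed URLs) can be sketched as follows. This is not taken from the change committed for this issue and is not the Web connector's code; the function names seed_include_patterns and in_scope, and the example URLs, are hypothetical. The sketch assumes that "same site" means the same host name, with either http or https and an optional port.

```python
import re
from urllib.parse import urlsplit


def seed_include_patterns(seeds):
    """Build one include regex per distinct seed host (hypothetical helper)."""
    patterns = []
    seen = set()
    for seed in seeds:
        parts = urlsplit(seed.strip())
        if not parts.scheme or not parts.hostname:
            continue  # skip blank or malformed seed lines
        host = parts.hostname.lower()
        if host in seen:
            continue  # one pattern per host is enough
        seen.add(host)
        # Match http or https, the exact host, an optional port,
        # and then either a path separator or the end of the URL.
        patterns.append(r"^https?://" + re.escape(host) + r"(:\d+)?(/|$)")
    return patterns


def in_scope(url, patterns):
    """Return True if the URL matches at least one include pattern."""
    return any(re.match(p, url, re.IGNORECASE) for p in patterns)


if __name__ == "__main__":
    includes = seed_include_patterns(["http://example.com/docs/"])
    for candidate in ("http://example.com/docs/page.html",
                      "http://example.com.evil.org/",
                      "http://other.org/"):
        print(candidate, in_scope(candidate, includes))
```

Anchoring the pattern on the full host name followed by a port, a slash, or the end of the URL keeps look-alike hosts such as example.com.evil.org out of scope, which a bare substring include would not do.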