You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@manifoldcf.apache.org by "Karl Wright (JIRA)" <ji...@apache.org> on 2014/12/15 13:22:13 UTC
[jira] [Created] (CONNECTORS-1122) Explore ways to make job start
be faster in systems with lots of documents
Karl Wright created CONNECTORS-1122:
---------------------------------------
Summary: Explore ways to make job start be faster in systems with lots of documents
Key: CONNECTORS-1122
URL: https://issues.apache.org/jira/browse/CONNECTORS-1122
Project: ManifoldCF
Issue Type: Improvement
Components: Framework crawler agent
Affects Versions: ManifoldCF 1.8, ManifoldCF 2.0
Reporter: Karl Wright
Assignee: Karl Wright
Fix For: ManifoldCF 1.9, ManifoldCF 2.1
Job start requires all documents to be marked as needing reprioritization now. We should consider ways in which we can reduce the need to do this as much as possible. For example, if there are NO documents at all for a job, reprioritization is by definition unneeded. Alternatively, coming up with a way of determining if there are any bin-level overlaps between documents made active by a job start at documents elsewhere, we could be more targeted.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)