You are viewing a plain text version of this content. The canonical link for it is here.
Posted to droids-dev@incubator.apache.org by Javier Puerto <jp...@gmail.com> on 2010/04/30 13:35:38 UTC
Summary of open issues
Release targets:
* BUGFIX
* DROIDS-74: LinkExtractor doesn't set the anchorText on the link.
* FEATURES
* DROIDS-52: Optimize memory usage of TaskQueue and History.
* TASK
* DROIDS-13: Review stream handling in samples.
* DROIDS-51: review MultiThreadedTaskMaster thread creation/looping.
(Attached patch)
* DROIDS-52: Optimize memory usage of TaskQueue and History. (Attached
patch)
* DROIDS-86: Empty files
* DROIDS-87: Add missing license headers
* CHANGES
* DROIDS-11: The core parser OutgoingLinks should change. Implement a more
generic TaskExtractor inteface that should return new task. (
http://markmail.org/message/vijohov4narssuv6)
* DROIDS-35: Provide ability for large content entities to overflow to a
temp file
* DROIDS-56: Change the TaskQueue interface to java.util.Queue
Discussion:
* DROIDS-45: Fail to resolve outlink correctly. Mingfai Ma patches. Changes
some base ideas.
* DROIDS-48: Support prioritizing in the TaskQueue. Mingfai Ma patches.
Changes some base ideas.
* DROIDS-53: Implement a unique hash function for Task ID.
* DROIDS-54: Make LinkTask supports arbitrary data by extends HashMap, and
consider to refactor Task, Link, and LinkTask. Mingfai Ma patches. Changes
some base ideas.
* DROIDS-82: Use camel.apache.org for externals communications
Future releases:
Features:
* DROIDS-27: add more functions to get basic status monitoring.
* DROIDS-77: Be able to modify URL rules while crawler is running
Re: Summary of open issues
Posted by Chapuis Bertil <bc...@agimem.com>.
For 51, I may be able to merge what i did in my branch and submit a patch.
The branch also include the Queue issue (56) but without extending an
existing queue. the queue interface is choosen by the user. I'll do my best
to propose something this evening.
On Fri, Apr 30, 2010 at 9:12 PM, Richard Frovarp <rf...@apache.org>wrote:
> Thorsten Scherler wrote:
>
>>
>>
>> I updated the status to none.
>>
>>
>> So we have 2 issues that needs patches (35 and 56) and one that needs
>> refactoring the attached patch 51.
>>
>> WDOT?
>>
>>
>
> 35 could wait for the next release. I've provided a patch for 56.
>
> We should also go through the release and graduation requirements and try
> to take care as many of those items as possible as well.
>
> Richard
>
Re: Summary of open issues
Posted by Richard Frovarp <rf...@apache.org>.
Thorsten Scherler wrote:
>
>
> I updated the status to none.
>
>
> So we have 2 issues that needs patches (35 and 56) and one that needs refactoring the attached patch 51.
>
> WDOT?
>
35 could wait for the next release. I've provided a patch for 56.
We should also go through the release and graduation requirements and
try to take care as many of those items as possible as well.
Richard
Re: Summary of open issues
Posted by Thorsten Scherler <th...@apache.org>.
On 30/04/2010, at 15:35, Javier Puerto wrote:
> Release targets:
>
Thanks Javier. I just added a new fix version 0.0.2 to better filter the issues we really want in this release. More comments on your list inline.
> * BUGFIX
> * DROIDS-74: LinkExtractor doesn't set the anchorText on the link.
fixed
>
> * FEATURES
> * DROIDS-52: Optimize memory usage of TaskQueue and History.
>
I am tempted to put it in 0.0.2
> * TASK
> * DROIDS-13: Review stream handling in samples.
fixed
> * DROIDS-51: review MultiThreadedTaskMaster thread creation/looping. (Attached patch)
needs review but I think Bertil has done some work here in his branch.
> * DROIDS-52: Optimize memory usage of TaskQueue and History. (Attached patch)
see above: I am tempted to put it in 0.0.2
> * DROIDS-86: Empty files
fixed
> * DROIDS-87: Add missing license headers
fixed
>
> * CHANGES
> * DROIDS-11: The core parser OutgoingLinks should change. Implement a more generic TaskExtractor inteface that should return new task. (http://markmail.org/message/vijohov4narssuv6)
fixed
> * DROIDS-35: Provide ability for large content entities to overflow to a temp file
needs a patch so I am tempted to put it in 0.0.2
> * DROIDS-56: Change the TaskQueue interface to java.util.Queue
needs a patch but I would like it in 0.0.1 if somebody can provide a patch. otherwise I am tempted to put it in 0.0.2
>
> Discussion:
>
> * DROIDS-45: Fail to resolve outlink correctly. Mingfai Ma patches. Changes some base ideas.
> * DROIDS-48: Support prioritizing in the TaskQueue. Mingfai Ma patches. Changes some base ideas.
> * DROIDS-53: Implement a unique hash function for Task ID.
> * DROIDS-54: Make LinkTask supports arbitrary data by extends HashMap, and consider to refactor Task, Link, and LinkTask. Mingfai Ma patches. Changes some base ideas.
All the above I do not see in 0.0.1 but as good starting point for a refactored droids
> * DROIDS-82: Use camel.apache.org for externals communications
>
That is 0.0.2 or even later IMO
> Future releases:
>
> Features:
> * DROIDS-27: add more functions to get basic status monitoring.
> * DROIDS-77: Be able to modify URL rules while crawler is running
> <summary.txt>
I updated the status to none.
So we have 2 issues that needs patches (35 and 56) and one that needs refactoring the attached patch 51.
WDOT?
salu2
Thorsten Scherler <thorsten.at.apache.org>
Open Source Java <consulting, training and solutions>