You are viewing a plain text version of this content. The canonical link for it is here.
Posted to droids-dev@incubator.apache.org by Javier Puerto <jp...@gmail.com> on 2010/04/30 13:35:38 UTC

Summary of open issues

Release targets:

* BUGFIX
 * DROIDS-74: LinkExtractor doesn't set the anchorText on the link.

* FEATURES
 * DROIDS-52: Optimize memory usage of TaskQueue and History.

* TASK
 * DROIDS-13: Review stream handling in samples.
 * DROIDS-51: review MultiThreadedTaskMaster thread creation/looping.
(Attached patch)
 * DROIDS-52: Optimize memory usage of TaskQueue and History. (Attached
patch)
 * DROIDS-86: Empty files
 * DROIDS-87: Add missing license headers

* CHANGES
 * DROIDS-11: The core parser OutgoingLinks should change. Implement a more
generic TaskExtractor inteface that should return new task. (
http://markmail.org/message/vijohov4narssuv6)
 * DROIDS-35: Provide ability for large content entities to overflow to a
temp file
 * DROIDS-56: Change the TaskQueue interface to java.util.Queue

Discussion:

 * DROIDS-45: Fail to resolve outlink correctly. Mingfai Ma patches. Changes
some base ideas.
 * DROIDS-48: Support prioritizing in the TaskQueue. Mingfai Ma patches.
Changes some base ideas.
 * DROIDS-53: Implement a unique hash function for Task ID.
 * DROIDS-54: Make LinkTask supports arbitrary data by extends HashMap, and
consider to refactor Task, Link, and LinkTask. Mingfai Ma patches. Changes
some base ideas.
 * DROIDS-82: Use camel.apache.org for externals communications

Future releases:

Features:
 * DROIDS-27: add more functions to get basic status monitoring.
 * DROIDS-77: Be able to modify URL rules while crawler is running

Re: Summary of open issues

Posted by Chapuis Bertil <bc...@agimem.com>.
For 51, I may be able to merge what i did in my branch and submit a patch.
The branch also include the Queue issue (56) but without extending an
existing queue. the queue interface is choosen by the user. I'll do my best
to propose something this evening.


On Fri, Apr 30, 2010 at 9:12 PM, Richard Frovarp <rf...@apache.org>wrote:

> Thorsten Scherler wrote:
>
>>
>>
>>  I updated the status to none.
>>
>>
>> So we have 2 issues that needs patches (35 and 56) and one that needs
>> refactoring the attached patch 51.
>>
>> WDOT?
>>
>>
>
> 35 could wait for the next release. I've provided a patch for 56.
>
> We should also go through the release and graduation requirements and try
> to take care as many of those items as possible as well.
>
> Richard
>

Re: Summary of open issues

Posted by Richard Frovarp <rf...@apache.org>.
Thorsten Scherler wrote:
>
>
>  I updated the status to none.
>
>
> So we have 2 issues that needs patches (35 and 56) and one that needs refactoring the attached patch 51.
>
> WDOT?
>   

35 could wait for the next release. I've provided a patch for 56.

We should also go through the release and graduation requirements and 
try to take care as many of those items as possible as well.

Richard

Re: Summary of open issues

Posted by Thorsten Scherler <th...@apache.org>.
On 30/04/2010, at 15:35, Javier Puerto wrote:

> Release targets:
> 

Thanks Javier. I just added a new fix version 0.0.2 to better filter the issues we really want in this release. More comments on your list inline.

> * BUGFIX
>  * DROIDS-74: LinkExtractor doesn't set the anchorText on the link.

fixed

> 
> * FEATURES
>  * DROIDS-52: Optimize memory usage of TaskQueue and History. 
>  

I am tempted to put it in 0.0.2 

> * TASK
>  * DROIDS-13: Review stream handling in samples.

fixed

>  * DROIDS-51: review MultiThreadedTaskMaster thread creation/looping. (Attached patch)

needs review but I think Bertil has done some work here in his branch.

>  * DROIDS-52: Optimize memory usage of TaskQueue and History. (Attached patch)

see above: I am tempted to put it in 0.0.2

>  * DROIDS-86: Empty files

fixed

>  * DROIDS-87: Add missing license headers

fixed

> 
> * CHANGES
>  * DROIDS-11: The core parser OutgoingLinks should change. Implement a more generic TaskExtractor inteface that should return new task. (http://markmail.org/message/vijohov4narssuv6)

fixed

>  * DROIDS-35: Provide ability for large content entities to overflow to a temp file

needs a patch so I am tempted to put it in 0.0.2

>  * DROIDS-56: Change the TaskQueue interface to java.util.Queue

needs a patch but I would like it in 0.0.1 if somebody can provide a patch. otherwise I am tempted to put it in 0.0.2

> 
> Discussion: 
> 
>  * DROIDS-45: Fail to resolve outlink correctly. Mingfai Ma patches. Changes some base ideas.
>  * DROIDS-48: Support prioritizing in the TaskQueue. Mingfai Ma patches. Changes some base ideas.
>  * DROIDS-53: Implement a unique hash function for Task ID.
>  * DROIDS-54: Make LinkTask supports arbitrary data by extends HashMap, and consider to refactor Task, Link, and LinkTask. Mingfai Ma patches. Changes some base ideas.

All the above I do not see in 0.0.1 but as good starting point for a refactored droids

>  * DROIDS-82: Use camel.apache.org for externals communications
> 

That is  0.0.2 or even later IMO

> Future releases:
> 
> Features:
>  * DROIDS-27: add more functions to get basic status monitoring.
>  * DROIDS-77: Be able to modify URL rules while crawler is running
> <summary.txt>

 I updated the status to none.


So we have 2 issues that needs patches (35 and 56) and one that needs refactoring the attached patch 51.

WDOT?

salu2

Thorsten Scherler <thorsten.at.apache.org>
Open Source Java <consulting, training and solutions>