You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Doug Cutting (JIRA)" <ji...@apache.org> on 2005/05/02 19:33:06 UTC

[jira] Commented: (NUTCH-54) Fetcher improvements

     [ http://issues.apache.org/jira/browse/NUTCH-54?page=comments#action_64270 ]
     
Doug Cutting commented on NUTCH-54:
-----------------------------------

Currently a Parse is decomposed into ParseText and ParseData so that these may be stored separately on disk so that, when constructing summaries, the text may be accessed independently.  The full Parse is reconstructed when indexing and passed to indexing filters.

Should ParseStatus be available to indexing filters?  If so, then it must be stored.  The simplest way to do this would be to make ParseStatus a field of ParseData rather than Parse itself.  Even if not, the indexing code must still be modified to be able to construct a Parse without a ParseStatus.

Also, a minor note, I would name the method Parser.getStatus(), not parser.getPArserStatus().

> Fetcher  improvements
> ---------------------
>
>          Key: NUTCH-54
>          URL: http://issues.apache.org/jira/browse/NUTCH-54
>      Project: Nutch
>         Type: Improvement
>   Components: fetcher
>     Reporter: Andrzej Bialecki 
>     Assignee: Andrzej Bialecki 
>  Attachments: parsestatus.patch
>
> Fetcher improvements.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira