You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Julien Nioche (JIRA)" <ji...@apache.org> on 2011/03/27 19:09:05 UTC

[jira] [Created] (NUTCH-973) Remove Segment Merger in 1.3

Remove Segment Merger in 1.3
----------------------------

                 Key: NUTCH-973
                 URL: https://issues.apache.org/jira/browse/NUTCH-973
             Project: Nutch
          Issue Type: Task
            Reporter: Julien Nioche
            Priority: Minor
             Fix For: 1.3


The code for the segment merging is still in 1.3, as far as I understand its original function it was mostly useful for having a single data structure where the search app could get the cached data from. Now that we've delegated the indexing and search to SOLR we don't really need to worry about the cache anymore. Would it make sense to purge it or do you guys think it would still be useful? 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (NUTCH-973) Remove Segment Merger in 1.3

Posted by "Markus Jelsma (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-973?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13014566#comment-13014566 ] 

Markus Jelsma commented on NUTCH-973:
-------------------------------------

I'm not sure we should. In 1.x fetches still generate loose segments which i just might want to merge for simplicity. For example, i have a test setup where multiple segments are generated each day, then they get merged so i have a single segment per day. It's a bit easier to maintain and reindex fewer segments. I do delete segments older than the fetch interval but in my case that's just deleting one segment a day. This is in a local environment.

> Remove Segment Merger in 1.3
> ----------------------------
>
>                 Key: NUTCH-973
>                 URL: https://issues.apache.org/jira/browse/NUTCH-973
>             Project: Nutch
>          Issue Type: Task
>            Reporter: Julien Nioche
>            Priority: Minor
>             Fix For: 1.3
>
>
> The code for the segment merging is still in 1.3, as far as I understand its original function it was mostly useful for having a single data structure where the search app could get the cached data from. Now that we've delegated the indexing and search to SOLR we don't really need to worry about the cache anymore. Would it make sense to purge it or do you guys think it would still be useful? 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Closed] (NUTCH-973) Remove Segment Merger in 1.3

Posted by "Julien Nioche (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche closed NUTCH-973.
-------------------------------

    Resolution: Not A Problem

You are right, let's leave it for now. It won't be a problem once we're on 2.0 anyway

> Remove Segment Merger in 1.3
> ----------------------------
>
>                 Key: NUTCH-973
>                 URL: https://issues.apache.org/jira/browse/NUTCH-973
>             Project: Nutch
>          Issue Type: Task
>            Reporter: Julien Nioche
>            Priority: Minor
>             Fix For: 1.3
>
>
> The code for the segment merging is still in 1.3, as far as I understand its original function it was mostly useful for having a single data structure where the search app could get the cached data from. Now that we've delegated the indexing and search to SOLR we don't really need to worry about the cache anymore. Would it make sense to purge it or do you guys think it would still be useful? 

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira