Posted to issues@lucy.apache.org by "Marvin Humphrey (JIRA)" <ji...@apache.org> on 2010/11/09 03:46:22 UTC

[lucy-issues] [jira] Created: (LUCY-125) Bundle Snowball stemming libraries

Bundle Snowball stemming libraries
----------------------------------

                 Key: LUCY-125
                 URL: https://issues.apache.org/jira/browse/LUCY-125
             Project: Lucy
          Issue Type: Improvement
          Components: Analysis
            Reporter: Marvin Humphrey
            Assignee: Marvin Humphrey


The Snowball stemming libraries are available under a BSD license.  Rather
than rely on the CPAN module Lingua::Stem::Snowball for access, we should
bundle them with Lucy itself.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries

Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marvin Humphrey updated LUCY-125:
---------------------------------

    Attachment: snowstem_test_content.patch

The diff file "snowstem_test_content.patch" provides an autogenerated
"tests.json" file derived from the stemmer test vocab in the Snowball svn
repository.

> Bundle Snowball stemming libraries
> ----------------------------------
>
>                 Key: LUCY-125
>                 URL: https://issues.apache.org/jira/browse/LUCY-125
>             Project: Lucy
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>         Attachments: snowball_content.patch, snowstem.patch, snowstem.patch, snowstem_test_content.patch
>
>
> The snowball stemming libraries are available under a BSD license.  Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.



[lucy-issues] [jira] Resolved: (LUCY-125) Bundle Snowball stemming libraries

Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marvin Humphrey resolved LUCY-125.
----------------------------------

    Resolution: Fixed

Committed test_snowstem.patch and snowstem_test_content.patch as r1038445.

> Bundle Snowball stemming libraries
> ----------------------------------
>
>                 Key: LUCY-125
>                 URL: https://issues.apache.org/jira/browse/LUCY-125
>             Project: Lucy
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>             Fix For: 0.1-incubating
>
>         Attachments: snowball_content.patch, snowstem.patch, snowstem.patch, snowstem_test_content.patch, test_snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license.  Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.



[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries

Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marvin Humphrey updated LUCY-125:
---------------------------------

    Attachment: snowball_content.patch

The attached diff, snowball_content.patch, contains BSD-licensed content from
Snowball.  It was extracted (and one minor patch applied) using the
update_snowstem.pl script, working against revision 541 of the Snowball 
Subversion repository.

I expect to commit this soon, since no one has raised objections to the patch 
from yesterday.

> Bundle Snowball stemming libraries
> ----------------------------------
>
>                 Key: LUCY-125
>                 URL: https://issues.apache.org/jira/browse/LUCY-125
>             Project: Lucy
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>         Attachments: snowball_content.patch, snowstem.patch, snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license.  Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.



[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries

Posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann updated LUCY-125:
-----------------------------------

    Fix Version/s: 0.1-incubating

- schedule

> Bundle Snowball stemming libraries
> ----------------------------------
>
>                 Key: LUCY-125
>                 URL: https://issues.apache.org/jira/browse/LUCY-125
>             Project: Lucy
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>             Fix For: 0.1-incubating
>
>         Attachments: snowball_content.patch, snowstem.patch, snowstem.patch, snowstem_test_content.patch, test_snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license.  Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.



[lucy-issues] [jira] Resolved: (LUCY-125) Bundle Snowball stemming libraries

Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marvin Humphrey resolved LUCY-125.
----------------------------------

    Resolution: Fixed

Both patches committed as r1033549.

Mailing list thread: [http://s.apache.org/9jW].

> Bundle Snowball stemming libraries
> ----------------------------------
>
>                 Key: LUCY-125
>                 URL: https://issues.apache.org/jira/browse/LUCY-125
>             Project: Lucy
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>         Attachments: snowball_content.patch, snowstem.patch, snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license.  Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.



[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries

Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marvin Humphrey updated LUCY-125:
---------------------------------

    Attachment: test_snowstem.patch

The attached "test_snowstem.patch" file provides modifications to
update_snowstem.pl which generate the "tests.json" file, as well as mods to
TestStemmer.c to process the tests.  Only a small sampling of the Snowball
test vocab is used (10 words per language).
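
As a rough illustration of what processing such word/stem pairs might look
like, here is a minimal C sketch.  It is not the code from TestStemmer.c: the
names StemCase, stem_fn, count_stem_failures and toy_stem are hypothetical,
the pairs are hard-coded rather than read from "tests.json", and toy_stem is
a stand-in that only strips a trailing "s", not a Snowball stemmer.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* One test vocabulary entry: an input word and the stem we expect. */
typedef struct {
    const char *word;
    const char *stem;
} StemCase;

/* Hypothetical stemmer signature: writes the stem into buf and returns it,
 * or returns NULL if the word does not fit. */
typedef const char *(*stem_fn)(const char *word, char *buf, size_t buf_size);

/* Run every case through the stemmer and count the mismatches. */
size_t count_stem_failures(stem_fn stem, const StemCase *cases, size_t n) {
    char buf[64];
    size_t failures = 0;
    assert(stem != NULL && cases != NULL);
    for (size_t i = 0; i < n; i++) {
        const char *got = stem(cases[i].word, buf, sizeof buf);
        if (got == NULL || strcmp(got, cases[i].stem) != 0) {
            failures++;
        }
    }
    return failures;
}

/* Stand-in stemmer (NOT Snowball): copies the word and drops one
 * trailing 's' if the word is longer than a single character. */
const char *toy_stem(const char *word, char *buf, size_t buf_size) {
    size_t len = strlen(word);
    if (len + 1 > buf_size) {
        return NULL;
    }
    memcpy(buf, word, len + 1);
    if (len > 1 && buf[len - 1] == 's') {
        buf[len - 1] = '\0';
    }
    return buf;
}
```

A real test would pass a wrapper around the bundled stemmer in place of
toy_stem and would report which word failed rather than only a count.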

> Bundle Snowball stemming libraries
> ----------------------------------
>
>                 Key: LUCY-125
>                 URL: https://issues.apache.org/jira/browse/LUCY-125
>             Project: Lucy
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>         Attachments: snowball_content.patch, snowstem.patch, snowstem.patch, snowstem_test_content.patch, test_snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license.  Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.



[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries

Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marvin Humphrey updated LUCY-125:
---------------------------------

    Attachment: snowstem.patch

The patch file "snowstem.patch" contains all the adaptations to the Lucy
library itself, but does not add any Snowball source code. 

A Perl script, update_snowstem.pl, is used to extract only the needed files
from the libstemmer_c download from snowball.tartarus.org and apply one minor
patch (an include guard on the libstemmer.h header).
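
For readers unfamiliar with the term, an include guard generally looks like
the C sketch below.  This is not the actual patch: the macro name
EXAMPLE_STEMMER_H and the struct are hypothetical, and the header text is
repeated twice in one file only to simulate a double #include.

```c
#include <stddef.h>

/* First "inclusion" of the header text: the macro is not yet defined,
 * so the declarations are seen and the macro gets defined. */
#ifndef EXAMPLE_STEMMER_H   /* hypothetical guard macro name */
#define EXAMPLE_STEMMER_H
struct example_stemmer_result {
    const char *text;
    size_t length;
};
#endif /* EXAMPLE_STEMMER_H */

/* Second "inclusion" of the same text: the macro is already defined,
 * so the preprocessor skips the body and the struct is not redefined.
 * Without the guard this would be a compile error. */
#ifndef EXAMPLE_STEMMER_H
#define EXAMPLE_STEMMER_H
struct example_stemmer_result {
    const char *text;
    size_t length;
};
#endif /* EXAMPLE_STEMMER_H */

/* Returns 1 when the guard macro has been defined, 0 otherwise. */
int example_guard_is_defined(void) {
#ifdef EXAMPLE_STEMMER_H
    return 1;
#else
    return 0;
#endif
}
```

The guard matters when a header can be reached through more than one
include path in the same translation unit.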


> Bundle Snowball stemming libraries
> ----------------------------------
>
>                 Key: LUCY-125
>                 URL: https://issues.apache.org/jira/browse/LUCY-125
>             Project: Lucy
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>         Attachments: snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license.  Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.



[lucy-issues] [jira] Reopened: (LUCY-125) Bundle Snowball stemming libraries

Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marvin Humphrey reopened LUCY-125:
----------------------------------


Reopening this issue to add tests.

> Bundle Snowball stemming libraries
> ----------------------------------
>
>                 Key: LUCY-125
>                 URL: https://issues.apache.org/jira/browse/LUCY-125
>             Project: Lucy
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>         Attachments: snowball_content.patch, snowstem.patch, snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license.  Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.



[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries

Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marvin Humphrey updated LUCY-125:
---------------------------------

    Attachment: snowstem.patch

Here's a new version of snowstem.patch, which changes the update_snowstem.pl
script to work off of a checkout of the Snowball svn repository, rather than
the unversioned libstemmer_c.tgz.

> Bundle Snowball stemming libraries
> ----------------------------------
>
>                 Key: LUCY-125
>                 URL: https://issues.apache.org/jira/browse/LUCY-125
>             Project: Lucy
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>         Attachments: snowstem.patch, snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license.  Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.
