You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucy.apache.org by "Marvin Humphrey (JIRA)" <ji...@apache.org> on 2010/11/09 03:46:22 UTC
[lucy-issues] [jira] Created: (LUCY-125) Bundle Snowball stemming libraries
Bundle Snowball stemming libraries
----------------------------------
Key: LUCY-125
URL: https://issues.apache.org/jira/browse/LUCY-125
Project: Lucy
Issue Type: Improvement
Components: Analysis
Reporter: Marvin Humphrey
Assignee: Marvin Humphrey
The snowball stemming libraries are available under a BSD license. Rather
than rely on the CPAN module Lingua::Stem::Snowball for access, we should
bundle them with Lucy itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries
Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marvin Humphrey updated LUCY-125:
---------------------------------
Attachment: snowstem_test_content.patch
The diff file "snowstem_test_content.patch" provides an autogenerated
"tests.json" file derived from the stemmer test vocab in the Snowball svn
repository.
> Bundle Snowball stemming libraries
> ----------------------------------
>
> Key: LUCY-125
> URL: https://issues.apache.org/jira/browse/LUCY-125
> Project: Lucy
> Issue Type: Improvement
> Components: Analysis
> Reporter: Marvin Humphrey
> Assignee: Marvin Humphrey
> Attachments: snowball_content.patch, snowstem.patch, snowstem.patch, snowstem_test_content.patch
>
>
> The snowball stemming libraries are available under a BSD license. Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[lucy-issues] [jira] Resolved: (LUCY-125) Bundle Snowball stemming libraries
Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marvin Humphrey resolved LUCY-125.
----------------------------------
Resolution: Fixed
Committed test_snowstem.patch and snowstem_test_content.patch as r1038445.
> Bundle Snowball stemming libraries
> ----------------------------------
>
> Key: LUCY-125
> URL: https://issues.apache.org/jira/browse/LUCY-125
> Project: Lucy
> Issue Type: Improvement
> Components: Analysis
> Reporter: Marvin Humphrey
> Assignee: Marvin Humphrey
> Fix For: 0.1-incubating
>
> Attachments: snowball_content.patch, snowstem.patch, snowstem.patch, snowstem_test_content.patch, test_snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license. Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries
Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marvin Humphrey updated LUCY-125:
---------------------------------
Attachment: snowball_content.patch
The attached diff, snowball_content.patch, contains BSD-licensed content from
Snowball. It was extracted (and one minor patch applied) using the
update_snowstem.pl script, working against revision 541 of the Snowball
Subversion repository.
I expect to commit this soon, since no one has raised objections to the patch
from yesterday.
> Bundle Snowball stemming libraries
> ----------------------------------
>
> Key: LUCY-125
> URL: https://issues.apache.org/jira/browse/LUCY-125
> Project: Lucy
> Issue Type: Improvement
> Components: Analysis
> Reporter: Marvin Humphrey
> Assignee: Marvin Humphrey
> Attachments: snowball_content.patch, snowstem.patch, snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license. Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries
Posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chris A. Mattmann updated LUCY-125:
-----------------------------------
Fix Version/s: 0.1-incubating
- schedule
> Bundle Snowball stemming libraries
> ----------------------------------
>
> Key: LUCY-125
> URL: https://issues.apache.org/jira/browse/LUCY-125
> Project: Lucy
> Issue Type: Improvement
> Components: Analysis
> Reporter: Marvin Humphrey
> Assignee: Marvin Humphrey
> Fix For: 0.1-incubating
>
> Attachments: snowball_content.patch, snowstem.patch, snowstem.patch, snowstem_test_content.patch, test_snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license. Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[lucy-issues] [jira] Resolved: (LUCY-125) Bundle Snowball stemming libraries
Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marvin Humphrey resolved LUCY-125.
----------------------------------
Resolution: Fixed
Both patches committed as r1033549.
Mailing list thread: [http://s.apache.org/9jW].
> Bundle Snowball stemming libraries
> ----------------------------------
>
> Key: LUCY-125
> URL: https://issues.apache.org/jira/browse/LUCY-125
> Project: Lucy
> Issue Type: Improvement
> Components: Analysis
> Reporter: Marvin Humphrey
> Assignee: Marvin Humphrey
> Attachments: snowball_content.patch, snowstem.patch, snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license. Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries
Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marvin Humphrey updated LUCY-125:
---------------------------------
Attachment: test_snowstem.patch
The attached "test_snowstem.patch" file provides modifications to
update_snowstem.pl which generate the "tests.json" file, as well as mods to
TestStemmer.c to process the tests. Only a small sampling of the Snowball
test vocab is used (10 words per language).
> Bundle Snowball stemming libraries
> ----------------------------------
>
> Key: LUCY-125
> URL: https://issues.apache.org/jira/browse/LUCY-125
> Project: Lucy
> Issue Type: Improvement
> Components: Analysis
> Reporter: Marvin Humphrey
> Assignee: Marvin Humphrey
> Attachments: snowball_content.patch, snowstem.patch, snowstem.patch, snowstem_test_content.patch, test_snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license. Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries
Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marvin Humphrey updated LUCY-125:
---------------------------------
Attachment: snowstem.patch
The patch file "snowstem.patch" contains all the adaptations to the Lucy
library itself, but does not add any Snowball source code.
A perl script, update_snowstem.pl, is used to extract only the needed files
from the libstemmer_c download from snowball.tartarus.org and apply one minor
patch (an include guard on the libstemmer.h header).
> Bundle Snowball stemming libraries
> ----------------------------------
>
> Key: LUCY-125
> URL: https://issues.apache.org/jira/browse/LUCY-125
> Project: Lucy
> Issue Type: Improvement
> Components: Analysis
> Reporter: Marvin Humphrey
> Assignee: Marvin Humphrey
> Attachments: snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license. Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[lucy-issues] [jira] Reopened: (LUCY-125) Bundle Snowball stemming libraries
Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marvin Humphrey reopened LUCY-125:
----------------------------------
Reopening this issue to add tests.
> Bundle Snowball stemming libraries
> ----------------------------------
>
> Key: LUCY-125
> URL: https://issues.apache.org/jira/browse/LUCY-125
> Project: Lucy
> Issue Type: Improvement
> Components: Analysis
> Reporter: Marvin Humphrey
> Assignee: Marvin Humphrey
> Attachments: snowball_content.patch, snowstem.patch, snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license. Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[lucy-issues] [jira] Updated: (LUCY-125) Bundle Snowball stemming libraries
Posted by "Marvin Humphrey (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCY-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Marvin Humphrey updated LUCY-125:
---------------------------------
Attachment: snowstem.patch
Here's a new version of snowstem.patch, which changes the update_snowstem.pl
script to work off of a checkout of the Snowball svn repository, rather than
the unversioned libstemmer_c.tgz.
> Bundle Snowball stemming libraries
> ----------------------------------
>
> Key: LUCY-125
> URL: https://issues.apache.org/jira/browse/LUCY-125
> Project: Lucy
> Issue Type: Improvement
> Components: Analysis
> Reporter: Marvin Humphrey
> Assignee: Marvin Humphrey
> Attachments: snowstem.patch, snowstem.patch
>
>
> The snowball stemming libraries are available under a BSD license. Rather
> than rely on the CPAN module Lingua::Stem::Snowball for access, we should
> bundle them with Lucy itself.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.