Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2011/07/16 15:22:00 UTC

[jira] [Commented] (NUTCH-657) Estonian N-gram profile has wrong name

    [ https://issues.apache.org/jira/browse/NUTCH-657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066426#comment-13066426 ] 

Lewis John McGibbney commented on NUTCH-657:
--------------------------------------------

I have been unsuccessful in submitting a patch for a file name change, as opposed to content changes within the file... any pointers please? I am not familiar with submitting patches for file name changes.

Yes Markus, none of these files exist within trunk... strange. From some background reading of the classes I can see that two of the authors are Sami Siren and Jerome Charron. Is there anyone on board who has experience working with the language identifier code? This is really the first time I have looked over it...

> Estonian N-gram profile has wrong name
> --------------------------------------
>
>                 Key: NUTCH-657
>                 URL: https://issues.apache.org/jira/browse/NUTCH-657
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 0.8.1, 0.9.0
>            Reporter: Jonathan Young
>            Priority: Trivial
>
> The Nutch language identifier plugin contains an n-gram profile, ee.ngp, in src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang. "ee" is the ISO 3166-1 alpha-2 country code for Estonia (see http://www.iso.org/iso/country_codes/iso_3166_code_lists/english_country_names_and_code_elements.htm), but as a language code it is the two-letter ISO 639-1 code for Ewe (see http://www.loc.gov/standards/iso639-2/php/English_list.php, which lists the ISO 639-1 codes alongside the ISO 639-2 ones).
> "et" is the ISO 639-1 code for Estonian, and the language profile in ee.ngp is clearly Estonian.
> Proposed solution: rename ee.ngp to et.ngp .
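
For anyone wanting to double-check the codes quoted above, here is a minimal sketch that asks the JDK's own locale data what "ee" and "et" stand for. The class and method names are hypothetical and not part of Nutch; the sketch assumes the JDK's bundled locale names cover these codes, and it only looks the codes up, it does not touch the plugin or rename anything.

```java
import java.util.Locale;

// Hypothetical helper, not part of Nutch: resolves two-letter codes
// through the JDK's locale data to compare language and country meanings.
final class LanguageCodeCheck {

    // English display name for a two-letter language code.
    static String languageName(String code) {
        return Locale.forLanguageTag(code).getDisplayLanguage(Locale.ENGLISH);
    }

    // English display name for a two-letter ISO 3166-1 alpha-2 country code.
    static String countryName(String code) {
        return new Locale.Builder().setRegion(code).build()
                .getDisplayCountry(Locale.ENGLISH);
    }

    public static void main(String[] args) {
        // "ee" read as a language code versus "EE" read as a country code.
        System.out.println("language ee: " + languageName("ee"));
        System.out.println("language et: " + languageName("et"));
        System.out.println("country  EE: " + countryName("EE"));
    }
}
```

If the language lookup for "et" comes back as Estonian and the one for "ee" does not, that supports the rename proposed in the issue.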
