Posted to dev@uima.apache.org by "Marshall Schor (JIRA)" <ui...@incubator.apache.org> on 2008/01/27 12:30:34 UTC

[jira] Created: (UIMA-722) Fix parsing of language specifications to normalize them

Fix parsing of language specifications to normalize them
--------------------------------------------------------

                 Key: UIMA-722
                 URL: https://issues.apache.org/jira/browse/UIMA-722
             Project: UIMA
          Issue Type: Bug
          Components: Core Java Framework
    Affects Versions: 2.2.1
            Reporter: Marshall Schor
            Assignee: Adam Lally
            Priority: Minor


Language specifications in capabilities are compared frequently in the operation of the UIMA Framework.  In some places, the language strings are "normalized" by converting to lower case and replacing '_' with '-'.  In other places this isn't done.  This should be done uniformly, when the language specification is initially parsed, for performance reasons.  A suggested normalization would be something like
         language.toLowerCase(Locale.ENGLISH).replace('_','-')
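
As a minimal, self-contained sketch of the suggested normalization (the class name LanguageNormalizer and the method name normalize are hypothetical, chosen only for illustration, and are not part of UIMA), the one-liner could be wrapped like this. Passing an explicit Locale keeps the lower-casing independent of the JVM default locale; under a Turkish default locale, for example, "I" would otherwise lower-case to a dotless "ı".

```java
import java.util.Locale;

// Hypothetical helper, not UIMA code: wraps the normalization suggested in the issue.
public class LanguageNormalizer {

    // Lower-case with a fixed locale, then replace '_' with '-'.
    static String normalize(String language) {
        return language.toLowerCase(Locale.ENGLISH).replace('_', '-');
    }

    public static void main(String[] args) {
        System.out.println(normalize("en_US")); // en-us
        System.out.println(normalize("pt_BR")); // pt-br
        System.out.println(normalize("zh-CN")); // zh-cn
    }
}
```

With strings normalized once in this way, later comparisons can be plain equals() calls instead of repeating the conversion at each comparison site.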


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (UIMA-722) Fix parsing of language specifications to normalize them

Posted by "Marshall Schor (JIRA)" <ui...@incubator.apache.org>.

    [ https://issues.apache.org/jira/browse/UIMA-722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12562988#action_12562988 ] 

Marshall Schor commented on UIMA-722:
-------------------------------------

This will be fixed in the setter for the languages in Capability_impl.  That covers the parsers as well as programmatic (API) calls that change the languages.  Reassigned to Schor.
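
To illustrate the idea of normalizing in the setter, here is a rough sketch under stated assumptions. The class CapabilitySketch is a hypothetical stand-in and is not the actual Capability_impl code; it only shows how a single setter can normalize every value, whatever the caller (a parser or an API user) passes in.

```java
import java.util.Locale;

// Hypothetical stand-in for illustration only; not the real Capability_impl.
public class CapabilitySketch {

    private String[] languagesSupported = new String[0];

    // Every language string is normalized once, at the point where it is stored.
    public void setLanguagesSupported(String[] languages) {
        String[] normalized = new String[languages.length];
        for (int i = 0; i < languages.length; i++) {
            normalized[i] = languages[i].toLowerCase(Locale.ENGLISH).replace('_', '-');
        }
        this.languagesSupported = normalized;
    }

    // Returns a copy so callers cannot bypass the setter by mutating the array.
    public String[] getLanguagesSupported() {
        return languagesSupported.clone();
    }
}
```

Because both code paths end up in the same setter, readers of the stored values can assume they are already in normalized form.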

> Fix parsing of language specifications to normalize them
> --------------------------------------------------------
>
>                 Key: UIMA-722
>                 URL: https://issues.apache.org/jira/browse/UIMA-722
>             Project: UIMA
>          Issue Type: Bug
>          Components: Core Java Framework
>    Affects Versions: 2.2.1
>            Reporter: Marshall Schor
>            Assignee: Marshall Schor
>            Priority: Minor
>
> Language specifications in capabilities are compared frequently in the operation of the UIMA Framework.  In some places, the language strings are "normalized" by converting to lower case and replacing '_' with '-'.  In other places this isn't done.  This should be done uniformly, when the language specification is initially parsed, for performance reasons.  A suggested normalization would be something like
>          language.toLowerCase(Locale.ENGLISH).replace('_','-')

[jira] Resolved: (UIMA-722) Fix parsing of language specifications to normalize them

Posted by "Marshall Schor (JIRA)" <ui...@incubator.apache.org>.

     [ https://issues.apache.org/jira/browse/UIMA-722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor resolved UIMA-722.
---------------------------------

       Resolution: Fixed
    Fix Version/s: 2.3

> Fix parsing of language specifications to normalize them
> --------------------------------------------------------
>
>                 Key: UIMA-722
>                 URL: https://issues.apache.org/jira/browse/UIMA-722
>             Project: UIMA
>          Issue Type: Bug
>          Components: Core Java Framework
>    Affects Versions: 2.2.1
>            Reporter: Marshall Schor
>            Assignee: Marshall Schor
>            Priority: Minor
>             Fix For: 2.3
>
>
> Language specifications in capabilities are compared frequently in the operation of the UIMA Framework.  In some places, the language strings are "normalized" by converting to lower case and replacing '_' with '-'.  In other places this isn't done.  This should be done uniformly, when the language specification is initially parsed, for performance reasons.  A suggested normalization would be something like
>          language.toLowerCase(Locale.ENGLISH).replace('_','-')

[jira] Assigned: (UIMA-722) Fix parsing of language specifications to normalize them

Posted by "Marshall Schor (JIRA)" <ui...@incubator.apache.org>.

     [ https://issues.apache.org/jira/browse/UIMA-722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor reassigned UIMA-722:
-----------------------------------

    Assignee: Marshall Schor  (was: Adam Lally)

> Fix parsing of language specifications to normalize them
> --------------------------------------------------------
>
>                 Key: UIMA-722
>                 URL: https://issues.apache.org/jira/browse/UIMA-722
>             Project: UIMA
>          Issue Type: Bug
>          Components: Core Java Framework
>    Affects Versions: 2.2.1
>            Reporter: Marshall Schor
>            Assignee: Marshall Schor
>            Priority: Minor
>
> Language specifications in capabilities are compared frequently in the operation of the UIMA Framework.  In some places, the language strings are "normalized" by converting to lower case and replacing '_' with '-'.  In other places this isn't done.  This should be done uniformly, when the language specification is initially parsed, for performance reasons.  A suggested normalization would be something like
>          language.toLowerCase(Locale.ENGLISH).replace('_','-')

[jira] Closed: (UIMA-722) Fix parsing of language specifications to normalize them

Posted by "Marshall Schor (JIRA)" <ui...@incubator.apache.org>.

     [ https://issues.apache.org/jira/browse/UIMA-722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor closed UIMA-722.
-------------------------------


Created new Jira issue http://issues.apache.org/jira/browse/UIMA-962 for the remaining cleanup.

> Fix parsing of language specifications to normalize them
> --------------------------------------------------------
>
>                 Key: UIMA-722
>                 URL: https://issues.apache.org/jira/browse/UIMA-722
>             Project: UIMA
>          Issue Type: Bug
>          Components: Core Java Framework
>    Affects Versions: 2.2.1
>            Reporter: Marshall Schor
>            Assignee: Marshall Schor
>            Priority: Minor
>             Fix For: 2.2.2
>
>
> Language specifications in capabilities are compared frequently in the operation of the UIMA Framework.  In some places, the language strings are "normalized" by converting to lower case and replacing '_' with '-'.  In other places this isn't done.  This should be done uniformly, when the language specification is initially parsed, for performance reasons.  A suggested normalization would be something like
>          language.toLowerCase(Locale.ENGLISH).replace('_','-')
