Posted to dev@uima.apache.org by "Peter Klügl (JIRA)" <de...@uima.apache.org> on 2015/07/01 10:50:04 UTC

[jira] [Commented] (UIMA-4453) MARKTABLE action works improperly

    [ https://issues.apache.org/jira/browse/UIMA-4453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14609760#comment-14609760 ] 

Peter Klügl commented on UIMA-4453:
-----------------------------------

I'm glad to hear that. I don't know what went wrong, but I created a new issue in order to investigate that.

Btw, I personally always work with the latest sources, including in the UIMA Ruta Workbench. I do not create new update sites; instead, I start a complete Eclipse (with all plugins) from within Eclipse (the workspace contains the trunk) via "Run As > Eclipse Application". This way, I have the current functionality of the trunk as well as hot code replacement.

> MARKTABLE action works improperly
> ---------------------------------
>
>                 Key: UIMA-4453
>                 URL: https://issues.apache.org/jira/browse/UIMA-4453
>             Project: UIMA
>          Issue Type: Bug
>          Components: ruta
>    Affects Versions: 2.3.0ruta
>         Environment: OS X 10.9.1, Java v8u45, Eclipse Luna
> Windows 7, Java v8u45, Eclipse Luna
>            Reporter: Oleg Fedoriaka
>            Assignee: Peter Klügl
>             Fix For: 2.3.1ruta
>
>         Attachments: ruta-2.3.1-SNAPSHOT.tar.gz
>
>   Original Estimate: 96h
>  Remaining Estimate: 96h
>
> The newly available UIMA Ruta Runtime 2.7.0 & Workbench 2.3.0 for Eclipse has lost proper functionality of the MARKTABLE action. The action no longer annotates all words from a CSV file. I noticed that the problem occurs only for entries written in Cyrillic which contain spaces; for Latin text it works fine. Please use the sample outlined below to reproduce the problem I'm talking about.
> # script/main.ruta
> WORDTABLE Dict = 'dict.csv';
> DECLARE Annotation Test (STRING meaning);
> Document {-> MARKTABLE(Test,1,Dict, "meaning" = 2)};
> # resources/dict.csv
> від;from
> с какой стати;why
> с которой;fromWhich
> сюда;here
> по какому;which
> сюди;here
> как нибудь;somehow
> сколько;howMuch
> # input/test.txt
> від с какой стати с которой сюда по какому сюди как нибудь сколько
> After executing the main.ruta script, not everything from test.txt is annotated. It is worth mentioning that a Cyrillic letter like 'с' at the beginning of a string somehow affects the processing behavior. Moreover, removing the lines that contain spaces makes the issue described above go away.
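
For reference, the expected result of the reproduction above can be approximated outside of Ruta. The following is only a minimal sketch, assuming plain substring search as a stand-in for whatever matching MARKTABLE actually performs; the function name `expected_matches` is hypothetical and not part of UIMA Ruta. It lists the entries from dict.csv that occur in test.txt, i.e. the spans one would hope to see annotated as Test.

```python
# Hypothetical sanity check, not part of UIMA Ruta: find every entry of
# dict.csv in test.txt by plain substring search (an assumed simplification).
DICT_CSV = """від;from
с какой стати;why
с которой;fromWhich
сюда;here
по какому;which
сюди;here
как нибудь;somehow
сколько;howMuch"""

TEXT = "від с какой стати с которой сюда по какому сюди как нибудь сколько"


def expected_matches(dict_csv, text):
    """Return sorted (begin, end, entry, meaning) tuples for entries found in text."""
    matches = []
    for line in dict_csv.splitlines():
        # Column 1 is the lookup string, column 2 the value for "meaning".
        entry, meaning = line.split(";")
        start = text.find(entry)
        while start != -1:
            matches.append((start, start + len(entry), entry, meaning))
            start = text.find(entry, start + 1)
    return sorted(matches)


if __name__ == "__main__":
    for begin, end, entry, meaning in expected_matches(DICT_CSV, TEXT):
        print(begin, end, entry, meaning)
```

Under this assumption every one of the eight dictionary entries is found exactly once, including the multi-word ones, so any entry missing from the Ruta result points at the behavior described in the report.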



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)