You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@uima.apache.org by "Peter Klügl (JIRA)" <de...@uima.apache.org> on 2017/07/10 09:43:00 UTC

[jira] [Commented] (UIMA-5416) Ruta: strange behavior of # matching

    [ https://issues.apache.org/jira/browse/UIMA-5416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16080071#comment-16080071 ] 

Peter Klügl commented on UIMA-5416:
-----------------------------------

I created a new ticket for the problem with combination of the wildcard and the optional rule element. The second problem with the high amount of matches is actually not a problem but a feature. There are double annotations on the periods causing alternative matches all the time.

> Ruta: strange behavior of # matching
> ------------------------------------
>
>                 Key: UIMA-5416
>                 URL: https://issues.apache.org/jira/browse/UIMA-5416
>             Project: UIMA
>          Issue Type: Bug
>          Components: Ruta
>    Affects Versions: 2.6.0ruta
>         Environment: MacOS X and Linux
>            Reporter: Hugues de Mazancourt
>            Assignee: Peter Klügl
>             Fix For: 2.6.1ruta
>
>         Attachments: Loop.ruta, text.txt
>
>
> The attached ruleset Loop.ruta has a strange behavior on the attached text file : the last rule matches 1009 times and creates 1008 annotations where only two are expected. What is very strange is the ability for # to "cross" over the right boundary and match the two lines. 
> This is (at least) counterintuitive as # is supposed to behave like a non-greedy catch-all.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)