You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@daffodil.apache.org by "Michael Beckerle (JIRA)" <ji...@apache.org> on 2018/10/23 16:47:00 UTC

[jira] [Updated] (DAFFODIL-1917) Support for new character classes

     [ https://issues.apache.org/jira/browse/DAFFODIL-1917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael Beckerle updated DAFFODIL-1917:
---------------------------------------
    Priority: Minor  (was: Major)

> Support for new character classes
> ---------------------------------
>
>                 Key: DAFFODIL-1917
>                 URL: https://issues.apache.org/jira/browse/DAFFODIL-1917
>             Project: Daffodil
>          Issue Type: New Feature
>          Components: Back End, Front End, Middle &quot;End&quot;
>            Reporter: Steve Lawrence
>            Priority: Minor
>
> Daffodil v2.0 wishlist includes new character class entities LSP, LSP* and LSP+, which are the same as WSP except they do not match line endings. This is useful as currently it is impossible to match an empty line followed by a newline via something like %WSP*;%NL; due to the greediness of WSP* and the inability for our delimiter scanner to backtrack of forward lookahead.
> Related, it might be useful to also add NL+ and NL* to match an unbounded number of newlines.
> Additionally, it might be a good idea to cause an SDE if a character class that contains * or
> + is immediately followed by a character inside that character class. The reason being that such sequences will always fail without lookahead or backtracking, which probalby isn't worth implementing.
> For security reasons, we should also implement a tunable to limit the number or repetitions of these character entities.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)