You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@daffodil.apache.org by "Dave Thompson (Jira)" <ji...@apache.org> on 2020/11/17 22:02:00 UTC

[jira] [Closed] (DAFFODIL-2363) pattern facet can't use  notation. Makes validating NUL very hard.

     [ https://issues.apache.org/jira/browse/DAFFODIL-2363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dave Thompson closed DAFFODIL-2363.
-----------------------------------

Verified the specified commit (commit bc4526448d52359faed40e9b5ee3e8e8924e1797) is included in the latest pull from the incubator-daffodil repository.

Verified the all incubator-daffodil sbt test suites execute successfully including the added test.

Verified the nightly test schemas compile and save successfully.

Verified the nightly test suite executes successfully.

> pattern facet can't use &#xE000; notation. Makes validating NUL very hard. 
> ---------------------------------------------------------------------------
>
>                 Key: DAFFODIL-2363
>                 URL: https://issues.apache.org/jira/browse/DAFFODIL-2363
>             Project: Daffodil
>          Issue Type: Bug
>          Components: Back End
>    Affects Versions: 2.7.0
>            Reporter: Mike Beckerle
>            Assignee: Mike Beckerle
>            Priority: Critical
>              Labels: beginner
>             Fix For: 3.0.0
>
>
> See test_nulPattern1. 
> This bug is a real pain in the neck. 
> I want to capture NUL padding regions and insure they are all NUL. 
> These come through to XML as These U+E000 characters. I need to insure that the string contains only those.
> So I'd like to use a 
> {code:xml}
> <xs:simpleType name="allNULStringType>
>   <xs:restriction base="xs:string">
>      <xs:pattern value="&#xE000;*"/>
>   </xs:restriction>
> </xs:simpleType>
> {code}
> I consider this data well-formed (should parse) even if other bytes are there that aren't NUL, but such data is invalid. So I want the facet to check for all NUL chars (or these E000 things that Daffodil puts in XML because XML can't contain actual NUL chars).
> The test fails with 
> {code:java}
> org.apache.daffodil.tdml.TDMLExceptionImpl: (Implementation: daffodil) Validation errors found where none were expected by the test case.
> Validation Error: ex:foo failed facet checks due to: facet pattern(s): *
> {code}
> If github does the right thing that pattern will look like the E000 box char and *. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)