Posted to dev@any23.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2014/05/22 18:17:01 UTC

[jira] [Created] (ANY23-214) ë (e-umlaut or diaeresis) not decoded in RDF output

Lewis John McGibbney created ANY23-214:
------------------------------------------

             Summary: ë (e-umlaut or diaeresis) not decoded in RDF output
                 Key: ANY23-214
                 URL: https://issues.apache.org/jira/browse/ANY23-214
             Project: Apache Any23
          Issue Type: Improvement
    Affects Versions: 1.0
            Reporter: Lewis John McGibbney
             Fix For: 1.1


{
  "quads": [
    [
      {
        "type": "uri",
        "value": "http:\/\/any23.org\/tmp\/"
      },
      "http:\/\/purl.org\/dc\/terms\/title",
      {
        "type": "literal",
        "value": "Ti\u00c3\u00absto - Wikipedia, the free encyclopedia",
        "lang": null,
        "datatype": null
      },
      null
    ],
    [
      {
        "type": "bnode",
        "value": "node57ae60d4152c8125e46a2ce59389333b"
      },
      "http:\/\/www.w3.org\/2006\/vcard\/ns#fn",
      {
        "type": "literal",
        "value": "Ti\u00c3\u00absto",
        "lang": null,
        "datatype": null
      },
      null
    ],
...

The above is the output from extracting triples from the following page with the Any23 Service:
https://en.wikipedia.org/wiki/Tiesto
Both literal values contain "Ti\u00c3\u00absto" where "Ti\u00ebsto" (Tiësto) is expected, so the e-umlaut is clearly not being decoded correctly.
We should fix this.
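
The escaped sequence \u00c3\u00ab is what appears when the two UTF-8 bytes of "ë" (0xC3 0xAB) are each read as a separate ISO-8859-1 character. The issue does not state the root cause, so the following is only a minimal sketch, assuming that is what happened somewhere between fetching the page and serialising the output. The helper name repair_mojibake is hypothetical and is not part of Any23.

```python
def repair_mojibake(text: str) -> str:
    """Undo UTF-8 bytes that were mistakenly decoded as ISO-8859-1.

    Hypothetical helper for illustration only. If the text cannot be
    round-tripped this way, it is returned unchanged.
    """
    try:
        return text.encode("latin-1").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text


garbled = "Ti\u00c3\u00absto"   # value seen in the JSON output above
expected = "Ti\u00ebsto"        # "Tiësto" with a real e-diaeresis

# Reproduce the symptom: UTF-8 bytes read with the wrong charset.
assert expected.encode("utf-8").decode("latin-1") == garbled

# Reverse it.
print(repair_mojibake(garbled))  # Tiësto
```

A post-hoc repair like this only illustrates the symptom. If the assumption holds, the actual fix would be to decode the fetched document with the correct charset in the first place.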


