You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@atlas.apache.org by "Ashutosh Mestry (Jira)" <ji...@apache.org> on 2020/10/01 16:14:00 UTC

[jira] [Resolved] (ATLAS-3953) JSON Files from Export API with "?" char for string with special chars

     [ https://issues.apache.org/jira/browse/ATLAS-3953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ashutosh Mestry resolved ATLAS-3953.
------------------------------------
    Resolution: Fixed

> JSON Files from Export API with "?" char for string with special chars 
> -----------------------------------------------------------------------
>
>                 Key: ATLAS-3953
>                 URL: https://issues.apache.org/jira/browse/ATLAS-3953
>             Project: Atlas
>          Issue Type: Bug
>          Components:  atlas-core
>    Affects Versions: 2.1.0
>         Environment: Apache Atlas 2.1.0 embedded HBASE and SOLR
>            Reporter: Carlos Alberto Rocha Cardoso
>            Assignee: Ashutosh Mestry
>            Priority: Major
>         Attachments: 9fdc3ad0-46c2-430a-89c4-4a751d31c064.json, ATLAS-3953-Export-ZipSink-Specify-character-endcodin.patch, Asset_Imported.PNG, AtlasServer.PNG, a5c148bf-5ab6-4c49-853e-855842102128.json, atlas_export.zip, path.zip
>
>
> The Export API returns a ZIP file with some JSON files describing Atlas Entities and TypeDefs.
> I am having an issue where some special chars in JSON are being replaced by "?" chars.
> An Entity name like "Distribuição" was exported in JSON file like "Distribui??o". The special chars "çã" was replaced for the "??" chars.
> I tried to change the exported JSON file encoding and the request header for Export API but without success.
> After analyzing the Atlas source code, especially the *splitAndWriteBytes* method of the *[ZipSink|https://github.com/apache/atlas/blob/cc601d7371fae1dbc16b55d1ca84f06b745700dc/repository/src/main/java/org/apache/atlas/repository/impexp/ZipSink.java] class*, I thought if maybe the problem is because the *s.getBytes()* is returning the JSON string to be written to ZIP with another encoding than *UTF-8*, and maybe set the encode like *s.getBytes(StandardCharsets.UTF_8)* could be a solution.
> It's my first contact with the Atlas source code, and I'm not a JAVA programmer, so it's only a guess.
> I saw that it's possible to set the default to encode to the platform or JVM, but how they said in this below discussion, perhaps this doesn't work properly in all situations.
> [https://stackoverflow.com/questions/361975/setting-the-default-java-character-encoding]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)