You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@superset.apache.org by GitBox <gi...@apache.org> on 2021/02/03 10:18:55 UTC

[GitHub] [superset] ethanhuang1009 opened a new issue #12916: when i upload a csv file with chinese characters, faced a error

ethanhuang1009 opened a new issue #12916:
URL: https://github.com/apache/superset/issues/12916


   A clear and concise description of what the bug is.
   
   ### Expected results
   upload success
   
   ### Actual results
   upload failed and I faced an error without any solutions
   
   #### Screenshots
   
   **Unable to upload CSV file "11.csv" to table "11" in database "test". Error message: 'utf-8' codec can't decode byte 0xd4 in position 0: invalid continuation byte**
   
   
   #### How to reproduce the bug
   
   1. Go to 'upload a csv'
   2. Click on 'select a file.'
   3. Scroll down to '....'
   4. See an error
   
   ### Environment
   
   (please complete the following information):
   
   - superset version: `1.0.0`
   - python version: `python3.8`
   - node.js version: `node -v`
   
   ### Checklist
   
   Make sure to follow these steps before submitting your issue - thank you!
   
   - [ ] I have checked the superset logs for python stacktraces and included it here as text if there are any.
   - [ ] I have reproduced the issue with at least the latest released version of superset.
   - [ ] I have checked the issue tracker for the same issue and I haven't found one similar.
   
   ### Additional [context]
   
   Add any other context about the problem here.
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org


[GitHub] [superset] ethanhuang1009 commented on issue #12916: How can I upload a CSV file with non-UTF8 character set

Posted by GitBox <gi...@apache.org>.
ethanhuang1009 commented on issue #12916:
URL: https://github.com/apache/superset/issues/12916#issuecomment-772950494


   @zhaoyongjie  Thanks you very much. I change the file encoding format from **utf8-with-bom** to utf8, then I can upload the file successfully. And I hope Superset can find and give us a better solution for the non-utf8 character set files in the near future.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org


[GitHub] [superset] zhaoyongjie edited a comment on issue #12916: when i upload a csv file with chinese characters, faced a error

Posted by GitBox <gi...@apache.org>.
zhaoyongjie edited a comment on issue #12916:
URL: https://github.com/apache/superset/issues/12916#issuecomment-772602488


   @ethanhuang1009 
   Please check the CSV file enconding.
   1. Do not use CSV file exported from Excel on Windows plantform, Excel exported CSV file encoding is `utf8-with-bom`. 
   2. The Superset only supports utf8 encoding CSV file import, so please carefully check the file encoding format.
   
   This is a rough encoding issue,  in the future we can use some solution guess that file encoding.
   
   ======================
   In Chinese
   抱歉, 这是个 CSV编码问题.
   1. 不要直接使用 Excel 导出的CSV, 他是 UTF8-bom 格式.
   2. 当前只支持 UTF8 格式导入, 其他比如 CP936(GB2312) 等没有很好支持. 
   这是个比较麻烦的编码问题, 您检查下文件编码格式.如果是 UTF8 一定可以. 你也可以使用 SublimeText 这类工具转换一下编码.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org


[GitHub] [superset] zhaoyongjie edited a comment on issue #12916: when i upload a csv file with chinese characters, faced a error

Posted by GitBox <gi...@apache.org>.
zhaoyongjie edited a comment on issue #12916:
URL: https://github.com/apache/superset/issues/12916#issuecomment-772602488


   @ethanhuang1009 
   Please check the CSV file enconding.
   1. Do not use CSV file exported from Excel on Windows plantform, Excel exported CSV file encoding is `utf8-with-bom`. 
   2. The Superset only supports utf8 encoding CSV file import, so please carefully check the file encoding format.
   
   This is a rough encoding issue,  in the future we can use some solution guess that file encoding.
   
   ======================
   抱歉, 这是个 CSV编码问题.
   1. 不要直接使用 Excel 导出的CSV, 他是 UTF8-bom 格式.
   2. 当前只支持 UTF8 格式导入, 其他比如 CP936(GB2312) 等没有很好支持. 
   这是个比较麻烦的编码问题, 您检查下文件编码格式.如果是 UTF8 一定可以. 你也可以使用 SublimeText 这类工具转换一下编码.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org


[GitHub] [superset] zhaoyongjie commented on issue #12916: when i upload a csv file with chinese characters, faced a error

Posted by GitBox <gi...@apache.org>.
zhaoyongjie commented on issue #12916:
URL: https://github.com/apache/superset/issues/12916#issuecomment-772602488


   @ethanhuang1009 
   Please check the CSV file enconding.
   1. Do not use CSV file exported from Excel on Windows plantform, Excel exported CSV file encoding is `utf8-with-bom`. 
   2. The Superset only supports utf8 encoding CSV file import, so please carefully check the file encoding format.
   
   This is a rough encoding issue,  in the future we can use some solution guess that file encoding.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscribe@superset.apache.org
For additional commands, e-mail: notifications-help@superset.apache.org