You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@devlake.apache.org by GitBox <gi...@apache.org> on 2022/10/25 02:40:15 UTC

[GitHub] [incubator-devlake] likyh opened a new issue, #3564: [Bug][github] invalid byte sequence for encoding "UTF8": 0x00 (SQLSTATE 22021)

likyh opened a new issue, #3564:
URL: https://github.com/apache/incubator-devlake/issues/3564

   ### Search before asking
   
   - [X] I had searched in the [issues](https://github.com/apache/incubator-devlake/issues?q=is%3Aissue) and found no similar issues.
   
   
   ### What happened
   
   ![image](https://user-images.githubusercontent.com/3294100/197649941-0a14fd1c-ed98-4a56-ab76-9e932285abe8.png)
   invalid byte sequence for encoding "UTF8": 0x00 (SQLSTATE 22021).
   
   `desc` in this issue [INSERT SELECT: SETTINGS clause after FORMAT can lead to parsing issues · Issue #20343 · ClickHouse/C](https://github.com/ClickHouse/ClickHouse/issues/20343) fail to save .
    
   https://github.com/likyh/test-for-0x00/issues/1。
   
   ### What you expected to happen
   
   collect success
   
   ### How to reproduce
   
   try to collect ClickHouse `https://github.com/ClickHouse/ClickHouse` or the mock repo `https://github.com/likyh/test-for-0x00`. The mock repo does not always fail.
   
   ### Anything else
   
   _No response_
   
   ### Version
   
   main
   
   ### Are you willing to submit PR?
   
   - [X] Yes I am willing to submit a PR!
   
   ### Code of Conduct
   
   - [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@devlake.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [incubator-devlake] likyh commented on issue #3564: [Bug][github] invalid byte sequence for encoding "UTF8": 0x00 (SQLSTATE 22021)

Posted by GitBox <gi...@apache.org>.
likyh commented on issue #3564:
URL: https://github.com/apache/incubator-devlake/issues/3564#issuecomment-1293649886

   The char '0x00' can not be saved in `text` in Postgres. It can be saved in type `bytea` but it must define the column in golang as `[]byte`.
   
   ![image](https://user-images.githubusercontent.com/3294100/198324068-cc29f0a5-40a7-4de4-8cc6-e95fdbb64ac5.png)
   
   
   It is not convenient to be used.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@devlake.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [incubator-devlake] likyh commented on issue #3564: [Bug][github] invalid byte sequence for encoding "UTF8": 0x00 (SQLSTATE 22021)

Posted by GitBox <gi...@apache.org>.
likyh commented on issue #3564:
URL: https://github.com/apache/incubator-devlake/issues/3564#issuecomment-1294389877

   Unicode     |        UTF-8 
   ----------------------+---------------------------------------------
   0000 0000-0000 007F | 0xxxxxxx
   0000 0080-0000 07FF | 110xxxxx 10xxxxxx
   0000 0800-0000 FFFF | 1110xxxx 10xxxxxx 10xxxxxx
   0001 0000-0010 FFFF | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
   
   All multi-byte char in utf8 are start with byte `1`. So all ASCII char will not exist in other char.
   
   https://www.ietf.org/rfc/rfc3629.txt
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@devlake.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [incubator-devlake] klesh closed issue #3564: [Bug][github] invalid byte sequence for encoding "UTF8": 0x00 (SQLSTATE 22021)

Posted by GitBox <gi...@apache.org>.
klesh closed issue #3564: [Bug][github] invalid byte sequence for encoding "UTF8": 0x00 (SQLSTATE 22021)
URL: https://github.com/apache/incubator-devlake/issues/3564


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@devlake.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org