You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Antoine Pitrou (Jira)" <ji...@apache.org> on 2020/10/15 13:06:00 UTC
[jira] [Created] (ARROW-10313) [C++] Improve UTF8 validation speed
and CSV string conversion
Antoine Pitrou created ARROW-10313:
--------------------------------------
Summary: [C++] Improve UTF8 validation speed and CSV string conversion
Key: ARROW-10313
URL: https://issues.apache.org/jira/browse/ARROW-10313
Project: Apache Arrow
Issue Type: Improvement
Components: C++
Reporter: Antoine Pitrou
Assignee: Antoine Pitrou
Fix For: 3.0.0
Based on profiling from ARROW-10308, UTF8 validation is a bottleneck of CSV string conversion.
This is because we must validate many small UTF8 strings individually.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)