You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "tustvold (via GitHub)" <gi...@apache.org> on 2023/04/25 19:25:36 UTC

[GitHub] [arrow-rs] tustvold opened a new issue, #4129: Inconsistent CSV Inference and Parsing DateTime Handling

tustvold opened a new issue, #4129:
URL: https://github.com/apache/arrow-rs/issues/4129

   **Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
   <!--
   A clear and concise description of what the problem is. Ex. I'm always frustrated when [...] 
   (This section helps Arrow developers understand the context and *why* for this feature, in addition to  the *what*)
   -->
   
   Following #3746 the CSV schema inference infers datetime strings as Timestamps, not Date64, as this was incorrect #3744.
   
   However, the CSV reader still uses datetime_regex for Date64 columns, which is incorrect.
   
   Following #3794 the timestamp parsing logic has got a whole lot more sophisticated, and it is unclear how to support arbitrary format strings with it.
   
   **Describe the solution you'd like**
   <!--
   A clear and concise description of what you want to happen.
   -->
   
   Given this functionality has been broken since #3746 and nobody has noticed, and no tests failed, I think we should just remove it
   
   **Describe alternatives you've considered**
   <!--
   A clear and concise description of any alternative solutions or features you've considered.
   -->
   
   **Additional context**
   <!--
   Add any other context or screenshots about the feature request here.
   -->
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-rs] tustvold closed issue #4129: Inconsistent CSV Inference and Parsing DateTime Handling

Posted by "tustvold (via GitHub)" <gi...@apache.org>.
tustvold closed issue #4129: Inconsistent CSV Inference and Parsing DateTime Handling
URL: https://github.com/apache/arrow-rs/issues/4129


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-rs] tustvold commented on issue #4129: Inconsistent CSV Inference and Parsing DateTime Handling

Posted by "tustvold (via GitHub)" <gi...@apache.org>.
tustvold commented on issue #4129:
URL: https://github.com/apache/arrow-rs/issues/4129#issuecomment-1536259443

   `label_issue.py` automatically added labels {'arrow'} from #4133


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-rs] tustvold commented on issue #4129: Inconsistent CSV Inference and Parsing DateTime Handling

Posted by "tustvold (via GitHub)" <gi...@apache.org>.
tustvold commented on issue #4129:
URL: https://github.com/apache/arrow-rs/issues/4129#issuecomment-1536259402

   `label_issue.py` automatically added labels {'parquet'} from #4133


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org