You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Kenneth Knowles (Jira)" <ji...@apache.org> on 2021/09/09 17:20:00 UTC

[jira] [Commented] (BEAM-12730) Add custom delimiters to Python TextIO reads

    [ https://issues.apache.org/jira/browse/BEAM-12730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17412719#comment-17412719 ] 

Kenneth Knowles commented on BEAM-12730:
----------------------------------------

Have someone asking about it: https://stackoverflow.com/questions/69113972/does-beam-supports-custom-delimiter-when-reading-from-text-file/69122348

> Add custom delimiters to Python TextIO reads
> --------------------------------------------
>
>                 Key: BEAM-12730
>                 URL: https://issues.apache.org/jira/browse/BEAM-12730
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-py-common, io-py-files
>            Reporter: Daniel Oliveira
>            Priority: P2
>              Labels: beginner, newbie, starter
>
> A common request by users is to be able to separate a text files read by TextIO with delimiters other than newline. The Java SDK already supports this feature.
> The current delimiter code is [located here|https://github.com/apache/beam/blob/v2.31.0/sdks/python/apache_beam/io/textio.py#L236] and defaults to newlines. This function could easily be modified to also handle custom delimiters. Changing this would also necessitate changing the API for the various TextIO.Read methods and adding documentation.
> This seems like a good starter bug for making more in-depth contributions to Beam Python.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)