You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Kenneth Knowles (Jira)" <ji...@apache.org> on 2022/01/16 14:16:00 UTC

[jira] [Commented] (BEAM-10061) ReadAllFromTextWithFilename

    [ https://issues.apache.org/jira/browse/BEAM-10061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476799#comment-17476799 ] 

Kenneth Knowles commented on BEAM-10061:
----------------------------------------

Great news - you can do this with FileIO.readMatches. There are examples at https://beam.apache.org/releases/javadoc/2.35.0/org/apache/beam/sdk/io/FileIO.html

> ReadAllFromTextWithFilename
> ---------------------------
>
>                 Key: BEAM-10061
>                 URL: https://issues.apache.org/jira/browse/BEAM-10061
>             Project: Beam
>          Issue Type: New Feature
>          Components: io-java-files, io-py-files
>         Environment: Dataflow with Python
>            Reporter: Ryan Canty
>            Priority: P3
>              Labels: Clarified
>
> I am trying to create a job that reads from GCS executes some code against each line and creates a PCollection with the line and the file. So basically what I'd like is a combination of textio.ReadTextWithFilename and textio.ReadAllFromText



--
This message was sent by Atlassian Jira
(v8.20.1#820001)