Posted to issues@beam.apache.org by "Kenneth Knowles (Jira)" <ji...@apache.org> on 2022/01/16 14:16:00 UTC
[jira] [Commented] (BEAM-10061) ReadAllFromTextWithFilename
[ https://issues.apache.org/jira/browse/BEAM-10061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476799#comment-17476799 ]
Kenneth Knowles commented on BEAM-10061:
----------------------------------------
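Great news: you can already do this with FileIO.readMatches. It emits each matched file as a ReadableFile, so the file name stays available alongside the contents. There are examples in the Java SDK Javadoc at https://beam.apache.org/releases/javadoc/2.35.0/org/apache/beam/sdk/io/FileIO.html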
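Since the issue lists Python on Dataflow as the environment, here is a rough sketch of what the same approach might look like in the Python SDK. It assumes the apache_beam.io.fileio transforms MatchAll and ReadMatches; the helper name lines_with_filename, the run function, and the bucket path in the comment are hypothetical and only there for illustration. It is not a definitive implementation.

```python
def lines_with_filename(readable_file):
    """Yield (file path, line) pairs for one matched file.

    `readable_file` is expected to behave like
    apache_beam.io.fileio.ReadableFile: it exposes `metadata.path`
    and `read_utf8()`.
    """
    path = readable_file.metadata.path
    # read_utf8() loads the whole file into memory. That is fine for a
    # sketch, but large files may call for streaming via open() instead.
    for line in readable_file.read_utf8().splitlines():
        yield path, line


def run(file_patterns):
    """Build and run a pipeline over an iterable of file patterns."""
    # Imported here so the helper above can be exercised without Beam.
    import apache_beam as beam
    from apache_beam.io import fileio

    with beam.Pipeline() as pipeline:
        (
            pipeline
            | "Patterns" >> beam.Create(list(file_patterns))
            | "MatchAll" >> fileio.MatchAll()
            | "ReadMatches" >> fileio.ReadMatches()
            | "LinesWithFilename" >> beam.FlatMap(lines_with_filename)
            | "Print" >> beam.Map(print)
        )


# Example call (hypothetical bucket): run(["gs://my-bucket/input/*.txt"])
```

Starting from a PCollection of patterns with MatchAll covers the "ReadAll" half of the request, and keeping the ReadableFile until the FlatMap step covers the "WithFilename" half.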
> ReadAllFromTextWithFilename
> ---------------------------
>
> Key: BEAM-10061
> URL: https://issues.apache.org/jira/browse/BEAM-10061
> Project: Beam
> Issue Type: New Feature
> Components: io-java-files, io-py-files
> Environment: Dataflow with Python
> Reporter: Ryan Canty
> Priority: P3
> Labels: Clarified
>
> I am trying to create a job that reads from GCS, executes some code against each line, and creates a PCollection with the line and the file. So basically what I'd like is a combination of textio.ReadTextWithFilename and textio.ReadAllFromText.