You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "Beam JIRA Bot (Jira)" <ji...@apache.org> on 2020/08/22 17:07:02 UTC

[jira] [Commented] (BEAM-10268) AssertionError('Missing boto3 requirement')

    [ https://issues.apache.org/jira/browse/BEAM-10268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17182438#comment-17182438 ] 

Beam JIRA Bot commented on BEAM-10268:
--------------------------------------

This issue is assigned but has not received an update in 30 days so it has been labeled "stale-assigned". If you are still working on the issue, please give an update and remove the label. If you are no longer working on the issue, please unassign so someone else may work on it. In 7 days the issue will be automatically unassigned.

> AssertionError('Missing boto3 requirement')
> -------------------------------------------
>
>                 Key: BEAM-10268
>                 URL: https://issues.apache.org/jira/browse/BEAM-10268
>             Project: Beam
>          Issue Type: Bug
>          Components: beam-community
>            Reporter: Nikhil
>            Assignee: Aizhamal Nurmamat kyzy
>            Priority: P2
>              Labels: newbie, stale-assigned
>
> *Error as:*
> apache_beam.io.filesystem.BeamIOError: Match operation failed with exceptions \{'s3://xxxxxx.csv': BeamIOError("exists() operation failed with exceptions {'s3://xxxxxx.csv': AssertionError('Missing boto3 requirement')}")}
>  
> *Agenda:*
> I am writing a dataflow pipeline in python to import data from s3 bucket. I am running this code as dataflow runner.
> Could you please guide where I am going wrong?
>  
> *I have written code as:*
> *class CustomPipelineOptions(PipelineOptions):*
>  @classmethod
>  def _add_argparse_args(cls, parser):
>   parser.add_value_provider_argument('--input', help='Path of the file to read from')
>   parser.add_value_provider_argument('--output',help='Output file to write results to.')
>   parser.add_value_provider_argument('--source_bucket',help='AWS source bucket name.')
>   parser.add_value_provider_argument('--aws_access_key_id',help='AWS access key id.')
>  parser.add_value_provider_argument('--aws_secret_access_key',help='Your AWS secret  access key.')
>   parser.add_value_provider_argument('--aws_default_region',help='Sets STS endpoint resolution logic.')
>  
> *if __name__ == '__main__':*
>   parser = argparse.ArgumentParser()
>   # Parse arguments from the command line.
>   pipeline_args = parser.parse_known_args(argv)
>   pipeline_options = PipelineOptions(pipeline_args)
>   pipeline_options.view_as(SetupOptions).save_main_session = True
>   p = beam.Pipeline(options=pipeline_options)
>  
>   p1 = (p | 'Read assetdb' >> beam.io.ReadFromText(custom_options.input))
>   p.run().wait_until_finish()



--
This message was sent by Atlassian Jira
(v8.3.4#803005)