Posted to user@spark.apache.org by "Afshin, Bardia" <ba...@changehealthcare.com> on 2018/03/07 18:21:34 UTC

Spark-submit Py-files with EMR add step?

I’m writing to ask the community to demystify the --py-files parameter when working with spark-submit and Python projects.

Currently I have a project, say

Src/

  *   Main.py
  *   Modules/module1.py

When I zip up the src directory and submit it to Spark via an EMR add step, the namespacing is lost.

Main.py example:
from Modules.module1 import SomeClass

My code returns an error saying it cannot find this class. It works if I log in to the instance, download my project, and submit it to Spark from within the EMR instance via spark-submit, but not when I add it as a step in EMR from an external call.
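
To make the packaging question concrete, here is a minimal sketch of one way the zip could be built so that the import above resolves when the archive is placed on the Python search path, which is what --py-files does with .zip files. It is not a confirmed fix for the EMR step case. The helper name build_py_files_zip, the archive name deps.zip, and the empty Modules/__init__.py are assumptions for illustration and are not part of the project described above.

```python
import os
import sys
import tempfile
import zipfile


def build_py_files_zip(src_dir, zip_path):
    """Zip the .py files under src_dir with paths relative to src_dir.

    Hypothetical helper: package directories such as Modules/ end up at the
    archive root instead of being nested under a top-level Src/ folder.
    """
    with zipfile.ZipFile(zip_path, "w") as zf:
        for root, _dirs, files in os.walk(src_dir):
            for name in files:
                if name.endswith(".py"):
                    full_path = os.path.join(root, name)
                    # Archive name becomes e.g. "Modules/module1.py"
                    zf.write(full_path, os.path.relpath(full_path, src_dir))
    return zip_path


# Recreate the layout from this post in a temporary directory.
work_dir = tempfile.mkdtemp()
src_dir = os.path.join(work_dir, "Src")
os.makedirs(os.path.join(src_dir, "Modules"))

# Assumption: an __init__.py marks Modules as a regular package.
with open(os.path.join(src_dir, "Modules", "__init__.py"), "w") as f:
    f.write("")
with open(os.path.join(src_dir, "Modules", "module1.py"), "w") as f:
    f.write("class SomeClass:\n    pass\n")

zip_path = build_py_files_zip(src_dir, os.path.join(work_dir, "deps.zip"))

with zipfile.ZipFile(zip_path) as zf:
    names = sorted(zf.namelist())

# Simulate the archive being on the Python path, then import from it.
sys.path.insert(0, zip_path)
from Modules.module1 import SomeClass
```

If the archive were instead built from the parent directory, its entries would start with Src/ and the import would have to be written against Src.Modules.module1, which may be one way the namespacing appears to be lost.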


Help?

Best,
Bardia