You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@zeppelin.apache.org by bzz <gi...@git.apache.org> on 2016/11/04 10:11:25 UTC

[GitHub] zeppelin issue #928: [ZEPPELIN-116] Add Apache Mahout Interpreter

Github user bzz commented on the issue:

    https://github.com/apache/zeppelin/pull/928
  
    Sorry for delay, @rawkintrevo ! 
    
    Thank you for kind reminder on this great stuff. I want to mention this work on ApacheCon, so hope we can merge this asap :)
    
    I tied it and got `IOError: [Errno 28] No space left on device` while downloading Mahout. While I'm cleaning the space, here are few things that I noticed:
      - script does not "fail fast", meaning that if downloading of Mahout failed due to disk space, it went on trying to create dirs, update .json, etc. How do you think, is it worth adding it?
    
      - output uses the term `terp`
    
      ```
    created new terp 'sparkMahout' from terp 'spark
    created new terp 'flinkMahout' from terp 'flink
      ```
      I assume it's used instead of longer `interpreter`. But other users may get confused at this point, as documentation does not mention such term. Do you think we use `interpreter` consistently here as well?
    
    - is `python scripts/mahout/add_mahout.py` idempotent? 
    
      Meaning, if it failed once for some reason, is it safe to re-run again with same args and not get 2 more interpreters created?
    
    I will clean up some space and try it again and post back.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---