You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by Saurabh Agrawal <sa...@markit.com> on 2014/11/20 15:04:01 UTC

Please help me get started on Apache Spark

Friends,

I am pretty new to Spark as much as to Scala, MLib and the entire Hadoop stack!! It would be so much help if I could be pointed to some good books on Spark and MLib?

Further, does MLib support any algorithms for B2B cross sell/ upsell or customer retention (out of the box preferably) that I could run on my Sales force data? I am currently using Collaborative filtering but that's essentially B2C.

Thanks in advance!!

Regards,
Saurabh Agrawal

________________________________
This e-mail, including accompanying communications and attachments, is strictly confidential and only for the intended recipient. Any retention, use or disclosure not expressly authorised by Markit is prohibited. This email is subject to all waivers and other terms at the following link: http://www.markit.com/en/about/legal/email-disclaimer.page

Please visit http://www.markit.com/en/about/contact/contact-us.page? for contact information on our offices worldwide.

MarkitSERV Limited has its registered office located at Level 4, Ropemaker Place, 25 Ropemaker Street, London, EC2Y 9LY and is authorized and regulated by the Financial Conduct Authority with registration number 207294

Re: Please help me get started on Apache Spark

Posted by "Guibert. J Tchinde" <jg...@gmail.com>.
For Spark,
You can start with a new book like :
https://www.safaribooksonline.com/library/view/learning-spark/9781449359034/ch01.html
I think the paper book is out now,

You can also have a look on tutorials documentation guide available on :
https://spark.apache.org/docs/1.1.0/mllib-guide.html

There is a lot of good tutorials (google), but I think the best manner
remain to get a case study

Cheers

2014-11-20 15:04 GMT+01:00 Saurabh Agrawal <sa...@markit.com>:

>
>
> Friends,
>
>
>
> I am pretty new to Spark as much as to Scala, MLib and the entire Hadoop
> stack!! It would be so much help if I could be pointed to some good books
> on Spark and MLib?
>
>
>
> Further, does MLib support any algorithms for B2B cross sell/ upsell or
> customer retention (out of the box preferably) that I could run on my Sales
> force data? I am currently using Collaborative filtering but that’s
> essentially B2C.
>
>
>
> Thanks in advance!!
>
>
>
> Regards,
>
> Saurabh Agrawal
>
> ------------------------------
> This e-mail, including accompanying communications and attachments, is
> strictly confidential and only for the intended recipient. Any retention,
> use or disclosure not expressly authorised by Markit is prohibited. This
> email is subject to all waivers and other terms at the following link:
> http://www.markit.com/en/about/legal/email-disclaimer.page
>
> Please visit http://www.markit.com/en/about/contact/contact-us.page? for
> contact information on our offices worldwide.
>
> MarkitSERV Limited has its registered office located at Level 4, Ropemaker
> Place, 25 Ropemaker Street, London, EC2Y 9LY and is authorized and
> regulated by the Financial Conduct Authority with registration number 207294
>

Re: Please help me get started on Apache Spark

Posted by Darin McBeath <dd...@yahoo.com.INVALID>.
Take a look at the O'Reilly Learning Spark (Early Release) book.  I've found this very useful.
Darin.
      From: Saurabh Agrawal <sa...@markit.com>
 To: "user@spark.apache.org" <us...@spark.apache.org> 
 Sent: Thursday, November 20, 2014 9:04 AM
 Subject: Please help me get started on Apache Spark
   
 <!--#yiv9027708365 _filtered #yiv9027708365 {font-family:Calibri;panose-1:2 15 5 2 2 2 4 3 2 4;}#yiv9027708365 #yiv9027708365 p.yiv9027708365MsoNormal, #yiv9027708365 li.yiv9027708365MsoNormal, #yiv9027708365 div.yiv9027708365MsoNormal {margin:0in;margin-bottom:.0001pt;font-size:11.0pt;font-family:"Calibri", "sans-serif";}#yiv9027708365 a:link, #yiv9027708365 span.yiv9027708365MsoHyperlink {color:blue;text-decoration:underline;}#yiv9027708365 a:visited, #yiv9027708365 span.yiv9027708365MsoHyperlinkFollowed {color:purple;text-decoration:underline;}#yiv9027708365 p.yiv9027708365MsoListParagraph, #yiv9027708365 li.yiv9027708365MsoListParagraph, #yiv9027708365 div.yiv9027708365MsoListParagraph {margin-top:0in;margin-right:0in;margin-bottom:0in;margin-left:.5in;margin-bottom:.0001pt;font-size:11.0pt;font-family:"Calibri", "sans-serif";}#yiv9027708365 span.yiv9027708365EmailStyle17 {font-family:"Calibri", "sans-serif";color:windowtext;}#yiv9027708365 .yiv9027708365MsoChpDefault {font-family:"Calibri", "sans-serif";} _filtered #yiv9027708365 {margin:1.0in 1.0in 1.0in 1.0in;}#yiv9027708365 div.yiv9027708365WordSection1 {}#yiv9027708365 _filtered #yiv9027708365 {} _filtered #yiv9027708365 {} _filtered #yiv9027708365 {} _filtered #yiv9027708365 {} _filtered #yiv9027708365 {} _filtered #yiv9027708365 {} _filtered #yiv9027708365 {} _filtered #yiv9027708365 {} _filtered #yiv9027708365 {} _filtered #yiv9027708365 {}#yiv9027708365 ol {margin-bottom:0in;}#yiv9027708365 ul {margin-bottom:0in;}-->   Friends,     I am pretty new to Spark as much as to Scala, MLib and the entire Hadoop stack!! It would be so much help if I could be pointed to some good books on Spark and MLib?    Further, does MLib support any algorithms for B2B cross sell/ upsell or customer retention (out of the box preferably) that I could run on my Sales force data? I am currently using Collaborative filtering but that’s essentially B2C.    Thanks in advance!!    Regards, Saurabh Agrawal 
This e-mail, including accompanying communications and attachments, is strictly confidential and only for the intended recipient. Any retention, use or disclosure not expressly authorised by Markit is prohibited. This email is subject to all waivers and other terms at the following link: http://www.markit.com/en/about/legal/email-disclaimer.page

Please visit http://www.markit.com/en/about/contact/contact-us.page? for contact information on our offices worldwide.

MarkitSERV Limited has its registered office located at Level 4, Ropemaker Place, 25 Ropemaker Street, London, EC2Y 9LY and is authorized and regulated by the Financial Conduct Authority with registration number 207294