Scala & Spark Links

Useful Links for learning Scala

  1. Scala overview from Stack Overflow
  2. Programming in Scala by Martin Odersky (a cool textbook for Scala)
  3. Scala School (reads like a cheat sheet)

About
Scala school started as a series of lectures at Twitter to prepare experienced engineers to be productive Scala programmers. Scala is a relatively new language, but draws on many familiar concepts. Thus, these lectures assumed the audience knew the concepts and showed how to use them in Scala. We found this an effective way of getting new engineers up to speed quickly. This is the written material that accompanied those lectures. We have found that these are useful in their own right.

  4. scala-cheat-sheet

  5. Scala API Doc

  6. Functional Programming Principles in Scala

    Coursera: Learn about functional programming, and how it can be effectively combined with object-oriented programming. Gain practice in writing clean functional code, using the Scala programming language.

    Assignments

  7. Code Snippets
    1. Scala tabulate method - Use on List, Array, Vector, Seq, and more
    2. The groupBy method from Scala’s collection library
    3. Scala reduceLeft examples
    4. Difference between foldLeft and reduceLeft in Scala
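
As a quick taste of what the four snippet links above cover, here is a minimal sketch using only the Scala standard library. The object name `CollectionSnippets` and the sample data are made up for illustration and do not come from the linked articles.

```scala
object CollectionSnippets {
  // tabulate: build a collection by applying a function to each index 0 until n
  val squares: List[Int] = List.tabulate(5)(i => i * i)

  // groupBy: partition elements into a Map keyed by the function's result
  val byLength: Map[Int, List[String]] =
    List("spark", "scala", "rdd", "sbt").groupBy(_.length)

  // reduceLeft: combine elements left to right, seeded with the first element;
  // it throws UnsupportedOperationException on an empty collection
  val total: Int = List(1, 2, 3, 4).reduceLeft(_ + _)

  // foldLeft: takes an explicit start value, so the result type may differ
  // from the element type (here Int elements are folded into a String)
  val joined: String = List(1, 2, 3).foldLeft("")((acc, n) => acc + n)

  // foldLeft is also safe on an empty collection: it returns the start value
  val emptySum: Int = List.empty[Int].foldLeft(0)(_ + _)

  def main(args: Array[String]): Unit = {
    println(squares)
    println(byLength)
    println(total)
    println(joined)
    println(emptySum)
  }
}
```

The last two values show the practical difference between the two methods: `reduceLeft` needs at least one element and returns the element type, while `foldLeft` works from a start value you choose.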

Useful Links for learning Spark

  1. Fast Data Processing with Spark (I don't recommend this book)
  2. Spark Overview
  3. Spark Quick Start (a cool tutorial!)
    install_scala_sbt.sh
  4. Spark Development Guide (0.8.1, Chinese edition): cool, but out of date
  5. Spark Programming Guide
  6. Learning Apache Spark: Developing Spark Applications in Scala (in Chinese)
  7. Writing Standalone Spark Programs
  8. Zaharia M, Chowdhury M, Das T, et al. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. NSDI, 2012.
  9. Sparks E R, Talwalkar A, Smith V, et al. MLI: An API for Distributed Machine Learning. arXiv preprint arXiv:1310.5426, 2013.
  10. Xin R S, Gonzalez J E, Franklin M J, et al. GraphX: A Resilient Distributed Graph System on Spark. GRADES, 2013.

Introduction to MLlib

Unsupervised Learning with Apache Spark

Multinomial Logistic Regression with Apache Spark
