MLlib: Scalable Machine Learning on Spark
MLlib: Scalable Machine Learning on Spark
Xiangrui Meng
Collaborators: Ameet Talwalkar, Evan Sparks, Virginia Smith, Xinghao Pan, Shivaram Venkataraman, Matei Zaharia, Rean Griffith, John Duchi, Joseph Gonzalez, Michael Franklin, Michael I. Jordan, Tim Kraska, etc
MLlib is a Spark subproject providing machine learning primitives: initial contribution from AMPLab, UC Berkeley shipped with Spark since version 0.8, 33 contributors
Download Presentation (LINK to .PFF)
Resources
Website: http://spark.apache.org
Tutorials: http://ampcamp.berkeley.edu
Spark Summit: http://spark-summit.org
Github: https://github.com/apache/spark
Mailing lists: user AT spark.apache.org dev AT spark.apache.org