Apache Mahout
Open-source project to provide scalable machine learning (Chu et al., 2006; Owen et al., 2012) Written in Java for the Apache Hadoop MapReduce platform Some supported ML methods: Supervised: NB, HMM, SVM, Logist. Reg.,...
Operational Database Management Systems
Open-source project to provide scalable machine learning (Chu et al., 2006; Owen et al., 2012) Written in Java for the Apache Hadoop MapReduce platform Some supported ML methods: Supervised: NB, HMM, SVM, Logist. Reg.,...
This is a project started at Yahoo! Research and continuing at Microsoft Research to design a fast, scalable, useful learning algorithm. VW is the essence of speed in machine learning, able to learn from terafeature datasets with ease....
BY Sajawel Ahmed Master Thesis Goethe University Frankfurt, Fraunhofer IAIS in cooperation with PricewaterhouseCoopers AG WPG Supervisors: Prof. Dott.-Ing. Roberto V. Zicari, Frankfurt Big Data Laboratory Dr. Joerg Kindermann, Knowledge Discovery Group March 2017 Abstract Since...
Open source library for Python and in Python Provides Natural Language Processing functionality to Python Loper and Bird (2002); Bird (2006); Bird, Klein and Loper (2009) Available from http://nltk.org Useful for teaching and...
The Spark Python API (PySpark) exposes the Spark programming model to Python. To learn the basics of Spark, we recommend reading through the Scala programming guide first; it should be easy to follow even if you...
Some interesting info graphics on Machine Learning by Alan Morrison and Anand Rao, PwC US For the connected consumer, machine learning is now a key enabler, from on-demand translation services, to weather forecasting, to guessing what users want based...
Weka 3: Data Mining Software in Java Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own...
BY Sean Kandel, Andreas Paepcke, Joseph M. Hellerstein, and Jeffrey Heer Sean Kandel is with Stanford University, e-mail: skandel@cs.stanford.edu. Andreas Paepcke is with Stanford University, e-mail:paepcke@cs.stanford.edu. Joseph M. Hellerstein is with UC Berkeley, e-mail:hellerstein@cs.berkeley.edu....
By Xindong Wu · Vipin Kumar · J. Ross Quinlan · Joydeep Ghosh · Qiang Yang · Hiroshi Motoda · Geoffrey J. McLachlan · Angus Ng · Bing Liu · Philip S. Yu ·...
BY Pedro Domingos Department of Computer Science and Engineering University of Washington Seattle, WA 98195-2350, U.S.A. pedrod@cs.washington.edu ABSTRACT Machine learning algorithms can figure out how to perform important tasks by generalizing from examples. This...