The Spark Python API (PySpark)
The Spark Python API (PySpark) exposes the Spark programming model to Python. To learn the basics of Spark, we recommend reading through the Scala programming guide first; it should be easy to follow even if you...
Operational Database Management Systems
The Spark Python API (PySpark) exposes the Spark programming model to Python. To learn the basics of Spark, we recommend reading through the Scala programming guide first; it should be easy to follow even if you...
Some interesting info graphics on Machine Learning by Alan Morrison and Anand Rao, PwC US For the connected consumer, machine learning is now a key enabler, from on-demand translation services, to weather forecasting, to guessing what users want based...
Weka 3: Data Mining Software in Java Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own...
BY Sean Kandel, Andreas Paepcke, Joseph M. Hellerstein, and Jeffrey Heer Sean Kandel is with Stanford University, e-mail: skandel@cs.stanford.edu. Andreas Paepcke is with Stanford University, e-mail:paepcke@cs.stanford.edu. Joseph M. Hellerstein is with UC Berkeley, e-mail:hellerstein@cs.berkeley.edu....
By Xindong Wu · Vipin Kumar · J. Ross Quinlan · Joydeep Ghosh · Qiang Yang · Hiroshi Motoda · Geoffrey J. McLachlan · Angus Ng · Bing Liu · Philip S. Yu ·...
BY Pedro Domingos Department of Computer Science and Engineering University of Washington Seattle, WA 98195-2350, U.S.A. pedrod@cs.washington.edu ABSTRACT Machine learning algorithms can figure out how to perform important tasks by generalizing from examples. This...
Data Science Tools: Visualization ggplot2 (for R) ggplot2 is a plotting system for R, based on the grammar of graphics, which tries to take the good parts of base and lattice graphics and none...
(Text) Annotation Tools for Convenience BRAT– brat is a web-based tool for text annotation; that is, for adding notes to existing text documents. brat is designed in particular for structured annotation, where the notes...
By Vassilis Plachouras , Thomson Reuters Research & Development 1 Mark Square London, EC2A 4EG, UK Jochen L. Leidner , Thomson Reuters Research & Development 1 Mark Square London, EC2A 4EG, UK Andrew G....
Panel “Ethical issues of AI and New Technologies”, Published on Mar 23, 2017 CeBIT Global Conferences – 23 March 2017: Panel “Ethical issues of AI and New Technologies” / Dr. Michal Kosinski, Graduate School...