Data Analysis on Coursera
This class was an 8-week introduction to data analysis, ranging from very basic concepts, such as what types of data analysis questions you can ask, to how to get data and do a basic...
Operational Database Management Systems
Provides an intensive introduction to applied statistics and data analysis. Trains students to become data scientists capable of both applied data analysis and critical evaluation of the next generation of statistical methods. Since both...
Trend Report 2016-2017
Can mixed media make us more empathic? What is the future of work in the 21st century? And how far should we go with human enhancement? These questions (and more) form...
Deep Learning Pipelines is a new library published by Databricks to provide high-level APIs for scalable deep learning model application and transfer learning via integration of popular deep learning libraries with MLlib Pipelines and...
TCIA Collections
The image data in The Cancer Imaging Archive (TCIA) is organized into purpose-built collections of subjects. The subjects typically have a cancer type and/or anatomical site (lung, brain, etc.) in common. Each...
Open-source project to provide scalable machine learning (Chu et al., 2006; Owen et al., 2012). Written in Java for the Apache Hadoop MapReduce platform. Some supported ML methods: Supervised: NB, HMM, SVM, Logist. Reg.,...
This is a project started at Yahoo! Research and continuing at Microsoft Research to design a fast, scalable, useful learning algorithm. VW is the essence of speed in machine learning, able to learn from terafeature datasets with ease....
By Sajawel Ahmed. Master Thesis, Goethe University Frankfurt, Fraunhofer IAIS, in cooperation with PricewaterhouseCoopers AG WPG. Supervisors: Prof. Dott.-Ing. Roberto V. Zicari, Frankfurt Big Data Laboratory; Dr. Joerg Kindermann, Knowledge Discovery Group. March 2017. Abstract: Since...
Open-source library for Python, written in Python. Provides Natural Language Processing functionality to Python (Loper and Bird, 2002; Bird, 2006; Bird, Klein and Loper, 2009). Available from http://nltk.org. Useful for teaching and...
The Spark Python API (PySpark) exposes the Spark programming model to Python. To learn the basics of Spark, we recommend reading through the Scala programming guide first; it should be easy to follow even if you...
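As a rough illustration of the programming style the blurb above refers to, here is a minimal word-count sketch in plain Python. It does not use PySpark itself: the helpers `flat_map`, `reduce_by_key`, and `word_count` are hypothetical names chosen for this example, and they only imitate, on an in-memory list, the shape of the `flatMap`, `map`, and `reduceByKey` operations that Spark applies to distributed datasets.

```python
from collections import defaultdict
from functools import reduce


def flat_map(func, items):
    """Apply func to each item and flatten the resulting lists into one list."""
    return [out for item in items for out in func(item)]


def reduce_by_key(func, pairs):
    """Group (key, value) pairs by key, then fold each group's values with func."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return {key: reduce(func, values) for key, values in groups.items()}


def word_count(lines):
    """Count word occurrences using a flat-map / map / reduce-by-key pipeline."""
    # Split every line into lowercase words (the "flatMap" step).
    words = flat_map(lambda line: line.lower().split(), lines)
    # Pair each word with an initial count of 1 (the "map" step).
    pairs = [(word, 1) for word in words]
    # Sum the counts for each distinct word (the "reduceByKey" step).
    return reduce_by_key(lambda a, b: a + b, pairs)


if __name__ == "__main__":
    sample = ["spark exposes a python api", "the python api is called pyspark"]
    print(word_count(sample))
```

In Spark the same three steps would run in parallel across partitions of a dataset rather than over a local list; this sketch is only meant to show how the pipeline is composed.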