Free Data Mining eBooks
Free Data Mining eBooks Data Mining: Concepts and Techniques, Jiawei Han and Micheline Kamber About data mining and data warehousing Mining of Massive Datasets, Jure Leskovec, Anand Rajaraman, Jeff Ullman The focus of this...
Operational Database Management Systems
Free Data Mining eBooks Data Mining: Concepts and Techniques, Jiawei Han and Micheline Kamber About data mining and data warehousing Mining of Massive Datasets, Jure Leskovec, Anand Rajaraman, Jeff Ullman The focus of this...
Apache Giraph is an iterative graph processing system built for high scalability. For example, it is currently used at Facebook to analyze the social graph formed by users and their connections. Giraph originated as the...
Interaction for Visualization Morgan & Claypool Publishers Synthesis Lectures on Visualization June 2015, 107 pages, (doi:10.2200/S00651ED1V01Y201506VIS003) Christian Tominski, University of Rostock Abstract Visualization has become a valuable means for data exploration and analysis. Interactive...
Data Mining and Analysis Fundamental Concepts and Algorithms TEXTBOOK AUTHORS: Mohammed J. Zaki, Rensselaer Polytechnic Institute, New York Wagner Meira, Jr, Universidade Federal de Minas Gerais, Brazil DATE PUBLISHED: July 2014 AVAILABILITY: In stock FORMAT: Hardback ISBN: 9780521766333 Cambridge University...
Intro to HBase via R: A Tutorial BY Aaron Benz , sync up . 04/28/2015 Part I: Intro to HBase Welcome to a brief introduction to HBase by way of R. This tutorial aims to...
Using Pandas easily with Cassandra BY Aaron Benz, Charlie Hack Spring 2015 available in Github. Pandas interface for Cassandra. What is it? caspanda is a Python module combines Apache Cassandra with Python’s Pandas module… aka caspanda. Its overall goal is...
A package that allows R developers to use Hadoop HBase BY Aaron Benz aaronbenz/rhbase forked from RevolutionAnalytics/rhbase A package that allows R developers to use Hadoop HBase, developed as part of the RHadoop project....
Data Mining: The Textbook Comprehensive textbook on data mining, 734 pages. Authored by Charu Aggarwal Publisher: Springer, April 2015. Book Description The emergence of data science as a discipline requires the development of...
The value of Apache Kafka in Big Data ecosystem By Jun Rao, co-founder at Confluent Enterprises have been adopting various technologies for Big Data these days. One of the technologies being increasingly adopted is Apache Kafka,...
CREATING EFFECTIVE RISK MODELS USING MACHINE INTELLIGENCE Ayasdi White Paper, March 2015  Why Effective Risk Models are Critical There is a growing need for financial institutions to have sound models in place that...