INTRODUCTION TO DATA SCIENCE
INTRODUCTION TO DATA SCIENCE © 2012, 2013 By Jeffrey Stanton, Portions © 2013, By Robert De Graaf This book is distributed under the Creative Commons Attribution- NonCommercial-ShareAlike 3.0 license. You are free to...
Operational Database Management Systems
INTRODUCTION TO DATA SCIENCE © 2012, 2013 By Jeffrey Stanton, Portions © 2013, By Robert De Graaf This book is distributed under the Creative Commons Attribution- NonCommercial-ShareAlike 3.0 license. You are free to...
Traversing Trillions of Edges in Real-time: Graph Exploration on Large-scale Parallel Machines Fabio Checconi, Fabrizio Petrini High Performance Analytics Department IBM TJ Watson, Yorktown Heights, NY 10598 Email: {fchecco,fpetrin}@us.ibm.com Abstract—The world of Big Data...
Scalable Single Source Shortest Path Algorithms for Massively Parallel Systems Venkatesan T. Chakaravarthy∗, Fabio Checconi†, Fabrizio Petrini†, Yogish Sabharwal∗ ∗ IBM Research – India, New Delhi. {vechakra,ysabharwal}@in.ibm.com † IBM T J Watson Research Center,...
Doug Terry is a Distinguished Research Engineer at Samsung Research America. Previously he was Principal Researcher in the Microsoft Research Silicon Valley Lab. His research focuses on the design and implementation of distributed systems with...
Survey of Apache Big Data Stack Supun Kamburugamuve For the PhD Qualifying Exam 12/16/2013 Advisory Committee Prof. Geoffrey Fox Prof. David Leake Prof. Judy Qiu 1. Introduction Over the last decade there has being...
BigDataBench As a multi-discipline research effort, BigDataBench is an open-source big data benchmark suite. The current version is BigDataBench 3.0. It includes 6 real-world and 2 synthetic data sets, and 32 big data workloads, covering micro...
Fabrizio Petrini is a senior researcher and manager of the High Performance Analytics Department of the IBM TJ Watson Research Laboratory. His research interests include various aspects of multi-core processors and supercomputers, including high-performance...
PoliTwi: Early Detection of Emerging Political Topics on Twitter and the Impact on Concept-Level Sentiment Analysis Sven Rill, Dirk Reinela, Jörg Scheidt, Institute of Information Systems, University of Applied Sciences Hof, Alfons-Goppel-Platz 1, Hof, Germany...
Joseph M. Hellerstein is a Chancellor’s Professor of Computer Science at UC Berkeley, and the co-founder and CEO of Trifacta. Hellerstein’s work is in the broad area of data-centric systems and the way...
MapReduce-MPI Library MapReduce-MPI (MR-MPI) library is an open-source implementation of MapReduce written for distributed-memory parallel machines on top of standard MPI message passing. The MR-MPI library was developed at Sandia National Laboratories, a US...