Category: Big Data, Analytical Data Platforms, AI and Data Science
Analyzing Big Data With Twitter
A special UC Berkeley iSchool course. Link to course material, including video of the lectures
How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time
by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. Abstract: ScaleOut’s latest whitepaper presents the problem of ‘real-time’ data analysis that enterprises are facing and demonstrates how In-Memory Data Grids (IMDGs) are...
S4
Stream processing: S4 is a general-purpose, distributed, scalable, fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continuous unbounded streams of data.
List of companies providing products that include Apache Hadoop
List of companies providing products that include Apache Hadoop, a derivative work thereof, commercial support, and/or tools and utilities related to Hadoop.
Apache Kafka – distributed publish-subscribe messaging system
Distributed system: Apache Kafka is a distributed publish-subscribe messaging system. It is designed to support the following: – Persistent messaging with O(1) disk structures that provide constant time...
Drill
Distributed system: Apache Drill is a distributed system for interactive analysis of large-scale datasets. Drill is similar to Google’s Dremel, with the additional flexibility needed to support a broader range of query languages,...
Storm
Stream processing: Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing.
Record Setting Hadoop in the Cloud
M.C. Srivas, CTO, MapR Technologies. Abstract: When MapR was invited to provide Hadoop on Google Compute Engine, we ran a lot of mini tests on the virtualized hardware to figure out how to tune our...
Big Data: Challenges and Opportunities
Roberto V. Zicari. Abstract: In this presentation I review three current aspects related to Big Data: 1. The business perspective, 2. The technology perspective, and 3. Big Data for social good. Presentation (89 pages)...