S4
Stream processing: S4 is a general-purpose, distributed, scalable, fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continuous unbounded streams of data. LINK
Operational Database Management Systems
Stream processing: S4 is a general-purpose, distributed, scalable, fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continuous unbounded streams of data. LINK
List of companies providing products that include Apache Hadoop, a derivative work thereof, commercial support, and/or tools and utilities related to Hadoop (Link- open new tab). LINK (open new tab)
Distributed system:: Apache Kafka – distributed publish-subscribe messaging system. Apache Kafka is a distributed publish-subscribe messaging system. It is designed to support the following: – Persistent messaging with O(1) disk structures that provide constant time...
Stream processing: Storm is a freeand open source distributed realtime computation system. Storm makesit easy to reliably process unbounded streams of data, doing for realtime processing what Hadoopd is for batch processing. LINK
Distributed system:: Drill— Apache Drill is a distributed system for interactive analysis of large-scale datasets. Drill is similar to Google’s Dremel, with the additional flexibility needed to support a broader range of query languages,...
M.C. Srivas, CTO, MapR Technologies AbstractWhen MapR was invited to provide Hadoop on Google Compute Engine, we ran a lot of mini tests on the virtualized hardware to figure out how to tune our...
Roberto V. Zicari Abstract: In this presentation I review three current aspects related to Big Data: 1. The business perspective, 2. The Technology perspective, and 3. Big Data for social good. Presentation (89 pages)...
Roger Barca, Laura Haas, Alon Halevy, Paul Miller, Roberto V. Zicari. June 5, 2012: Big Data for Good. A distinguished panel of experts discuss how Big Data can be used to create Social Capital....
Cynthia M. Saracco (saracco@us.ibm.com), Senior Software Engineer, IBM AnshulDawra (adawra@us.ibm.com), Senior Software Engineer, IBM Abstract: If you want to work with “big data” without writing code or scripts, you may want to look into...
by Roberto V. Zicari, Editor ODBMS.org. June 5, 2012. Abstract: Every day, 2.5 quintillion bytes of data are created. This data comes from digital pictures, videos, posts to social media sites, intelligent sensors, purchase...