Building Distributed Pipelines for Data Science using Kafka, Spark, and Cassandra

by Roberto Zicari · February 11, 2016

Building Distributed Pipelines for Data Science using Kafka, Spark, and Cassandra
March 1-3, 2016 | 9:00AM – 11:00AM PST
Building a distributed pipeline is a huge–and complex–undertaking. If you want to ensure that yours is scalable, has fast in-memory processing, can handle real-time or streaming data feeds and ad-hoc queries, allocates resources efficiently, and is designed for flexibility–join Andy Petrella and Xavier Tordoir for this immensely practical hands-on course.

Link: http://www.oreilly.com/pub/cpc/5854

Building Distributed Pipelines for Data Science using Kafka, Spark, and Cassandra

You may also like...

Resources

Search

News

Events

Archives

Sponsored By

InterSystems

MySQL/Oracle

SingleStore

Supporters

McObject

Persistent Systems

Raima

Scality

TIAA

Undo

Volt Active Data