Building Distributed Pipelines for Data Science using Kafka, Spark, and Cassandra

Building Distributed Pipelines for Data Science using Kafka, Spark, and Cassandra
March 1-3, 2016 | 9:00AM – 11:00AM PST
Building a distributed pipeline is a huge–and complex–undertaking. If you want to ensure that yours is scalable, has fast in-memory processing, can handle real-time or streaming data feeds and ad-hoc queries, allocates resources efficiently, and is designed for flexibility–join Andy Petrella and Xavier Tordoir for this immensely practical hands-on course.

Link: http://www.oreilly.com/pub/cpc/5854

You may also like...