Category: Big Data, Analytical Data Platforms, AI and Data Science

Scalable Parallelization of Expensive Continuous Queries over Massive Data Streams

Author: Dr. Erik Zeitler Language: English Affiliation: Uppsala University, Sweden Abstract:For applications that require execution of non-trivial Continuous Queries (CQs) over data streams of high rate, the execution of the CQs must be parallelized....

ParStream- Turning Data Into Knowledge

Abstract: ParStream offers a new approach to high-performance data analysis. It addresses the problems arising from rapidly increasing data volumes in modern business, as well as scientific application scenarios. Close-to-real-time analysis is obtained through...

Dremel: Interactive Analysis of Web-Scale Datasets

Sergey Melnik, Andrey Gubarev, JingJing Long, Geoffrey Romer, Shiva Shivakumar, Matt Tolton, Theo Vassilakis
Google, Inc. ABSTRACT: Dremelis a scalable, interactive ad-hoc query system for analysis of read-only nested data. Proceedings of the VLDB Endowment,...

Rethinking Data Analysis and Reporting

Joshua Greenbaum Unlike relational database management systems, which use a records-based storage approach, or column-oriented databases which use a column-based storage method, a correlation database uses a value-based storage (VBS) architecture in which all...