HPE Vertica Community Edition (CE)
Vertica Community Edition (CE) is free for use up to 1 TB of data, with no time limits. Become a Member of myVertica today and gain access to the FREE HPE Vertica Community Edition to fully...
Operational Database Management Systems
Vertica Community Edition (CE) is free for use up to 1 TB of data, with no time limits. Become a Member of myVertica today and gain access to the FREE HPE Vertica Community Edition to fully...
Analyze Your Data With the Free Vertica Community Edition Fully experience the HPE Vertica Analytics Platform at no cost and no time limit! Manage and analyze up to 1 TB of data across three...
Using Pandas easily with Cassandra BY Aaron Benz, Charlie Hack Spring 2015 available in Github. Pandas interface for Cassandra. What is it? caspanda is a Python module combines Apache Cassandra with Python’s Pandas module… aka caspanda. Its overall goal is...
A package that allows R developers to use Hadoop HBase BY Aaron Benz aaronbenz/rhbase forked from RevolutionAnalytics/rhbase A package that allows R developers to use Hadoop HBase, developed as part of the RHadoop project....
Aaron Benz, Data Scientist Accenture, released (made publicly available) a time-series R package. -January 2015 It has a tutorial/walkthrough of why some might use it and what it offers (being able to plot time-series data –...
BIG is an archive format that was designed to store millions of files. The format is quite simple. For each archive, two files are used. One file is where we store all the binary...
Mr.LDA is a package for flexible, scalable, multilingual topic modeling using variational inference in MapReduce. Latent Dirichlet Allocation (LDA) and related topic modeling technique are useful for exploring document collections. Because of the increasing...
BG is a benchmark to evaluate performance of a data store for interactive social networking actions and sessions.These actions and sessions either read or update a very small amount of the entire data set....
PoliTwi: Early Detection of Emerging Political Topics on Twitter and the Impact on Concept-Level Sentiment Analysis PoliTwi is on-line service that detects emerging political topics (Top Topics) in Twitter sooner than other standard information...
BigDataBench As a multi-discipline research effort, BigDataBench is an open-source big data benchmark suite. The current version is BigDataBench 3.0. It includes 6 real-world and 2 synthetic data sets, and 32 big data workloads, covering micro...