Perspective: streaming data visualization engine
Perspective: Perspective is a streaming data visualization engine in JavaScript for building real-time & user-configurable analytics entirely in the browser.
Quilt Data: Quilt is a data package manager designed to make managing data as easy as managing code. It supports the Parquet format via pyarrow for data access.
Ray: A flexible, high-performance distributed execution framework with a focus on machine learning and AI applications. Uses Arrow to efficiently store Python data structures containing large arrays of numerical data. Data can be accessed...
Red Data Tools: A project that provides data processing tools for Ruby. It provides Red Arrow, a set of Ruby bindings for Apache Arrow based on Apache Arrow GLib. Red Arrow is a core...
Turbodbc: Python module to access relational databases via the Open Database Connectivity (ODBC) interface. In addition to implementing the Python Database API Specification 2.0, it can return Arrow Tables and RecordBatches.
pandas: data analysis toolkit for Python programmers. pandas supports reading and writing Parquet files using pyarrow. Several pandas core developers are also contributors to Apache Arrow.
MapD: in-memory columnar SQL engine designed to run on GPUs. MapD supports Arrow for data ingest and data interchange via CUDA IPC handles. This work is part of the GPU Open Analytics Initiative.
Fletcher: Fletcher is an FPGA acceleration framework that can convert an Arrow schema into an easy-to-use hardware interface. The accelerator can request data from Arrow tables by supplying row indices. In turn, the interface...
Dremio: A self-service data platform. Dremio makes it easy for users to discover, curate, accelerate, and share data from any source. It includes a distributed SQL execution engine based on Apache Arrow. Dremio reads...
Dask: Python library for parallel and distributed execution of dynamic task graphs. Dask supports using pyarrow for accessing Parquet files.