Category: Big Data, Analytical Data Platforms, Data Science – Free Software
Apache Parquet: A columnar storage format available to any project in the Hadoop ecosystem, regardless of the choice of data processing framework, data model or programming language. The C++ and Java implementation provide vectorized...
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides...
MarketStore is a database server optimized for financial timeseries data. You can think of it as an extensible DataFrame service that is accessible from anywhere in your system, at higher scalability. It is designed...
Detectron is Facebook AI Research’s software system that implements state-of-the-art object detection algorithms, including Mask R-CNN. It is written in Python and powered by the Caffe2 deep learning framework. At FAIR, Detectron has enabled...
Comes with prebuilt models for Fetal Brain Segmentation from MRI, and segmentation on cardiovascular magnetic resonance images. GitHub: https://github.com/DLTK/models Referencing and citing methods in the Model Zoo To find out how to reference each implementation,...
Early Access Program MemSQL 6.0 RC introduces enhancements to manageability and resiliency, as well as exception handling support for extensibility. It also includes the enhancements to query processing and extensibility that were introduced in...
Project Jupyter is an open source project was born out of the IPython Project in 2014 as it evolved to support interactive data science and scientific computing across all programming languages. Jupyter will always be 100%...
Dr. Manuel Rivas at the Stanford Medical School Department of Biomedical Data Science has launched a web-based engine for exploring association results for large-scale genotype-phenotype association studies starting with the data from the UK Biobank and...
Available on Databricks Runtime 3.0 by Michael Armbrust Originally posted in ENGINEERING BLOG , July 11, 2017 Today we are happy to announce the availability of Apache Spark 2.2.0 on Databricks as part of the Databricks Runtime...
Timesketch is an open source tool for collaborative forensic timeline analysis. Using sketches you and your collaborators can easily organize your timelines and analyze them all at the same time. Add meaning to your...