Open Source projects within Artificial Intelligence and the Data Space.
The LF AI & Data Foundation supports open source projects within artificial intelligence and the data space.
1chipML
1chipML is an open source library for basic numerical crunching and machine learning for microcontrollers.
Acumos AI
Acumos AI is a platform and open source framework that makes it easy to build, share, and deploy AI apps.
Adlik
Adlik is a toolkit for accelerating deep learning inference. Its goal is to accelerate the deep learning inference process in both cloud and embedded environments.
Adversarial Robustness Toolbox
Adversarial Robustness Toolbox (ART) provides tools that enable developers and researchers to evaluate, defend, certify, and verify machine learning models and applications against adversarial threats.
AI Explainability 360
AI Explainability 360 is an open source toolkit that can help users better understand the ways that machine learning models predict labels using a wide variety of techniques throughout the AI application lifecycle.
AI Fairness 360
AI Fairness 360 is an extensible open source toolkit that can help users understand and mitigate bias in machine learning models throughout the AI application lifecycle.
Amundsen
Amundsen is a data discovery and metadata engine for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Angel ML
The Angel Project is a high-performance distributed machine learning platform based on Parameter Server, running on YARN and Apache Spark.
Artigraph
Artigraph is a tool to improve the authorship, management, and quality of data.
BeyondML
BeyondML is a framework for developing sparse neural networks that can perform multiple tasks across multiple data domains.
BI & AI
The goal of this committee is to integrate the power of AI and BI into CI (Cognitive Intelligence) by combining the speed that machines provide (AI) with the direction intuited by human insight (BI).
CLAIMED
CLAIMED (Component Library for AI, Machine Learning, ETL and Data Science) is a runtime and programming language agnostic Data & AI component framework.
DataOps Committee
The DataOps Committee in LF AI & Data is a global group of participants from various geographies focused on DataOps.
DataPractices
DataPractices is a “Manifesto for Data Practices,” composed of values and principles that illustrate the most effective, modern, and ethical approach to data teamwork.
Datashim
Datashim enables and accelerates data access for Kubernetes/OpenShift workloads in a transparent and declarative way.
Delta
DELTA is a deep-learning-based, end-to-end natural language and speech processing platform.
DocArray
DocArray is a library for nested, unstructured, multimodal data in transit.
Egeria
Egeria is the world’s first open source metadata standard. It provides open APIs, event formats, types, and integration logic so organizations can share data management and governance across the entire enterprise without reformatting or restricting the data to a single format, platform, or vendor product.
Egeria Conformance
To ensure both consistency and alignment with the standards driven by Egeria, the Egeria Conformance program is available for vendors to showcase how they are shipping Egeria as part of their offering.
Elastic Deep Learning
EDL is an Elastic Deep Learning framework designed to help deep learning cloud service providers build cluster cloud services using deep learning frameworks such as PaddlePaddle and TensorFlow.
Elyra
Elyra is an open-source, low-code/no-code framework for creating reproducible, scalable, and component-based data science pipelines.
FATE
FATE (Federated AI Technology Enabler) is the world’s first industrial-grade open source federated learning framework, enabling enterprises and institutions to collaborate on data while protecting data security and privacy.
Feast
Feast is an open source feature store for machine learning. It was developed as a collaboration between Gojek and Google in 2018.
Feathr
Feathr is an enterprise-grade, high-performance feature store.
FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use, and extensible toolkit for large-scale models.
Flyte
Flyte is a production-grade, declarative, structured, and highly scalable cloud-native workflow orchestration platform.
ForestFlow
ForestFlow is a scalable, policy-based, cloud-native machine learning model server.
Horovod
Horovod makes it easy to take a single-GPU TensorFlow program and successfully train it on many GPUs faster. Horovod also achieved significantly improved GPU resource usage figures.
Kompute
Kompute is a general-purpose GPU compute framework for cross-vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). It is blazing fast, mobile-enabled, asynchronous, and optimized for advanced GPU data processing use cases.
KServe
KServe provides a Kubernetes Custom Resource Definition for serving machine learning (ML) models on arbitrary frameworks.
Ludwig
Ludwig is an open-source, declarative machine learning framework that makes it easy to define deep learning pipelines with a simple and flexible data-driven configuration system.
Machine Learning eXchange
Machine Learning eXchange (MLX) is a Data and AI Assets Catalog and Execution Engine.
Marquez
Marquez is an open source metadata service for the collection, aggregation, and visualization of a data ecosystem’s metadata.
Milvus
Milvus is an open-source vector database that is highly flexible, reliable, and blazing fast.