New Continuous Learning Framework and Enhanced Spark Integration Can Power Real-Time Learning for Digital Transformation and Omnichannel Customer Experience Initiatives
GridGain Continuous Learning Framework
GridGain Professional Edition 2.4 now includes the first fully supported release of the Apache Ignite integrated machine learning and multilayer perceptron features, making continuous learning using machine learning and deep learning available directly in GridGain. By optimizing these libraries for massively parallel processing (MPP) against the data residing in the GridGain cluster, large-scale machine learning use cases can be greatly accelerated. Processing data directly in the GridGain cluster enables a continuous learning workflow by eliminating the need to move transactional data into a separate database before model training. The result is real-time model training or even continuous model training with less complexity and substantially lower cost than traditional approaches.
The new GridGain Continuous Learning Framework is a building block for in-process HTAP (hybrid transactional/analytical processing) applications in which a data model is continually trained based on incoming data. In-process HTAP offers next-generation applications the ability to react to and benefit from real-time model training, which can power better real-time decision making in a wide range of business applications, such as fraud prevention, ecommerce recommendation engines, credit approvals, logistics, and transportation system maintenance decisions.
Expanded Support for Spark DataFrames
GridGain can now be used to store and manage Spark DataFrames. DataFrame support expands what was already the broadest support for Spark by any in-memory computing platform. GridGain continues to include the GridGain RDD API for accessing data in GridGain as mutable Spark RDDs, as well as the Ignite File System (IGFS) for using GridGain as an in-memory implementation of the Hadoop Distributed File System (HDFS).
Spark can be used to process data in GridGain as DataFrames or RDDs and also save DataFrames or RDDs into GridGain for later use. These capabilities allow GridGain to be used as in-memory storage by Spark developers to access, save and share information between Spark jobs. GridGain provides ANSI-99 SQL support, including data indexing, so Apache Spark can leverage GridGain’s distributed SQL to improve ad hoc query performance up to 1000x. Spark developers can also leverage the GridGain Continuous Learning Framework to automate decisions and continually update models to improve outcomes in real-time.
“Companies wanting to automate more intelligent decision making need to harness the two sides of the digital brain – machine learning and decision automation – to continuously work together,” saidAbe Kleinfeld, President and CEO of GridGain Systems. “With this latest release, GridGain makes it possible to continuously train machine learning models in real-time on massive data sets at in-memory speed and scale, and with lower complexity and cost. This is the first step towards enabling in-process HTAP applications to drive continuous-learning applications that can power digital transformation and omnichannel customer experience initiatives.”
About GridGain® Systems
GridGain Systems is revolutionizing real-time data access and processing by offering an in-memory computing platform built on Apache® Ignite™. GridGain solutions are used by global enterprises in financial, software, e-commerce, retail, online business services, healthcare, telecom and other major sectors, with a client list that includes Barclays, ING, Sberbank, Finastra, IHS Markit, Workday, and Huawei. GridGain delivers unprecedented speed and massive scalability to both legacy and greenfield applications. Deployed on a distributed cluster of commodity servers, GridGain software can reside between the application and data layers (RDBMS, NoSQL and Apache® Hadoop®), requiring no rip-and-replace of the existing databases, or it can be deployed as an in-memory transactional SQL database. GridGain is the most comprehensive in-memory computing platform for high-volume ACID transactions, real-time analytics, web-scale applications, continuous learning and HTAP. For more information, visit gridgain.com.
# # #
CONTACT: Terry Erisman
GridGain is a trademark or registered trademark of GridGain Systems, Inc. Apache, Apache Hadoop, Hadoop, Apache Ignite, Ignite, Apache Spark, and Spark, are trademarks of The Apache Software Foundation. All other product and company names herein may be trademarks of their registered owners.