Database Q&A With MemSQL CEO, Eric Frenkiel
We started MemSQL with a vision to enable every company to be a real-time enterprise, and to do that by delivering the leading real-time database for transactions and analytics. Since then, the forces shaping our digital economy have only added wind to our sails. The world is more mobile, interconnected, interactive, and on the cusp of several industry transformations such as the Internet of Things.
Our latest release, MemSQL 4, represents a leap forward in relational database innovation with new capabilities such as real-time geospatial intelligence and high-throughput connectivity to Apache Spark. It also includes a free forever, unlimited capacity Community Edition, enhancements to the Optimizer for distributed joins, and a new version of the MemSQL Ops management framework.
As Big Data abounds, understanding every company and product can be tricky. To make easier, here are a few questions and answers about MemSQL.
The database landscape is big. Where does MemSQL fit?
MemSQL is the leading database for real-time transactions and analytics. That means we’re operational by nature, much more akin to SQL Server, Oracle, or SAP HANA than Hadoop. MemSQL is also:
• Providing the utmost performance for today’s demanding workloads
• Enabling cost-effective, horizontal scale-out on-premises or in the cloud
• Relational and multi-model
• Allowing companies to use in-house SQL tools, applications, and knowledge
• With JSON and Geospatial data formats supported
• Designed to run on commodity hardware for costs savings
You talk about transactions and analytics. Can you explain in more detail?
To meet real-time demands, companies must be able to capture information across millions to hundreds of millions of sensors or mobile applications. They also want to analyze that data up to the last transaction. So with real-time operations you don’t have the luxury or the pain of ETL. You need to bypass ETL by transacting and analyzing in a single database designed to support these concurrent workloads. It boils down to analytics on changing datasets, today’s critical capability.
MemSQL 4 includes the MemSQL Spark Connector and the MemSQL Loader for HDFS and S3. How should we think about MemSQL with Spark and Hadoop?
MemSQL and Spark work well together as they both have memory-optimized, distributed frameworks. Spark is a processing framework that enables real-time transformation and advanced analytics, but Spark itself does not have a storage or persistence ability. By storing data permanently in MemSQL, customers get an easy way to build operational applications, and the ability to take operational data and ‘round-trip’ it to Spark for advanced analytics.
You have always focused on SQL. All the while NoSQL has received a lot of discussion. Where is MemSQL in this?
SQL is the lingua franca of the data processing world, having been well adopted over its decades-long history. Companies can build SQL applications quickly, and then immediately use in house analytics tools and practices to derive insights.
Prevailing wisdom used to be that SQL could not scale. That is untrue and companies are now discovering the powerful combination MemSQL delivers of a relational, in-memory, distributed database with speed, scale, and simplicity.
The future is multi-model and MemSQL also includes JSON and Geospatial datatypes to fulfill this promise.
What is the difference between Community and Enterprise Editions?
Community Edition is a free unlimited capacity, unlimited scale offering of MemSQL that includes all transactional and analytical features.
Enterprise Edition comes with direct support from MemSQL, plus production features such as high availability, cross data center replication replication, granular user permissions, and SSL. More information can be found on at memsql.com/download/
The Community Edition is freely available but not open-source. What strategy is MemSQL pursuing?
Our strategy is to build a scalable software business. This means running on all the platforms customers want to use, including public and private clouds, in the way they want to use them. Community Edition is a way to allow more people to use MemSQL without limitations on time or capacity.
There is a wide spectrum of commercial database offerings. On on extreme there are proprietary hardware / software combinations. Our belief is that proprietary hardware is a down elevator.
On the other end of the spectrum there are open-source projects with businesses built around them. But that is essentially selling consulting hours, not software. There are many successful businesses in the middle of the spectrum, such as MySQL’s dual commercial / open-source licensing, and we maintain open-source projects, including MemSQL Loader for Hadoop and S3, a large number of general-purpose Python packages, and the MemSQL Spark Connector.
At MemSQL, we want to offer performance that trumps legacy vendors, deliver it a fraction of the cost, and provide complete deployment choice across hardware, data centers, and clouds. In doing so we hope to help more companies achieve their goals to become real-time enterprises.