Review of Strata + Hadoop-17-20 Feb. San Jose

Review of Strata + Hadoop-17-20 Feb. San Jose/ News and Talks




San Jose, CA, Feb. 17, 2015 (Strata + Hadoop World) — HP today unveiled HP Haven Predictive Analytics, a new offering that accelerates and operationalizes large-scale machine learning and statistical analysis, and ultimately provides organizations with much deeper insights and understanding into today’s rapidly evolving data volumes.

Powered by HP’s innovative Distributed R offering, the new release dramatically improves performance and enables users to analyze much larger data sets than was previously possible with the popular R statistical programing language. Available now at, the new offering includes the following key components and capabilities:

  • Distributed R – a new high performance analytical engine based on the open source R language developed with HP Labs to address the most demanding, Big Data predictive analytics tasks.
  • Data acceleration and native SQL support with HP Vertica – native integration with the market leading columnar MPP database increases overall data access performance by up to 5X and enables a broader community of developers and DBAs to put predictive analytics into action.
  • Out-of-the-box-algorithms – a comprehensive set of proven, out-of-the-box parallel algorithms that produce accurate and consistent results with mature standard R algorithms.
  • Open Source – the new offering is free and fully compatible with the open source R language and tools and backed by enterprise support from HP and priced per node.

“HP Haven Predictive Analytics provides the scale and performance to Cerner to achieve predictive analytics health care solutions that were not possible before,” said Dr. Doug McNair, M.D., PhD, senior vice president at Cerner Corporation (@Cerner). “The Distributed R technology is vital for Cerner’s discovery activities, which we conduct for Cerner’s health care clients around the world. In health care use cases, the most valuable item sets are rare. Therefore, coverage of the entire corpus of records is often essential to avoid false-negative results and to ensure model stability. HP Haven Predictive Analytics is a strategic enabler for Cerner.”

Predictive Analytics, Built for Big Data

The open source R language is used by millions of data scientists around the globe to interpret, interact with, and visualize data, and has been a powerful tool in tackling predictive modeling tasks such as drug discovery and financial modeling. Unfortunately, due to its inherent design, it has been challenged to process large data sets.

To overcome this limitation, HP Labs (@hplabs) and HP Software (@HPSoftware) developedDistributed R, a revolutionary extension of R, which boosts performance by splitting tasks between multiple processing nodes. The result of this strategic initiative is the industry’s first open source version of a distributed platform for R that is explicitly designed to address today’s demanding Big Data predictive analytic tasks.

Now the global developer community can employ R to scale for billions of records of data – an order of magnitude improvement over traditional R-based performance. HP Haven Predictive Analytics also retains the flexibility and consistency with R and enables data scientists to use their familiar R console and RStudio to work with Distributed R.

“HP Haven Predictive Analytics delivers the industry’s first open, high-performance platform based on R, seamlessly integrated with the HP Haven Big Data Platform,” says Shilpa Lawande (@slawande), GM Platform, HP Software Big Data Business Unit. “Now, organizations can unlock the untapped value of Big Data with scalable predictive analytics to address every use case – from customer acquisition and retention to fraud detection to predictive maintenance and many more.”

Pricing and Availability

HP Haven Predictive Analytics is free open-source software and is backed by award winning HP global enterprise support, which helps organizations realize the full value of their investment in Big Data analytics. This optional support offering is priced per node up to 5 nodes with attractive discount pricing available for larger deployments. More information about HP Haven Predictive Analytics is available at

Additional Information

The new product is available immediately. For information on a recent Distributed R workshop, see:

Join HP Software on Linkedin and follow @HPSoftware and @HPVertica on Twitter.

About HP

HP creates new possibilities for technology to have a meaningful impact on people, businesses, governments and society.  With the broadest technology portfolio spanning printing, personal systems, software, services and IT infrastructure, HP delivers solutions for customers’ most complex challenges in every region of the world.  More information about HP (NYSE: HPQ) is available at

This media advisory contains forward-looking statements that involve risks, uncertainties and assumptions. If such risks or uncertainties materialize or such assumptions prove incorrect, the results of HP and its consolidated subsidiaries could differ materially from those expressed or implied by such forward-looking statements and assumptions. All statements other than statements of historical fact are statements that could be deemed forward-looking statements, including but not limited to statements of the plans, strategies and objectives of management for future operations; any statements concerning expected development, performance, market share or competitive performance relating to products and services; any statements regarding anticipated operational and financial results; any statements of expectation or belief; and any statements of assumptions underlying any of the foregoing. Risks, uncertainties and assumptions include the need to address the many challenges facing HP’s businesses; the competitive pressures faced by HP’s businesses; risks associated with executing HP’s strategy and plans for future operations; the impact of macroeconomic and geopolitical trends and events; the need to manage third-party suppliers and the distribution of HP’s products and services effectively; the protection of HP’s intellectual property assets, including intellectual property licensed from third parties; risks associated with HP’s international operations; the development and transition of new products and services and the enhancement of existing products and services to meet customer needs and respond to emerging technological trends; the execution and performance of contracts by HP and its suppliers, customers, clients and partners; the hiring and retention of key employees; integration and other risks associated with business combination and investment transactions; the execution, timing and results of restructuring plans, including estimates and assumptions related to the cost and the anticipated benefits of implementing those plans; the resolution of pending investigations, claims and disputes; and other risks that are described in HP’s Annual Report on Form 10-K for the fiscal year ended October 31, 2013, and that are otherwise described or updated from time to time in HP’s Securities and Exchange Commission reports. HP assumes no obligation and does not intend to update these forward-looking statements.


© 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein.


Pinterest is experimenting with for real-time data analytics . See the demo at


Breaking News! Expands Solutions for Leading and Platforms


RT : What is HP Haven Predictive Analytics? Learn more about today’s announcement:


Thrilled to announce the addition of General Electric, Toyota Motors Europe, and Roche as customers at .


At San Jose? Check out tomorrow’s session Pro Bono Data Science in Action – Helping Teens in Crisis:



Visiting this week? Find the w/analytics during .


Tamr  will be demonstrating the Tamr Platform at Booth #531 at Strata + Hadoop World from February 17 to 20.

Tamr executives will also be discussing scalable data unification in several presentations, all on February 19:

Solutions Showcase:“Taming Data Variety: Intelligent Solutions Using Machine Learning and Expert Crowdsourcing,Alan Wagner, Field Engineer,  1:45 – 1:55 PM
“The Data Unification Imperative,” Andy Palmer, co-founder and CEO, 2:20 – 3:00 PM, Room 230B
“Tackling Data Curation in Three Generations,” Michael Stonebraker, co-founder and CTO, 4:00 – 4:40 PM, Room 230C

Eric Frenkiel co-founder and CEO of  MemSQL  will be speaking at Strata + Hadoop :

Close Encounters with the Third Kind of Database.

Eric Frenkiel (MemSQL)  9:10am–9:15am Thursday, 02/19/2015

Keynotes, Sponsored
Location: Grand Ballroom 220


Bringing OLAP Fully Online: Analyze Changing Datasets in MemSQL and Spark with Pinterest Demo.

Eric Frenkiel (MemSQL) 10:40am–11:20am Thursday, 02/19/2015
Location: LL20 D


A Simple, Fast Approach to Analytics for Big Data/IoT with kdb+, a High-performance Time-series Database System
2:20pm–3:00pm Thursday, 02/19/2015
Location: LL20 D
One of the first industries to invest heavily in Big Data analytics was financial services, where firms have been pushing the boundaries on speed and scale in dynamically processing large volumes of structured market data for the past twenty years to gain competitive advantage.

As more industries are deploying Big Data initiatives, and adding new software for batch processing to the technology stack, they are looking to other sectors, like the financial industry, for different sorts of tools to use for real-time, or close to real-time, analyses of big structured data.

Kdb+ is a relational, time-series and columnar database with a tightly integrated query language, widely utilized by financial institutions because of its ability to do complex tasks like joins, aggregations and consolidation on billions of streaming, real-time and historical records.

In this talk we will also demonstrate how kdb+ can be used with visualization tools on vast amounts of data. When powered by kdb+, results can unfold as quickly as you can type a question.

This session is sponsored by Kx Systems



Schedule a Demo at Strata + Hadoop World

View a live demo, when are you free? 

Planning on attending Strata this year in San Jose? Schedule an onsite demo with the Couchbase engineering team who will be showing off the Couchbase Hadoop Connector and our brand new Couchbase to Apache Kafka Adapter.

Couchbase will be at Strata + Hadoop World and the conference is only a few weeks away! Let us know which day works best and we’ll set up a time to show you a live demo.

You will see first hand why the world’s largest enterprises choose Couchbase for the most demanding web and mobile applications.

Let’s set up a time to talk

You may also like...