{"id":2335,"date":"2013-06-10T06:35:23","date_gmt":"2013-06-10T06:35:23","guid":{"rendered":"http:\/\/www.odbms.org\/blog\/?p=2335"},"modified":"2014-12-20T18:14:40","modified_gmt":"2014-12-20T18:14:40","slug":"on-big-data-and-hadoop-interview-with-paul-c-zikopoulos","status":"publish","type":"post","link":"https:\/\/www.odbms.org\/blog\/2013\/06\/on-big-data-and-hadoop-interview-with-paul-c-zikopoulos\/","title":{"rendered":"On Big Data and Hadoop. Interview with Paul C. Zikopoulos."},"content":{"rendered":"<blockquote><p><strong><em>&#8220;We&#8217;re not all LinkedIns and Facebooks; we don&#8217;t have budgets to hire 1000s of new hires with these skills, and what&#8217;s more we&#8217;ve invested in existing skills and people today. So to democratize Big Data, you need it to be consumable and integrated. These will flatten the time to value for Hadoop&#8221;<\/em> &#8212; Paul C. Zikopoulos.<\/strong><\/p><\/blockquote>\n<p>I have interviewed <strong>Paul C. Zikopoulos<\/strong>, Director of Technical Professionals for <strong>IBM<\/strong> Software Group\u2019s Information Management division. The topic: Apache Hadoop and Big Data, State of the Union in 2013 and Vision for the future.<\/p>\n<p>RVZ<\/p>\n<p><strong>Q1. What what do you think is still needed for big data analytics to be really useful for the enterprise?<\/strong><\/p>\n<p><strong>Paul C. Zikopoulos<\/strong>: <em>Integration and Consumability.<\/em> We&#8217;re not all LinkedIns and Facebooks; we don&#8217;t have budgets to hire 1000s of new hires with these skills, and what&#8217;s more we&#8217;ve invested in existing skills and people today.<br \/>\nSo to democratize Big Data, you need it to be consumable and integrated.<br \/>\nThese will flatten the time to value for Hadoop. IBM is working really hard in these areas. I could go into other areas, but this is key.<\/p>\n<p><strong>Q2. Hadoop is still quite new for many enterprises, and different enterprises are at different stages in their Hadoop journey.<br \/>\nWhen you speak with your customers what are the typical use cases and requirements they have?<\/strong><\/p>\n<p><strong>Paul C. Zikopoulos<\/strong>: No matter what industry I&#8217;m working with, 90% of the Big Data use cases always have 2 common denominators: Whole Population Analytics to break free of traditional capacity constrained samples and analytics for data at-rest moving to in-motion.<br \/>\nSo if you think about churn prediction, next best action, next best offer, fraud prediction, condition monitor, out of tolerance quality predictors, and more &#8211; it&#8217;s all going to rely on using more data (could be volume, could be variety, and often both) to build better models.<br \/>\nIf you&#8217;re looking for specific use cases by industry, <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www-01.ibm.com\/software\/success\/cssdb.nsf\/solutionareaL2VW?OpenView&amp;Count=30&amp;RestrictToCategory=default_BigData');\"  href=\"http:\/\/www-01.ibm.com\/software\/success\/cssdb.nsf\/solutionareaL2VW?OpenView&amp;Count=30&amp;RestrictToCategory=default_BigData\">here&#8217;s a bunch of them<\/a> that we&#8217;ve worked with clients on at IBM.<\/p>\n<p><strong>Q3. How do you categorize the various stages of the Hadoop usage in the enterprises?<\/strong><\/p>\n<p><strong>Paul C. Zikopoulos<\/strong>: The <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www-935.ibm.com\/services\/us\/gbs\/thoughtleadership\/');\"  href=\"http:\/\/www-935.ibm.com\/services\/us\/gbs\/thoughtleadership\/\">IBM Institute for Business Value<\/a> did a joint study with <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.sbs.ox.ac.uk');\"  href=\"http:\/\/www.sbs.ox.ac.uk\">Said Business School<\/a> (University of Oxford). They talked to a lot of Big Data folks and found that 28% were in the pilot phase, 24% haven&#8217;t started anything, and 47% are planning. After going through their research, they broke the answers\u00a0into four stages: <em>Educate \/ Explore \/ Engage \/ Execute<\/em>.<br \/>\nSo I&#8217;ll detail those four stages, but you can get the entire study <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www-935.ibm.com\/services\/us\/gbs\/thoughtleadership\/ibv-big-data-at-work.html');\"  href=\"http:\/\/www-935.ibm.com\/services\/us\/gbs\/thoughtleadership\/ibv-big-data-at-work.html\">here.<\/a><\/p>\n<p><em>Educate<\/em>: <em>Building a base of knowledge<\/em> (24 percent of respondents).<br \/>\nIn the Educate stage, the primary focus is on awareness and knowledge development.<br \/>\nAlmost 25 percent of respondents indicated they are not yet using big data within their organizations. While some remain relatively unaware of the topic of big data, our interviews suggest that most organizations in this stage are studying the potential benefits of big data technologies and analytics, and trying to better understand how big data can help address important business opportunities in their own industries or markets.<br \/>\nWithin these organizations, it is mainly individuals doing the knowledge gathering as opposed to formal work groups, and their learnings are not yet being used by the organization. As a result, the potential for big data has not yet been fully understood and embraced by the business executives.<\/p>\n<p><em>Explore:<\/em> <em>Defining the business case and roadmap<\/em> (47 percent).<br \/>\nThe focus of the Explore stage is to develop an organization\u2019s roadmap for big data development.<br \/>\nAlmost half of respondents reported formal, ongoing discussions within their organizations about how to use big data to solve important business challenges.<br \/>\nKey objectives of these organizations include developing a quantifiable business case and creating a big data blueprint.<br \/>\nThis strategy and roadmap takes into consideration existing data, technology and skills, and then outlines where to start and how to develop a plan aligned with the organization\u2019s business strategy.<\/p>\n<p><em>Engage<\/em>: <em>Embracing big data<\/em> (22 percent).<br \/>\nIn the Engage stage, organizations begin to prove the business value of big data, as well as perform an assessment of their technologies and skills.<br \/>\nMore than one in five respondent organizations is currently developing POCs to validate the requirements associated with implementing big data initiatives, as well as to articulate the expected returns. Organizations in this group are working \u2013 within a defined, limited scope \u2013 to understand and test the technologies and skills required to capitalize on new sources of data.<\/p>\n<p><em>Execute:<\/em> <em>Implementing big data at scale<\/em> (6 percent).<br \/>\nIn the Execute stage, big data and analytics capabilities are more widely operationalized and implemented within the organization. However, only 6 percent of respondents reported that their organizations have implemented two or more big data solutions at scale \u2013 the threshold for advancing to this stage. The small number of organizations in the Execute stage is consistent with the implementations we see in the marketplace. Importantly, these leading organizations are leveraging big data to transform their businesses and thus are deriving the greatest value from their information assets.<br \/>\nWith the rate of enterprise big data adoption accelerating rapidly \u2013 as evidenced by 22 percent of respondents in the Engage stage, with either POCs or active pilots underway \u2013 we expect the percentage of organizations at this stage to more than double over the next year. NOW ! While only 6% are executing, about 25% of respondents in this study are &#8216;piloting&#8217; initiatives.<\/p>\n<p><strong>Q4. Could you give us some examples on how do you get (Big) Data Insights?<\/strong><\/p>\n<p><strong>Paul C. Zikopoulos<\/strong>: IBM has a non-forked version of Hadoop called <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www-01.ibm.com\/software\/data\/infosphere\/biginsights\/');\"  href=\"http:\/\/www-01.ibm.com\/software\/data\/infosphere\/biginsights\/\">BigInsights<\/a>.<br \/>\nWhen it comes to open source, it&#8217;s really hard to look past IBM&#8217;s achievements. <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/lucene.apache.org');\"  href=\"http:\/\/lucene.apache.org\">Lucene<\/a>, <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/db.apache.org\/derby\/');\"  href=\"http:\/\/db.apache.org\/derby\/\">Apache Derby<\/a>, <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/jakarta.apache.org');\"  href=\"http:\/\/jakarta.apache.org\">Apache Jakarta<\/a>, <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/geronimo.apache.org');\"  href=\"http:\/\/geronimo.apache.org\">Apache Geronimo<\/a>, <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Eclipse_(software)');\"  href=\"http:\/\/en.wikipedia.org\/wiki\/Eclipse_(software)\">Eclipse<\/a> and so much more &#8211; so it shouldn&#8217;t surprise anyone that IBM is squarely in Hadoop&#8217;s corner.<br \/>\nOur strategy here is Embrace and Extend. We will embrace the open source Hadoop community. We are a vibrant part of it (in the latest Hadoop patch as of the time of this interview, the most fixes came from IBM; we have a number of contribution to <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/hbase.apache.org');\"  href=\"http:\/\/hbase.apache.org\">HBase<\/a>, and more). IBM has a long history in understanding enterprise concerns, that&#8217;s the extend part.<br \/>\nSome of the extensions work just fine with open source. For example, we provide a rich management tool, a quick installer, and concentrate opens ports into a single one to make your Hadoop cluster pass audit easier.<br \/>\nSome of our extensions overlay Hadoop. For example, our\u00a0<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/pic.dhe.ibm.com\/infocenter\/bigins\/v1r2\/index.jsp?topic=%2Fcom.ibm.swg.im.infosphere.biginsights.doc%2Fdoc%2Fc0059636.html');\"  href=\"http:\/\/pic.dhe.ibm.com\/infocenter\/bigins\/v1r2\/index.jsp?topic=%2Fcom.ibm.swg.im.infosphere.biginsights.doc%2Fdoc%2Fc0059636.html\">Adaptive Map Reduce<\/a> which can deliver a 30% performance boost using its algorithms to optimize the overhead of MapReduce task startup.<br \/>\nWe have enhanced schedulers, announced the option to use <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/IBM_General_Parallel_File_System');\"  href=\"http:\/\/en.wikipedia.org\/wiki\/IBM_General_Parallel_File_System\">GPFS<\/a> as the file system which provides a lot of benefits, and more.  But these are optional. If you use BigInsights you are using a non-forked Hadoop distro.<br \/>\nSome of our extensions are &#8217;round-trip-able&#8217; &#8211; if you use them, you can walk back to pure Open Source Hadoop at any time, and some aren&#8217;t. If you want to get our fast to install non extended version of Hadoop for free, you can download InfoSphere BigInsights Basic Edition here.<\/p>\n<p><strong>Q5. What are the main technical challenges for big data analytics when data is in motion rather than at rest?<\/strong><\/p>\n<p><strong>Paul C. Zikopoulos<\/strong>: Well the challenge is to ask yourselves how do I get those analytics artifacts that I learn at rest either in Hadoop or the EDW and get them to real time; I call this <em>Nowcasting<\/em> instead of Forecasting.<br \/>\nIn order to do that, with agility and speed, you&#8217;re going to want a platform that&#8217;s designed for in-motion at-rest analytics.<br \/>\nI&#8217;m not seeing that in the marketplace today. In fact, I&#8217;m not seeing a focus on in-motion analytics.<br \/>\nWhen I refer to in-motion, I refer to the Velocity attribute of Big Data (people often talk to the Big Vs in Big Data, so that&#8217;s the one for in-motion).  Velocity IS the game change.<br \/>\nIt&#8217;s not just how fast data is produces or changes, BUT the speed at which it must be understood, acted upon, turned into something useful. So to me the main technical challenge in getting to in-motion from at-rest is the fact that I\u2019m not really seeing that kind of true integration and it\u2019s something we squarely hit on in the IBM Big Data platform.<br \/>\nLet me share an example, if you were to build some text analytical function at rest in Hadoop, perhaps an email phrase that&#8217;s highly correlated with a customer churn even, you can SEAMLESSLY take that artifact and deploy it on InfoSphere Streams (our Big Data Velocity engine) without any work at all, you just deploy the compiled AOG file. Wow! Platform.<br \/>\nThe other challenge is just the volume and speed in which you have to process events. IBM invented our streaming products with the US government &#8211; and it can scale. For example, one of our clients analyzes and correlates over 5M market messages a second to execute algorithmic option trades with average latency of 50 microseconds.<br \/>\nThe point is that this is not <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/en.wikipedia.org\/wiki\/Complex_event_processing');\"  href=\"http:\/\/en.wikipedia.org\/wiki\/Complex_event_processing\">CEP<\/a>; this is not 1 or 2 servers with 10-20,000 events a second. CEP can be a style or a technology.<br \/>\nYou need to be able to do the style, but you need a technology platform too. If you asked me what is one of the biggest things IBM has done in the Big Data space, it is flattening the technical challenge to perform Big Data analytics on data in motion.<\/p>\n<p><strong>Q6. In your opinion, is there a technology which is best suited to build a Big Data Analytics Data Platform? If yes, which one?<\/strong><\/p>\n<p><strong>Paul C. Zikopoulos<\/strong>: Well you say the word platform, and that&#8217;s going to imply a number of technologies.\u00a0Right?<br \/>\nWhen I get asked this question, I refer to my Big Data Platform Manifesto, this is what you&#8217;re going to need to form a Big Data platform. Many people think big data is about Hadoop technology.  It is and it isn\u2019t.  Its about a lot more than Hadoop.<br \/>\nOne of the key requirements is to understand and navigate federated sources of big data \u2013 to discover data in place.<br \/>\nNew technology has emerged that discovers, indexes, searches, and navigates diverse sources of big data.  Of course big data is also about Hadoop.  Hadoop is a collection of open source capabilities.<br \/>\nTwo of the most prominent ones are Hadoop Distributed File System (HDFS) for storing a variety of information, and MapReduce \u2013 a parallel processing engine.<br \/>\nData warehouses also manage big data- the volume of structured data is growing quickly.  The ability to run deep analytic queries on huge volumes of structured data is a big data problem.  It requires massive parallel processing data warehouses and purpose-built appliances for deep analytics.<br \/>\nBig data isn\u2019t just at rest \u2013 it\u2019s also in motion.  Streaming data represents an entirely different big data problem \u2013 the ability to quickly analyze and act upon data while its still moving.  This new technology opens a world of possibilities \u2013 from processing volumes of data that were just not practical to store, to detecting insight and responding quickly.<br \/>\nAs much of the worlds big data is unstructured and in textual content, text analytics is a critical component to analyze and derive meaning from text.<br \/>\nAnd finally, integration and governance technology \u2013 ETL, data quality, security, MDM, and lifecycle management.  Integration and governance technology establishes the veracity of big data, and is critical in determining whether information is trusted.<br \/>\nFinally, consumability, characteristics here include such items as being able to declare what you want done, not how to do it, expert integrated systems, deployment patterns, and so on.<\/p>\n<p>So if you wanted a short answer a Big Data platform needs to be consumable, governable, give the opportunity for analytics in-motion, at rest (in an EDW AND things like Hadoop), discovery and index Big Data, and finally, provide the ability to analyze unstructured data.<\/p>\n<p>Notice I didn&#8217;t mention one IBM product above; you can piece together a platform with a mash of vendors if you want; if you start to look into what IBM is doing, and although I&#8217;m bias and work there, I think you will find we have a true Big Data platform.<\/p>\n<p><strong>Q6. Does it make sense in your opinion to virtualize Hadoop?<\/strong><\/p>\n<p><strong>Paul C. Zikopoulos<\/strong>: It can. It&#8217;s going to depend on the use case right? I see a lot of efforts by EMC in that area and that&#8217;s cool. Of course the Cloud and Hadoop kind of go hand and hand. I think this space is growing by leaps and bounds&#8230;fun to watch.<\/p>\n<p><strong>Q7. What is your opinion on the evolution of Hadoop?<\/strong><\/p>\n<p><strong>Paul C. Zikopoulos<\/strong>: It&#8217;s just that &#8211; an evolution. I think that innovation is going to deliver more and more of what enterprises need from a &#8216;hardening&#8217; aspect as time goes on. <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/hadoop.apache.org\/docs\/current\/');\"  href=\"http:\/\/hadoop.apache.org\/docs\/current\/\">Hadoop 2.0<\/a> is a big step forward for availability. It&#8217;s out there yet now, but not ready for production in my humble opinion (although some vendors are shipping it, their documentation tells you it&#8217;s not ready for production).\u00a0Next version of MapReduce (<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/hadoop.apache.org\/docs\/current\/hadoop-yarn\/hadoop-yarn-site\/YARN.html');\"  href=\"http:\/\/hadoop.apache.org\/docs\/current\/hadoop-yarn\/hadoop-yarn-site\/YARN.html\">Yarn<\/a>) and making Hive really fast (<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/hortonworks.com\/blog\/introducing-tez-faster-hadoop-processing\/');\"  href=\"http:\/\/hortonworks.com\/blog\/introducing-tez-faster-hadoop-processing\/\">Tez<\/a>) are also part of the evolution, stay close here, it&#8217;s changing fast!<br \/>\nThat&#8217;s the best part of community. Now if you look at most of the vendors in this space, many are getting distracted and working on non-Hadoop&#8217;ish things to help Hadoop, and that&#8217;s fine too. We\u2019re on a good path here.<br \/>\nA lot of vendors here are\u00a0and more popping up all the time (like Intel just announced their own distribution). At some point, I think there will be a consolidated of distros out there, but with the hype around it right now, it will continue to evolve.<br \/>\nFor example, it&#8217;s becoming more than just a MapReduce processing areas. Right? Lots of technologies are storing data in Hadoop&#8217;s HDFS, but bypassing MapReduce. So I find the file system key to the evolution.<\/p>\n<p><strong>Q8. Can In-Memory Data Management play a significant role for Big Data Analytics? If yes, how?<\/strong><\/p>\n<p><strong>Paul C. Zikopoulos<\/strong>: I think it&#8217;s essential, but in a Big Data world, it would seem that the amount of data we are storing &#8211; at least right now &#8211; is proportionally bigger than the amount we can get into memory at a cost effective rate.<br \/>\nSo in-memory needs to harmoniously live with the database. If you look at what we did with <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www-01.ibm.com\/software\/data\/db2\/linux-unix-windows\/db2-blu-acceleration\/');\"  href=\"http:\/\/www-01.ibm.com\/software\/data\/db2\/linux-unix-windows\/db2-blu-acceleration\/\">BLU Acceleration<\/a> and DB2, we did just that.<br \/>\nIn-memory columnar and typical relational tables live side by side in the same database kernel.<br \/>\nYou can work with both structures together, in the same memory structures, queries, and so on.<\/p>\n<p>When you can&#8217;t fit all the columns into memory, performance either falls off the cliff, or worse! Could crash the system.<\/p>\n<p>From an analytics side, BLU Acceleration\u00a0allows you to run queries faster, amazingly faster. That&#8217;s going to get more iterations of queries, analytics and what not. It&#8217;s not for everything, but if you can help my reports run faster, that&#8217;s cool. So imagine you find in a Discovery Zone powered by a Hadoop engine some interesting pieces of information, pulling that out and packing it into an in-memory structure and surfacing it to the enterprise is going to be pretty cool<\/p>\n<p><strong>Q9. What about elastic computing in the Cloud? How does it relate to Big Data Analytics?<\/strong><\/p>\n<p><strong>Paul C. Zikopoulos<\/strong>: This is pretty important because I need the utility-like nature of a Hadoop cluster, without the capital investment.  Time\u00a0to analytics is the benefit here. After all, if you&#8217;re a start-up analytics firm seeking venture capital funding, do you really walk into to your investor and ask for millions to set up a cluster; you&#8217;ll get kicked out the door.<br \/>\nNo, you go to Racksapce or Amazon, swipe a card, and get going. IBM is there with its Hadoop clusters (private and public) and you&#8217;re looking at clusters that cost as low as $0.60 US an hour.<br \/>\nI think at one time I costed out a 100 node Hadoop cluster for an hour and it was like $34US &#8211; and the price has likely gone down. What&#8217;s more, your cluster will be up and running in 30 minutes. So on-premise or off-premise Cloud is key for these environments.<\/p>\n<p>___________________________<br \/>\n<strong>Paul C. Zikopoulos,<\/strong> <em>B.A., M.B.A., is the Director of Technical Professionals for IBM Software Group\u2019s Information Management division and additionally leads the World Wide Competitive Database and Big Data Technical Sales Acceleration teams. <\/em><em><br \/>\nPaul is an award winning writer and speaker with more than 19 years of experience in Information Management.<br \/>\nPaul is seen as a global expert in Big Data and database. He was picked by SAP as one of its \u201cTop 50 Big Data Twitter Influencers\u201d, named by BigData Republic to its \u201cTop 100 Most Influential\u201d list, Technopedia listed him a \u201cA Big Data Expert to Follow\u201d, and he was consulted on Big Data by the popular TV show \u201c60 Minutes\u201d.<br \/>\nPaul has written more than 350 magazine articles and 16 books, some of which include \u201cHarness the Power of Big Data\u201d, \u201cUnderstanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data\u201d, \u201cWarp Speed, Time Travel, Big Data, and More: DB2 10 New Features\u201d, \u201cDB2 pureScale: Risk Free Agile Scaling\u201d, \u201cDB2 Certification for Dummies\u201d, \u201cDB2 for Dummies\u201d, and more.<br \/>\nIn his spare time, he enjoys all sorts of sporting activities, including running with his dog Chachi, avoiding punches in his MMA training, and trying to figure out the world according to Chlo\u00eb\u2014his daughter.<\/em><\/p>\n<p><strong>Related Posts<\/strong><\/p>\n<p><strong>&#8211;<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.odbms.org\/blog\/2013\/04\/on-virtualize-hadoop-interview-with-joe-russell\/');\"  href=\"http:\/\/www.odbms.org\/blog\/2013\/04\/on-virtualize-hadoop-interview-with-joe-russell\/\">On Virtualize Hadoop. Interview with Joe Russell. April 29, 2013<\/a><\/strong><\/p>\n<p>&#8211;<strong><a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.odbms.org\/blog\/2013\/04\/on-pivotal-hd-interview-with-scott-yara-and-florian-waas\/');\"  href=\"http:\/\/www.odbms.org\/blog\/2013\/04\/on-pivotal-hd-interview-with-scott-yara-and-florian-waas\/\">On Pivotal HD. Interview with Scott Yara and Florian Waas. April 22, 2013<\/a><\/strong><\/p>\n<p>&#8211;<strong><a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/www.odbms.org\/blog\/2013\/01\/on-big-data-velocity-interview-with-scott-jarr\/');\"  href=\"http:\/\/www.odbms.org\/blog\/2013\/01\/on-big-data-velocity-interview-with-scott-jarr\/\">On Big Data Velocity. Interview with Scott Jarr.  January 28, 2013<\/a><\/strong><\/p>\n<p><strong>Resources<\/strong><\/p>\n<p>&#8211; <strong>Harness the Power of Big Data<\/strong> <em>The IBM Big Data Platform.<\/em><br \/>\nPaul C. Zikopoulos, Dirk deRoos, Krishnan Parasuraman, Thomas Deutsch, David Corrigan,James Giles, Chris Eaton.<br \/>\n<strong>Book<\/strong>, Copyright \u00a9 2013 by The McGraw-Hill Companies.<br \/>\n<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/public.dhe.ibm.com\/common\/ssi\/ecm\/en\/imm14100usen\/IMM14100USEN.PDF');\"  href=\"http:\/\/public.dhe.ibm.com\/common\/ssi\/ecm\/en\/imm14100usen\/IMM14100USEN.PDF\">Download Book (.PDF 250 pages) <\/a><\/p>\n<p>&#8211; <strong>Warp Speed, Time Travel, Big Data, and More.<\/strong> <em>DB2 10 for Linux, UNIX, and Windows New Features<\/em>.<br \/>\nPaul Zikopoulos, George Baklarz, Matt Huras, Walid Rjaibi, Dale McInnis, Matthias Nicola, Leon Katsnelson.<br \/>\n<strong>Book<\/strong>, Copyright \u00a9 2012 by The McGraw-Hill Companies.<br \/>\n<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/public.dhe.ibm.com\/common\/ssi\/ecm\/en\/imm14091usen\/IMM14091USEN.PDF');\"  href=\"http:\/\/public.dhe.ibm.com\/common\/ssi\/ecm\/en\/imm14091usen\/IMM14091USEN.PDF\"><strong>Download book (.PDF 217 pages) <\/strong><\/a><\/p>\n<p>&#8211; <strong>Understanding Big Data<\/strong> <em>Analytics for Enterprise Class Hadoop and Streaming Data<\/em>.<br \/>\nPaul C. Zikopoulos, Chris Eaton, Dirk deRoos, Thomas Deutsch, George Lapis,<br \/>\n<strong>Book<\/strong>, Copyright \u00a9 2012 by The McGraw-Hill Companies.<br \/>\n<a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/public.dhe.ibm.com\/common\/ssi\/ecm\/en\/iml14296usen\/IML14296USEN.PDF');\"  href=\"http:\/\/public.dhe.ibm.com\/common\/ssi\/ecm\/en\/iml14296usen\/IML14296USEN.PDF\">Download book (.PDF 142 pages) <\/a><\/p>\n<p><strong>&#8211; ODBMS.org Resources on Big Data and Analytical Data Platforms:<\/strong><br \/>\n<a href=\"http:\/\/www.odbms.org\/downloads.aspx#adp\">Blog Posts | Free Software | Articles\t| Lecture Notes\t| PhD and Master Thesis|<br \/>\n<\/a><\/p>\n<p>&#8211; <strong>Follow ODBMS.org on Twitter: <a onclick=\"javascript:pageTracker._trackPageview('\/outgoing\/twitter.com\/odbmsorg');\"  href=\"https:\/\/twitter.com\/odbmsorg\">@odbmsorg<\/a><\/strong><\/p>\n<p>##<\/p>\n<!-- AddThis Advanced Settings generic via filter on the_content --><!-- AddThis Share Buttons generic via filter on the_content -->","protected":false},"excerpt":{"rendered":"<p>&#8220;We&#8217;re not all LinkedIns and Facebooks; we don&#8217;t have budgets to hire 1000s of new hires with these skills, and what&#8217;s more we&#8217;ve invested in existing skills and people today. So to democratize Big Data, you need it to be consumable and integrated. These will flatten the time to value for Hadoop&#8221; &#8212; Paul C. [&hellip;]<!-- AddThis Advanced Settings generic via filter on get_the_excerpt --><!-- AddThis Share Buttons generic via filter on get_the_excerpt --><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[27,35,66,78,97,102,147,171,193,224,239,263,264,344,355,446,461,486],"_links":{"self":[{"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/posts\/2335"}],"collection":[{"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/comments?post=2335"}],"version-history":[{"count":1,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/posts\/2335\/revisions"}],"predecessor-version":[{"id":3701,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/posts\/2335\/revisions\/3701"}],"wp:attachment":[{"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/media?parent=2335"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/categories?post=2335"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.odbms.org\/blog\/wp-json\/wp\/v2\/tags?post=2335"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}