As organizations grapple with a flood of structured and unstructured data generated by computers, mobile devices, sensors and social networks, they're under unprecedented pressure to analyze much more data at faster speeds and at lower costs to help deepen customer relationships, prevent threat and fraud, and identify new revenue opportunities.
Today’s announcement, which represents the work of hundreds of IBM developers and researchers in labs around the world, includes an industry-first innovation called "BLU Acceleration," which combines a number of techniques to dramatically improve analytical performance and simplify administration.
Also announced is the new IBM PureData System for Hadoop, designed to make it easier and faster to deploy Hadoop in the enterprise. Hadoop is the game-changing open-source software used to organize and analyze vast amounts of structured and unstructured data, such as posts to social media sites, digital pictures and videos, online transaction records, and cell phone location data.
The new system can reduce from weeks to minutes the ramp-up time organizations need to adopt enterprise-class Hadoop technology with powerful, easy-to-use analytic tools and visualization for both business analysts and data scientists. In addition, it provides enhanced Big Data tools for monitoring, development and integration with many more enterprise systems.
"Big data is about using all data in context at the point of impact," said Bob Picciano, general manager, IBM Information Management. "With the innovations we are delivering, now every organization can realize value quickly by leveraging existing skills as well as adopt new capabilities for speed and exploration to improve business outcomes."
BLU Acceleration enables users to have much faster access to key information, leading to better decision-making. The software extends the capabilities of traditional in-memory systems -- which allows data to be loaded into Random Access Memory instead of hard disks for faster performance -- by providing in-memory performance even when data sets exceed the size of the memory. During testing, some queries in a typical analytics workload were more than 1000 times faster when using the combined innovations of BLU Acceleration.
Innovations in BLU Acceleration include "data skipping," which allows the ability to skip over data that doesn't need to be analyzed, such as duplicate information; the ability to analyze data in parallel across different processors; and greater ability to analyze data transparently to the application, without the need to develop a separate layer of data modeling. Another industry-first advance in BLU Acceleration is called "actionable compression," where data no longer has to be decompressed to be analyzed.
BNSF Railway Company, one of the largest freight rail transportation networks in North America, is using IBM BLU Acceleration to more quickly understand the vast amounts of data from the organization's 1700 servers that tracks maintenance, weather, scheduling, inventory, safety, deliveries and more. BNSF operates more than 1,400 trains a day on 32,500 route miles of track in 28 states and two Canadian provinces.
"BNSF transports many of the products and materials that we use every day in America and around the world, so tracking of these shipments is critical to our organization," said Kent Collins, Database Solutions Architect, BNSF Railway Company. "Working with IBM we are now embracing our organization's Big Data with the power of analytics. Thanks to the new technology, we're performing tasks more quickly than ever before, for example, one of the queries improved over 100-fold, and our storage consumption went down by about 10 times. One of the things that impressed us the most about BLU Acceleration is its simplicity. We just load the data and run queries."
IBM innovations make Big Data more accessible and enterprise-ready
The new offerings expand what is already the industry's deepest portfolio of Big Data technologies and solutions, spanning software, services, research and hardware. The IBM Big Data platform combines traditional data warehouse technologies with new Big Data techniques, such as Hadoop, stream computing, data exploration, analytics and enterprise integration, to create an integrated solution to address these critical needs.
IBM PureData System for Hadoop is the next step forward in IBM’s overall strategy to deliver a family of systems with built-in expertise that leverages its decades of experience reducing the cost and complexity associated with information technology. This new system integrates IBM InfoSphere BigInsights, which allows companies of all sizes to cost-effectively manage and analyze data and add administrative, workflow, provisioning and security features, along with best-in-class analytical capabilities from IBM Research.
Kelley Blue Book, www.kbb.com, the leading provider of new and used car information, will be evaluating the new PureData System for Hadoop to analyze click stream data created by users on its website. The company will be able to analyze this information, including social media data, to see what topics visitors care most about, such as used and new-vehicle prices, accident reports, safety recall and warranty data, and car shopper reviews.
"Kelley Blue Book collects all kinds of data from various sources, so managing the efficiency of data is critical to grow our business," said Steve Chow, vice president of technology and data intelligence for Kelley Blue Book. "We see many opportunities to leverage the PureData System for Hadoop as a strategic platform to expand on our analytic ecosystem and tap the value of social media, text and machine data to get a better view of our consumers and customers to improve their overall experience on KBB.com."
This announcement also includes the following new versions of IBMs Big Data solutions:
· A new version of InfoSphere BigInsights, IBM’s enterprise-ready Hadoop offering, which makes it simpler to develop applications using existing SQL skills, compliance security and high availability features vital for enterprise applications. BigInsights offers three entry points: free download, enterprise software and now an expert integrated system, IBM PureData System for Hadoop.
· A new version of InfoSphere Streams, unique “stream computing” software that enables massive amounts of data in motion to be analyzed in real-time, with performance improvements, and simplified application development and deployment. ·A new version of Informix including TimeSeries Acceleration for operational reporting and analytics on smart meter and sensor data.