Bridges are Being Created Between Hadoop and Powerful Tech Tools
In organizations and businesses of all sizes, the Big Data trend--sorting and sifting large data sets with new tools in pursuit of surfacing meaningful angles on stored information--is on the rise. As we've noted, the Hadoop project has been one of the big drivers of this trend. It has also given rise to commercial companies that offer custom Hadoop distributions, support, training and more (with Hortonworks and Cloudera being prime examples).
Now, there are many types of technology bridges being created to integrate Hadoop's capabilities with other powerful tools. These include tools to bridge Hadoop and coud platforms like OpenStack, and a new bridge between Hadoop and the very powerful R statistics-focused programming language.
Revolution Analytics is the company behind the R programming language. We covered R in detail in this post. It includes all of the components that any professional statistician might need, and is well-known for generating beautifully detailed graphics. It is also a complete language.
Now, as ZDNet reports, "Revolution Analytics is supporting the integration of R inside Cloudera's Distribution Including Apache Hadoop (CDH), versions 3.0 and 4.0." This will facilitate seamless, predictive analytics with data used by Hadoop and R. It's an example of several partnerships that Cloudera has struck around Hadoop, creating bridges between it and other important technology platforms.
Also notable on the Hadoop integration front is Project Savanna, which aims to enable users to easily provision and manage Hadoop clusters on OpenStack. It's backed by Red Hat, Mirantis, and Hortonworks, and promises to create important bridges between the powerful data crunching capabilities of Hadoop and the cloud flexibility of OpenStack. Savanna should become an important project, focused on two of the most important enterprise open source tools available.
Hortonworks is also focused on technology that can orbit around and integrate with Hadoop. Its Hadoop distribution includes the latest version of Apache Ambari, an open source tool for management and monitoring of Apache Hadoop clusters.
Hadoop is now a vital platform that will see many tools integrating with it right down to the file system level. This is being driven by both the open source community and commercial companies focused on Hadoop.