Hortonworks' Hadoop Platform Now on Google Cloud Platform

by Ostatic Staff - Jan. 23, 2015

On the heels of its introduction as a hot new publlic company a few weeks ago, Hortonworks, which focuses on the open source Big Data platform Hadoop, is expanding its reach. Recently, Hortonworks extended its technology partner program with the addition of three new certifications it offers. Hadoop-related certification is a very hot commodity in the tech job market at the moment.

And now, Hortonworks‘ own distribution of the Hadoop platform is available and supported on Google’s public cloud, the Google Cloud Platform--a ringing endorsement.

Hadoop lets organizations sift and process data in powerful ways, allowing them to cull insights that they couldn't get with standard tools like databases. Hortonworks' distribution of Hadoop is already popular, but integrating with Google's Cloud Platform will give it a boost.

According to a post from Hortonworks:

"As enterprises adopt the cloud and Apache Hadoop, they look to leverage the Google Cloud Platform and Hortonworks Data Platform (HDP), the only 100% open source distribution of Apache Hadoop. Today, we are thrilled to announce the certification and availability of HDP on Google Cloud Platform."

"With this new certification, enterprises worldwide can dynamically provision HDP clusters on Google Compute Engine and Google Cloud Storage to store, discover and analyze a unified collection of structured and unstructured information assets. With Google Cloud Platform and Hortonworks Data Platform, enterprises benefit from limitless scalability and an enterprise-grade platform backed by community driven open source innovation."

According to Hortonworks, the company worked closely with Google on the integration. Engineering teams collaborated to integrate “bdutil” with Apache Ambari Blueprints API, "to deliver a simple and streamlined provisioning experience for the end user," says the company. Key highlights of the joint solution include:

Google’s “bdutil” with Apache Ambari plugin to provision infrastructure and fully configure the Hadoop cluster.

Source code available for use & open contribution on GitHub with Apache License v2.

You can learn more at https://cloud.google.com/solutions/hadoop and there is detailed “bdutil” usage instruction available on a github page.