5 Results for yahoo

Cloudera Announces Hadoop World, and Hadoop Marches On

We've written several times before about Hadoop, an open source software framework for highly scalable queries and data-intensive distributed applications. The ecosystem of companies and organizations using Hadoop has grown dramatically in recent years, and we've also written about Cloudera, a well-funded company that focuses on providing support and services for Hadoop, in addition to offering its own Hadoop distribution.

Today, Cloudera announced the first-ever Hadoop World conference, to take place at the Roosevelt Hotel in New York City on October 2nd, with registration available here. A look at the companies and institutions organizing and participating in the event shows just how far Hadoop has come, and how far it has extended beyond search applications.



At Hadoop Summit, Yahoo! Announces its Tested Distribution

At today's Hadoop Summit in Silicon Valley, Yahoo! announced the availability of the Yahoo! Distribution of Hadoop, a source-only version of Apache Hadoop that Yahoo! uses within its own search engine. Hadoop, of course, is an open source software framework that helps process very large data sets, and is widely used in large-scale data mining applications as well as in search tools at sites like Facebook and many others. For developers and users interested in Hadoop, it's worth noting that the Yahoo! Distribution of Hadoop has been widely tested and developed at Yahoo! for years now, as Eric Baldeschwieler, VP of grid computing at Yahoo!, described in detail here.


Yahoo Extends its Research Reach, in the Open Source Cloud

As GigaOm reports today, Yahoo is expanding its cloud initiatives, extending its reach into research territory. Initially available only to researchers at Carnegie Mellon University, Yahoo's 4,000-core, 1.5-petabytes-of-storage M45 cluster is now available to their counterparts at the University of California at Berkeley, Cornell University and the University of Massachusetts at Amherst. Yahoo's M45 cluster runs Hadoop, an open source distributed file system and parallel execution environment that enables its users to process massive amounts of data. Apache Hadoop is an open source project of the Apache Software Foundation, to which Yahoo engineers have been the primary contributors to date. Find out more at GigaOm.


Cloudera's Biz Model: Supporting Hadoop

We've covered Hadoop on a few occasions here at OStatic. Sponsored by the Apache Software Foundation, Hadoop is a software framework able to take advantage of huge clusters of computers to produce fast results for queries and more, by breaking them into parts. Yahoo makes extensive use of Hadoop for its search features. Now, as Valleywag is reporting, a veteran of Bear Stearns and Facebook is one of the folks behind Cloudera, a business focusing on supporting Hadoop deployments.


HP, Yahoo and Intel Leverage Hadoop for a Compute Cloud

After dragging its feet, Hewlett-Packard is stepping up with an answer to cloud computing by inking a partnership with Intel, Yahoo and three universities to create a cloud computing testbed. The cloud will comprise six physical locations where mostly HP servers, containing between 1,000 and 4,000 mostly Intel cores, will run Apache Hadoop. Hadoop is the open source infrastructure for taking advantage of huge clusters of computers to produce fast results for queries. It's behind much of Yahoo's search, as we covered here. Today, GigaOm offers a complete analysis. Check it out.