5 Results for Hadoop

Cloudera Announces Hadoop World, and Hadoop Marches On

We've written before several times about Hadoop, an open source software framework for highly scalable queries and data-intensive distributed applications. The ecosystem of companies and organizations using Hadoop has grown dramatically in recent years, and we've also written about Cloudera, a well-funded company that is focusing on providing support and services for Hadoop, in addition to offering its own Hadoop distribution.

Today, Cloudera announced the first ever Hadoop World conference, to take place at the Roosevelt Hotel in New York City on October 2nd, with registration available here. A look at the companies and institutions organizing and participating in the event shows just how far Hadoop has come, and how it has extended well beyond just search applications.



As Microsoft Powers Yahoo!'s Search, Whither Yahoo!'s Open Efforts?

Back when Microsoft was actively pursuing an acquisition of Yahoo!, only to withdraw its bid, many observers felt that the withdrawal was good news for Yahoo!'s many open source and open initiatives. But we made the point back then that the game might just have begun. From Yahoo!'s open strategy for developers, to its Yahoo! User Interface Library (YUI), to the company's reliance on Hadoop for advanced fast queries, Yahoo! has always been a strong supporter of open standards. Yahoo!'s entire site runs on FreeBSD--a free operating system descended from AT&T Unix.

Microsoft still isn't acquiring Yahoo!, but it has struck a far-reaching search deal with the company. What are the implications for Yahoo!'s many open source and open standards initiatives?



OStatic Buffer Overflow...

Missing: Dell Ubuntu desktop PCs. The VAR Guy explores whether Dell has stopped selling Ubuntu desktop computers.

Open source Hive: Large-scale, distributed data processing made easy. Hive is a data analysis and query front end for Hadoop that makes Hadoop data files look like SQL tables.

Microsoft opened Linux-driver code after violating GPL. Did it act simply to head off any potentially embarrassing legal disputes over violations?

SpringSource and MindTouch seek to redefine the application server. Spurred by economic pressures on IT departments, new breeds of app servers are taking shape.

The tech jobs that the cloud will eliminate. IT pros face new competition for their jobs from cloud services. Which jobs go, and which become more valuable?



OStatic Buffer Overflow...

Hadoop and MapReduce are cheap and scalable for clustered queries, but they're slower than relational databases. Yale researchers have an improvement.

The Ubuntu Linux app store: fact or fiction? The emerging app store, which offers Ubuntu Linux and Debian applications, wasn?t built by Canonical.

Linux slips into Microsoft's warm, deadly embrace. How Microsoft will use the GPL to mount a serious backdoor assault on the core of the Linux platform.

Is Microsoft's GPL2 support really a big deal? It's recently released code is only for Linux Virtual Machines on Windows, not physical Linux servers and Linux desktops.

Red Hat is wrong to insist Microsoft disavow litigation. Did IBM, HP, Oracle, or even Red Hat ever declare that they will never, ever sue open source developers over patent infringements?

Palm's Linux secret makes the Pre. Palm Pre is no thriller as a smartphone, but the SDK reveals the most open mobile platform on the market.



Is Microsoft Going to Acquire Powerset?

While it's still only a rumor, VentureBeat is reporting that Microsoft has agreed to buy Silicon Valley semantic search engine Powerset for over $100 million--apparently to be announced next month. Powerset's search technology uses the open source cluster-based technology Hadoop, which provides fast answers to queries by using the resources of many computers. We wrote about both Hadoop and Powerset here, and got to hear from Powerset's Chad Walters. Based on what I saw Chad demonstrate, and Microsoft's need to catch up in search, I won't be surprised if this rumor is true.