A Look at Clustered File Systems: Interview with Gluster's Anand Babu Periasamy

by Joe Brockmeier - Mar. 23, 2010Comments (6)

Anand Babu Periasamy

As the saying goes, few things in life are certain except death and taxes. To that, we should add another certainty: that the amount of data that you need to store and manage will continue to grow at a rapid pace. One way to deal with this profusion of data is with clustered storage, like Gluster.

Gluster is an open source storage platform for working with large amounts of data (terabytes all the way up to petabytes) that ties together everything from the operating system layer to filesystem and management interface. To get a deeper view on Gluster, we asked Anand Babu Periasamy, CTO and co-founder of Gluster, to describe the technology and give a glimpse into the project's roadmap.

OStatic: How did Gluster get its start? What's the origin of the software and the technology?

Hitesh Chellani and I co-founded Gluster, and we had been part of the team at California Digital Corporation that built the 'Thunder' supercomputer for Lawrence Livermore National Lab. When 'Thunder' was put into production in 2004 it was the 2nd fastest supercomputer in the world, demonstrating the feasibility and power of building large scale computing clusters with industry standard hardware (IA64) and an open source software stack. Following the 'Thunder' project, Hitesh and AB left California Digital to start a company with the goal of bringing the open source software / commodity hardware combination to the commercial enterprise.

In working with early customers, mostly in the energy exploration industry, it became clear that the pain on the storage side of the data center was more acute. The team looked at existing alternatives but recognized from past experience that it would be better and faster to build from scratch without legacy limitations. That is how the original Gluster file system was born.

OStatic: Tell us about Gluster, what it is, and what it's good for.

Gluster Storage Platform is clustered storage. In other words multiple storage building blocks, or nodes, are connected and our software aggregates those resources into a unified pool. Gluster automatically manages tasks like data distribution, I/O scheduling, replication, etc. The key advantage here is scalability; Gluster can manage hundreds of storage nodes and multiple petabytes of capacity. The 'scale-out' approach enables this, eliminating bottlenecks and allowing customers to add resources as they grow. We are also a software-only solution that runs on commodity hardware, a model that drastically lowers cost.

Gluster Storage Platform excels at managing large numbers of files. Files can range from small to very large and the product is flexible enough to support a wide range of application types. Managing large numbers of files is generally referred to as the 'unstructured data explosion' problem. The modular design of Gluster makes it possible to tailor the configuration to a wide range of needs.

OStatic: What type of environments is Gluster being used in, and what kind of workloads is Gluster aimed at?

Gluster offers great flexibility and is therefore well suited for a wide range of applications and uses cases. Generic use cases include: scalable Network Attached Storage (NAS), high performance storage, archive, media delivery, and cloud. We span industries such as online music/video, managed hosting, health care, biotech, energy, and others.

OStatic: What companies are involved in Gluster development, aside from Gluster?

Gluster is the primary developer for the product; however, we do collaborate with other companies, both vendors and customers. Early on we worked closely with the team maintaining the Filesystem in Userspace (FUSE) project, and now one of the lead maintainers is a Gluster employee. We are collaborating with the cloud team at Red Hat to enhance the cloud capabilities of the product. Early on, the team was considering writing their own code from scratch, but we worked together pointing out APIs and features on our roadmap and now we collaborate. Another example is a customer who offers managed hosting services and will be writing a billing module, also under the GPL license, for the product. The HyperTable open source database (C++ implementation of Hadoop) works with us to support Gluster as the back-end scalable file system. We have several customers who have deployed Gluster on Amazon Web Services (AWS) who are working with us to productize solutions for this environment.

OStatic: Can you describe the community model that's being used to develop Gluster?

We have a growing community of over 1,000 registered members whom are very active. The community contributes bug fixes regularly and occasionally develops features/modules that get integrated into the product. We like to point out that as hard as it is to develop a file system, the hardest part is testing and quality assurance. Our community does an outstanding job of stressing the product in a wide range of use cases, frequently in ways we never envisioned. One interesting example is a community member wrote a Python binding module – whether this is useful or not is still an open question, but it highlights the axiom that there are far more ideas from people outside the company than inside. In addition to identifying bugs, our roadmap is heavily influenced by community input.

Another interesting dynamic we are seeing with file system development is our implementation and architecture is building the pool of file systems developers by lowering the barriers to entry. Open source is a big part of this, but the real benefit is our product is written in user space with a modular architecture that simplifies feature development. One no longer needs to be deeply familiar with OS kernel development or file system internals. We have recent college graduates with basic C programming skills making contributions, we have multiple student targeted projects like compression and encryption being offered through the Google Summer of Code program, and we have seen college courses teaching file system development using Gluster as part of the curriculum.

OStatic: What parts of the stack, if any, are not open source?

The entire software stack of Gluster Storage Platform is open source and licensed under GPLv3. Additionally, the product available for free download is identical to the commercial product supported by subscriptions.

OStatic: Tell us a bit about the roadmap -- what's coming, and why?

We will continue to focus on improving the manageability and ease of use through core features as well as services provided by the Gluster Subscription Network Portal such as monitoring and analysis tools; it is impossible to scale cost effectively without simplicity. The industry is rushing to virtualize every corner of the data center, our ability to virtualize storage resources under a global namespace is a key advantage here and we continue to invest in features that optimize our storage for virtual server environments. Cloud computing is another area where customers are trying to sort through the hype for practical solutions. We are taking our experience with cloud storage deployments from our existing customer base to create ready to use cloud solutions for both private and public deployments.



balakrishna korrapati uses OStatic to support Open Source, ask and answer questions and stay informed. What about you?



6 Comments
 

Gluster is 'free software' and *not* open source !


0 Votes

Gluster *is* open source (GPL v3). Get the source code here: http://www.gluster.org/download/gluster-source-code/


0 Votes

Tiffany & Co is well-known for its sterling silver jewelry, as sterling silver jewelry become more adorable and popular,we never stop our exploration in the world of jewelry and return to tiffany. Tiffany 1837 ring, a classical one ,is a two-to-none chioce for the one you really love,when you want to make a propose ,tiffany 1837 ring will get you a good luck! Tiffany Somerset is sparkling items especially for your head!At this time ,our price is uncomparable!you should grasp this chance! The newest style is on sale, wearing tiffany & co necklace,you will find you are the main role in the street!Your life will change with the coming of it! Tiffany Atlas can decorate you hand,neck ears with different taste!You will find yourself more charming!http://www.enjoysilverjewelry.com/.


0 Votes

Welcome to www.manoloblahnikcvs.com, from here you will hunt for any designer shoes which you desire!all


style Manolo Blahnik Boots sale Ever since 2005, when we became the best online dealer of authentic


Giuseppe Zanotti , Gucci,Cheap Manolo?Blahnik Boots , UGG Australia , Yves Saint Lauret , Christian


Louboutin , and were upgraded to be a retailer for these brands,we have been devoted to provide customers


with a large selection with finest quality at lowest price.We strive to build long-term relationship with


each and every one of our customers, we can only accomplish our mission if you’re happy with the products


you buy.So we’ve created a guarantee that takes all the risk out of shopping!All goods shipped are


insured, free and no customs taxes! 1-2 days to pick up and 5-6days to your doors!you can also have a good


view in our ed hardy clothing store.


0 Votes

web file system

http://www.gleamtech.com/products/filevista/web-file-manager


FileVista is a web file manager for storing, managing and sharing files online through your web browser. It is a web based software which you install on your web server to fulfill web file management requirements of your company or organization. This web file manager allows your users to upload, download and organize any type of file with an intuitive user interface.


0 Votes

Really interesting project - and of course that fact that it is open source makes it all the more interesting. I wonder if it has any application in the small-to-medium business environment. With quantity of files being stored nowadays, even in the home environment, I would think it does!


http://www.concertinadoors.net/interior-doors-types


0 Votes
Share Your Comments

If you are a member, to have your comment attributed to you. If you are not yet a member, Join OStatic and help the Open Source community by sharing your thoughts, answering user questions and providing reviews and alternatives for projects.


Promote Open Source Knowledge by sharing your thoughts, listing Alternatives and Answering Questions!