Apache Software Foundation Delivers Tajo v0.11.0, for Big Data
The Apache Software Foundation (ASF), the all-volunteer steward organization of so many open source projects. has announced the availability of Apache Tajo v0.11.0, an advanced open source data warehousing system in Apache Hadoop. It's one of several new projects from Apache focused on the Big Data space.
Apache claims that Tajo provides the ability to rapidly extract more intelligence fro Hadoop deployments, third party databases, and commercial business intelligence tools. Tajo 0.11.0 has numerous new features and improvements that include:
- ORC file support
- Improved ResultSet fetch performance of JDBC and TajoClient
- Tablespace support (similar to those of RDBMS)
- JDBC storage support and projection/filter push down, enabling Tajo to efficiently process RDBMS tables without ETL
- Multi-query support
- Python UDF/UDFA support
Additionally, Tajo 0.11.0 includes improved join optimization, better query response, and nearly 350 bug fixes.
"Tajo 0.11.0 representsa very important milestone. It introduced critical features and functions that let us build out a modern data warehouse system," said Hyunsik Choi, Vice President of Apache Tajo and Research Director of Gruter Inc. "Especially, query federation and tablespace features will help enterprise users easily integrate existing databases and NoSQL stores with Tajo, a Hadoop-based analytical system."
"Congratulations on Apache Tajo 0.11 release. As a leading online and mobile developer of the popular MMORPG game 'TERA', we process tons of log data to analyze various usage patterns," said Sung Min Ahn, head of Platform Development Department at Bluehole. "We adopted Tajo in our game log analytics system for large scale data processing and Tajo made our works simple through its direct JSON data format support. Its ANSI SQL support also helped our data analysts to crunch the big data with no learning curve."
"After we adopted Apache Tajo to replace old Hive setup, the performance and stability of Tajo led us to expand its use to more mission-critical analysis workloads which we had depended on the commercial DW solution before. Tajo helps us to maintain our DW system in a more cost-efficient way," said Byunghwa Yun, R&D Planning Team at Loen Entertainment. "We are interested in the enhanced HBase tablespace support in Tajo 0.11. Data integration between various storages would be way easier."
Apache Tajo is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with the project, visit http://tajo.apache.org/