Apache Cloudberry 2.1.0 Released: PostgreSQL-Based MPP Database for Analytics & AI Workloads

Posted on 2026-04-19 by Apache Cloudberry

The Apache Cloudberry (Incubating) community is pleased to announce the release of Apache Cloudberry 2.1.0, the latest version of its massively parallel processing (MPP) database designed for large-scale analytics and AI workloads.

Following the 2.0.0 release, which marked the project’s first official release after entering the Apache Incubator, version 2.1.0 continues to improve the database kernel, execution engine, and surrounding ecosystem components.

Key Enhancements in 2.1.0

This release introduces several improvements across query execution, storage, and ecosystem integration:

  • Enhanced distributed execution: A new UDP2 interconnect protocol makes communication between segments more efficient, improving performance and scalability for distributed queries.
  • Optimizer improvements: Updates to the ORCA optimizer include CTE pruning, partial aggregate pushdown, and multiple correctness and memory fixes.
  • Storage and performance optimizations: Enhancements to the PAX storage format include LZ4 compression support and improvements to I/O and memory management.
  • Faster statistics collection: A new fast ANALYZE capability for append-optimized tables addresses a common operational bottleneck in large-scale environments.
  • Improved integration with modern data workflows: A new MCP (Model Context Protocol) server makes it easier to integrate with LLM-based tools and AI-driven workflows.

Apache Cloudberry 2.1.0 also includes updates to its ecosystem components:

  • Cloudberry PXF: Modernized codebase, updated dependencies, and alignment with Apache project standards.
  • Cloudberry Backup: Consolidated backup tooling with improved maintainability, including integration of the S3 plugin into the main repository.

The project is actively upgrading its PostgreSQL kernel (from PG 14.x to PG 16.x) and pursuing deeper integration with the PostgreSQL ecosystem.

Download

Apache Cloudberry 2.1.0 is available for download at: https://cloudberry.apache.org/releases.

Get Involved

The community welcomes contributors and users interested in distributed PostgreSQL-based analytics systems.

  • Website: https://cloudberry.apache.org/
  • GitHub: https://github.com/apache/cloudberry
  • Issue tracker: https://github.com/apache/cloudberry/issues
  • Mailing list: https://lists.apache.org/list.html?dev@cloudberry.apache.org

About Apache Cloudberry

Apache Cloudberry (Incubating) is an open-source Massively Parallel Processing (MPP) database for large-scale data analytics, derived from PostgreSQL and the last open-source version of Greenplum Database. It is designed to support both on-premises and cloud deployments, providing a scalable foundation for data warehousing and advanced analytics. The project is currently in incubation at the Apache Software Foundation.