VMware Greenplum 7 is officially released!

In today's rapidly changing business environment, companies continue to seek innovative ways to optimize operations, streamline decision-making, and build unique competitive advantages. The key to achieving these goals is to effectively utilize massive data resources. However, this task is not easy. The volume, complexity, and sources of data are growing explosively, while the technology for extracting value from data is evolving at a rapid pace. 

VMware Greenplum plays a vital role in this area. Greenplum is a unified analytics and artificial intelligence (AI) platform designed to help enterprises make the most of their data resources. Whether it is structured data, semi-structured data or unstructured data, Greenplum can provide a unified platform as the undisputed "single source of truth", and through support for parallel processing of vector data, Greenplum can compete with the latest An ensemble of large language model methods (LLM). 

The power of integration

The core of VMware Greenplum is based on the open source PostgreSQL project, which is unique in that it seamlessly integrates business intelligence (BI) and artificial intelligence (AI) functions on the same platform. This integration of various tools and technologies allows companies to respond to complex challenges efficiently and quickly. At the same time, all operations can be implemented through the SQL database interface that users are familiar with. 

Imagine a business that needs to perform intelligent searches on massive amounts of customer feedback documents and merge that information with detailed customer online transaction processing (OLTP) transaction history. In the past, these tasks required various data silos and disparate tools, but now they can be performed seamlessly within the Greenplum platform. This greatly improves the company's operational efficiency and enhances its responsiveness to customer needs. 

Seamless transition from business intelligence to artificial intelligence

 

A distinctive feature of Greenplum is its ability to unify data analysis and artificial intelligence needs and promote a smooth transition from business intelligence to artificial intelligence applications. This transition can occur at any scale, whether dealing with small data sets or vast petabyte-scale data ecosystems. 

Greenplum's versatility is driven by its ability to adapt to the changing data landscape. As the volume and variety of data continues to increase and new analytics technologies emerge, VMware Greenplum is evolving in step with it. This helps keep businesses at the forefront of data-driven decision-making, constantly uncovering new insights and opportunities. 

VMware Greenplum 7

 

VMware Greenplum 7 demonstrates our commitment to creating and evolving an inherently secure, mature and flexible SQL-based online analytical processing (OLAP) platform. This innovative platform introduces a series of enhancements and new capabilities focused on resource management and complex analytics for all data types, whether structured, semi-structured or unstructured. 

VMware Greenplum 7 also has many important updates in terms of seamless data scalability, multi-workload handling, and deployment flexibility. 

What’s new in VMware Greenplum 7

Here are the powerful new features introduced in VMware Greenplum 7: 

Open Source and PostgreSQL 12 Evolution: VMware Greenplum 7 is built on open source code, leveraging the power, reliability, and flexibility of modern PostgreSQL versions. Compared with the previous version, Greenplum 7 is rooted in PostgreSQL 12 and integrates PostgreSQL releases from the past five years.

Multiple index types:  VMware Greenplum 7 supports multiple index types, including B-tree indexes, hash indexes, bitmap indexes, block range indexes, text indexes, geospatial indexes, and AI vector indexes. This feature optimizes data retrieval and query performance. The Greenplum query optimizer has been continuously improved since 2009, achieving a good performance record in version 6 and expanded in version 7 to provide comprehensive index selection support. 

Enhanced data federation with PXF: The Platform Extensions Framework (PXF) in VMware Greenplum 7 has been improved to enable superior data federation. Enterprises can now query data sets in Amazon Simple Storage Service (S3) object storage, Hadoop Distributed File System (HDFS), and other relational databases via JDBC. It leverages PostgreSQL's foreign data wrapper API to access remote data sources in parallel, providing an abstract data model to manage the security and statistics of remote data to optimize queries.

Enhanced text search: VMware Greenplum 7 extends text search capabilities with support for both lexical search and AI-driven semantic search to provide more accurate search results. Lexical search supports traditional text search based on keywords, and for semantic search, it is powered by artificial intelligence and vector embeddings. 

Upgraded geospatial analysis: VMware Greenplum 7 has upgraded geospatial analysis capabilities by integrating PostGIS version 3. This improvement greatly increases the speed and feature richness of geospatial queries. 

Row-level security permissions:  This feature complements the role-based security model and table-level and column-level permissions already in VMware Greenplum.

Generated columns for enhanced data modeling:  Generated columns were introduced in VMware Greenplum 7, improving data abstraction and modeling, addressing use cases such as security feature-preserving data masking. 

Improved DBA query functionality:  Greenplum 7 has made numerous improvements to DBA query functionality, including UPSERT support, user-defined functions with transactions, and improvements to ALTER TABLE to reduce data rewriting. 

Enhanced semi-structured and unstructured data analysis:  In addition to supporting XML documents, Greenplum 7 also supports semi-structured data processing, such as enhanced JSON and array data processing capabilities. Full-text search and text-based lexical search indexes enable efficient text storage, indexing, and searching. Additionally, vector embeddings enable condensed and efficient representation of unstructured data, allowing similarity searches for matching documents, images, and videos across multiple languages, including multilingual searches. 

PostgreSQL extension ecosystem:  More comprehensive PostgreSQL extension support, such as advanced password checking, fuzzy string matching, Hyperloglog, Ip4r for network data, ISN for media data, nanosecond timestamps, sparse vectors, for pivoting Tablefunc, UUID for unique identifiers, and pg_vector for artificial intelligence vector embedding, are all supported.

Advanced resource management:  Greenplum 7 introduces a series of advanced resource management features. These features ensure robust performance under high load conditions. 

VMware vSphere deployment model:  Greenplum 7 can be deployed in bare metal or public cloud environments by referring to the recommended architecture. You can also use the automatic deployment mode provided in Greenplum 7 version to seamlessly integrate into the vSphere private cloud environment. 

Multi-datacenter disaster recovery solution:  As part of a multi-datacenter disaster recovery solution, data is replicated through transaction log archives, resulting in more efficient and lower recovery point objectives (RPO) and recovery time objectives ( RTO) disaster recovery solution.

New extension PostgresML: Provides new user-defined functions that allow users to use tens of thousands of open source artificial intelligence/machine learning pre-trained models in VMware Greenplum. 

VMware Greenplum Advantages

The many benefits VMware Greenplum brings to the enterprise can be divided into four key areas: flexibility , speed and scale , productivity , and resiliency . 

flexibility

Infrastructure Versatility: VMware Greenplum offers significant deployment flexibility and is compatible with a variety of infrastructure types. It is optimized for bare metal, public cloud and vSphere-based private cloud environments. This means businesses can choose the infrastructure that best suits their needs without sacrificing performance or efficiency. 

Purpose-built optimization: Greenplum provides purpose-built reference architectures to ensure seamless integration into different infrastructure setups and reduce deployment complexity.

speed and scale  

In-database analysis:  Greenplum's in-database analysis capabilities significantly speed up pivot time. This capability means data analysts and scientists can perform complex analyzes directly in the database, without the need for time-consuming data transfers. 

Petabyte-level data processing:  Greenplum can handle massive amounts of data, even petabyte-level data. This ensures businesses can efficiently analyze and manage massive data sets, gaining insights from their largest data repositories. 

Productivity

Data Diversity:  Greenplum excels at managing various types of data on a single platform. It seamlessly handles structured, semi-structured and unstructured data, including text, images, videos, vectors, geospatial information, graphics and speech data. This versatility enables businesses to consolidate data sources and make it easier to analyze it no matter where it is stored. 

Data accessibility:  Greenplum's ability to process and analyze data in a variety of formats from disparate sources reduces the time and effort required to preprocess and integrate data from multiple sources and increases productivity. 

elasticity

Mature foundation:  Greenplum is built on the open source database PostgreSQL, a proven and mature database platform. This improves reliability and stability for mission-critical applications and data workloads. 

Enhanced security:  Greenplum integrates enhanced security features to help enterprises keep their data safe. This includes authentication mechanisms, encryption options, and access controls. 

Enterprise Support:  Greenplum offers robust enterprise-grade support, giving enterprises the assistance they need to manage and optimize their data platform. 

Disaster recovery:  Through functions such as remote disaster recovery, Greenplum provides a data backup and recovery mechanism to minimize downtime and data loss when a disaster occurs. 

With the new release, VMware Greenplum is more than just a platform, it’s a catalyst for transformation. It enables enterprises to realize the full potential of their data assets, improve operational efficiency, accelerate decision-making processes, and ultimately achieve superior customer responsiveness. As data continues to shape the future of enterprises, Greenplum has become a leader in innovation, guiding enterprises from BI to AI and beyond. Embrace the power of unified data analytics and artificial intelligence with Greenplum and propel your business into a future where data is the ultimate competitive advantage!

Get started with VMware Greenplum 7 today:
https://network.pivotal.io/products/vmware-greenplum 

Read how using Greenplum 7 on Samsung's Gen 5 NVMe drives creates a new reference architecture that will have a profound impact on the future of big data, analytics and data warehousing: https://tanzu.vmware.com/content/blog/vmware -greenplum-on-samsung-performance

 

Content source|Public account: VMware China R&D Center

If you have any questions, please scan the official account below to contact us~

​​​​​​​

Lei Jun: The official version of Xiaomi's new operating system ThePaper OS has been packaged. The pop-up window on the lottery page of Gome App insults its founder. Ubuntu 23.10 is officially released. You might as well take advantage of Friday to upgrade! Ubuntu 23.10 release episode: The ISO image was urgently "recalled" due to containing hate speech. A 23-year-old PhD student fixed the 22-year-old "ghost bug" in Firefox. RustDesk remote desktop 1.2.3 was released, enhanced Wayland to support TiDB 7.4 Release: Official Compatible with MySQL 8.0. After unplugging the Logitech USB receiver, the Linux kernel crashed. The master used Scratch to rub the RISC-V simulator and successfully ran the Linux kernel. JetBrains launched Writerside, a tool for creating technical documents.
{{o.name}}
{{m.name}}

Guess you like

Origin my.oschina.net/u/4238514/blog/10120151
Recommended