GreenPlum main functions and advantages

Software-based and optimized for commercial hardware
Software is easily installed on commercial x86 servers from multiple tier-one vendors and runs on Linux and Solaris.

Linearly scalable performance
The "shared nothing" architecture and parallel query optimizations ensure linearly scalable performance and capacity to cost thousands of nodes and processor cores.

Support for MapReduce
MapReduce has been proven to be a viable large-scale data analysis technology by major Internet service operators such as Google and Yahoo. With Greenplum, businesses can use MapReduce technology internally.

SQL Standards Full support for SQL-92 and SQL-99
through SQL 2003 OLAP control capabilities. All query messages are executed on the entire system in parallel.

Unified Analytical Processing All query and analysis (SQL, MapReduce, R, etc.) operations
can be performed on the same parallel dataflow engine, allowing analysts, developers, and statisticians to use the same infrastructure for data analysis.

Programmable Parallel Analysis Provides more advanced parallel analysis capabilities
for those working in arithmetic and statistics, supporting R, linear algebra, and machine learning capabilities.

In-Database Compression
uses industry-leading compression technology to significantly reduce the space required to store data while improving performance. Customers can reduce the space used by 3-10 times and increase the effective I/O performance.

Petabyte-scale data loading operations A
high-performance parallel data loader can perform operations synchronously on all nodes, with loading speeds exceeding 4.5TB/hour.

Access data anywhere
Regardless of the location, format, or storage medium of the data, query operations can be performed from the database to external data sources, returning data to the database in parallel.

Dynamic Scaling
Helps companies easily expand their data warehouses on a small or large scale while avoiding costly equipment or SMP server upgrades.
The full name of SMP is "Symmetrical Multi-Processing" (Symmetrical Multi-Processing) technology, which refers to a collection of processors (multiple CPUs) on a computer, and the memory subsystem and bus structure are shared among the CPUs.

Workload Management
Allows administrators to create role-based resource queues to divide resources and manage system load.

Centralized Management
Provides cluster-level management tools and resources to help administrators manage the entire Greenplum Database platform as if they were a single server.

Performance Monitoring
Through the graphical performance monitoring function, users can determine the current running situation and historical query information, and track system usage and resource information.

Index Support
Greenplum supports binary search trees, hashes, bitmaps, GiST, and GIN, enabling a variety of indexing capabilities to provide data architects with the tools necessary to implement optimal designs.

Industry Standard Interfaces
Supports standard database interfaces (SQL, ODBC, JDBC, DBI) and interoperates with leading business intelligence and extract/transform/load (ETL) tools on the market

Guess you like

Origin http://10.200.1.11:23101/article/api/json?id=326609099&siteId=291194637