Greenplum 6官方文档:全方位管理系统与性能管理

5星 · 超过95%的资源 需积分: 9 23 下载量 177 浏览量 更新于2024-07-16 1 收藏 89.38MB PDF 举报
GreenPlum 6.0官方文档是一份全面的指南,介绍了Greenplum数据库的配置、管理和使用。Greenplum是一个分布式数据库系统,它在PostgreSQL的基础上扩展了并行处理能力,适用于大数据环境。文档涉及的关键知识点包括: 1. **数据库概念**:阐述了Greenplum的组件和特性,如高可用性、并行数据装载和管理工具,强调了其分布式架构和核心概念。 2. **系统管理**:涵盖了数据库管理员的日常任务,如系统配置、监控和基础维护。 3. **数据安全**:强调了保护数据库访问的重要性,包括网络配置、用户身份验证和数据加密措施。 4. **数据定义与操作**:介绍了数据定义语言(DDL)的应用,包括创建和管理数据库对象,以及处理数据的插入、更新和删除操作,以及并发访问策略。 5. **SQL查询**:提供了在Greenplum中使用标准SQL语句进行查询的数据操作指导。 6. **外部数据访问**:讨论了如何通过外部表和外表连接到存储在非Greenplum系统的数据源,实现数据的读写操作。 7. **数据装载与卸载**:详细说明了数据导入(装载)和导出(卸载)的方法,以及数据文件的格式化。 8. **性能管理**:涉及性能监控和资源调度,如何通过配置来优化工作负载,确保系统的高效运行。 9. **Pivotal Greenplum的区别**:区分了开源版Greenplum与Pivotal商业支持版本的不同,特别是那些仅在后者中提供的特性,如特定的工具、集成和功能限制。 10. **技术支持**:文档特别指出,关于Pivotal Greenplum的特定支持和问题解决建议,只适用于商业用户,而开源版本的Greenplum可能有不同的资源和支持路径。 这份文档对于任何使用或维护Greenplum 6.0系统的人来说,都是一个重要的参考资源,帮助他们了解和优化数据库的各个方面。
2019-12-16 上传
greenplum-db-6.2.1-rhel7-x86_64.rpm Pivotal Greenplum 6.2 Release Notes This document contains pertinent release information about Pivotal Greenplum Database 6.2 releases. For previous versions of the release notes for Greenplum Database, go to Pivotal Greenplum Database Documentation. For information about Greenplum Database end of life, see Pivotal Greenplum Database end of life policy. Pivotal Greenplum 6 software is available for download from the Pivotal Greenplum page on Pivotal Network. Pivotal Greenplum 6 is based on the open source Greenplum Database project code. Important: Pivotal Support does not provide support for open source versions of Greenplum Database. Only Pivotal Greenplum Database is supported by Pivotal Support. Release 6.2.1 Release Date: 2019-12-12 Pivotal Greenplum 6.2.1 is a minor release that includes new features and resolves several issues. New Features Greenplum Database 6.2.1 includes these new features: Greenplum Database supports materialized views. Materialized views are similar to views. A materialized view enables you to save a frequently used or complex query, then access the query results in a SELECT statement as if they were a table. Materialized views persist the query results in a table-like form. Materialized view data cannot be directly updated. To refresh the materialized view data, use the REFRESH MATERIALIZED VIEW command. See Creating and Managing Materialized Views. Note: Known Issues and Limitations describes a limitation of materialized view support in Greenplum 6.2.1. The gpinitsystem utility supports the --ignore-warnings option. The option controls the value returned by gpinitsystem when warnings or an error occurs. If you specify this option, gpinitsystem returns 0 if warnings occurred during system initialization, and returns a non-zero value if a fatal error occurs. If this option is not specified, gpinitsystem returns 1 if initialization completes with warnings, and returns value of 2 or greater if a fatal error occurs. PXF version 5.10.0 is included, which introduces several new and changed features and bug fixes. See PXF Version 5.10.0 below. PXF Version 5.10.0 PXF 5.10.0 includes the following new and changed features: PXF has improved its performance when reading a large number of files from HDFS or an object store. PXF bundles newer tomcat and jackson libraries. The PXF JDBC Connector now supports pushdown of OR and NOT logical filter operators when specified in a JDBC named query or in an external table query filter condition. PXF supports writing Avro-format data to Hadoop and object stores. Refer to Reading and Writing HDFS Avro Data for more information about this feature. PXF is now certified with Hadoop 2.x and 3.1.x and Hive Server 2.x and 3.1, and bundles new and upgraded Hadoop libraries to support these versions. PXF supports Kerberos authentication to Hive Server 2.x and 3.1.x. PXF supports per-server user impersonation configuration. PXF supports concurrent access to multiple Kerberized Hadoop clusters. In previous releases of Greenplum Database, PXF supported accessing a single Hadoop cluster secured with Kerberos, and this Hadoop cluster must have been configured as the default PXF server. PXF introduces a new template file, pxf-site.xml, to specify the Kerberos and impersonation property settings for a Hadoop or JDBC server configuration. Refer to About Kerberos and User Impersonation Configuration (pxf-site.xml) for more information about this file. PXF now supports connecting to Hadoop with a configurable Hadoop user identity. PXF previously supported only proxy access to Hadoop via the gpadmin Greenplum user. PXF version 5.10.0 deprecates the following configuration properties. Note: These property settings continue to work. The PXF_USER_IMPERSONATION, PXF_PRINCIPAL, and PXF_KEYTAB settings in the pxf-env.sh file. You can use the pxf-site.xml file to configure Kerberos and impersonation settings for your new Hadoop server configurations. The pxf.impersonation.jdbc property setting in the jdbc-site.xml file. You can use the pxf.service.user.impersonation property to configure user impersonation for a new JDBC server configuration. Note: If you have previously configured a PXF JDBC server to access Kerberos-secured Hive, you must upgrade the server definition. See Upgrading PXF in Greenplum 6.x for more information. Changed Features Greenplum Database 6.2.1 includes these changed features: Greenplum Stream Server version 1.3.1 is included in the Greenplum distribution. Resolved Issues Pivotal Greenplum 6.2.1 is a minor release that resolves these issues: 29454 - gpstart During Greenplum Database start up, the gpstart utility did not report when a segment instance failed to start. The utility always displayed 0 skipped segment starts. This issue has been resolved. gpstart output was also enhanced to provide additional warnings and summary information about the number of skipped segments. For example: [WARNING]:-********