Greenplum 6.1 Official Documentation (greenplum6.1官方文档.pdf)
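Official Greenplum documentation, the latest edition based on version 6.1, covering installation, configuration, tuning, and more. Greenplum is a company headquartered in California, USA, that provides solutions and consulting services for enterprise data warehouses (EDW), enterprise data clouds (EDC), and business intelligence (BI) to large enterprise users worldwide. Its customers include NASDAQ, the New York Stock Exchange, Skype, FOX, and T-Mobile globally, and, in China, China CITIC Industrial Bank, China Eastern Airlines, Alibaba, Huatai Insurance, COSCO, Li-Ning, and other large enterprises.

MPP systems: Greenplum's architecture is MPP (massively parallel processing). In an MPP system, each SMP node can run its own operating system, database, and so on. In other words, the CPUs inside one node cannot access the memory of another node. Nodes exchange information over the node interconnect network, a process generally called data redistribution. This differs markedly from a traditional SMP architecture. Because an MPP system has to pass information between processing units, it is usually somewhat less efficient than SMP, but this is not absolute: an MPP system shares no resources, so it has more resources available than an SMP system does, and when the number of transactions to be processed reaches a certain ...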
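The shared-nothing placement and data redistribution described above can be illustrated with a small toy model. This is only a sketch under stated assumptions: `NUM_NODES`, `node_for`, `distribute`, and `redistribute` are hypothetical names, and the byte-sum hash is a stand-in chosen for determinism, not the hash Greenplum actually uses.

```python
from collections import defaultdict

NUM_NODES = 4  # hypothetical cluster size, chosen for illustration


def node_for(key, num_nodes=NUM_NODES):
    """Pick the node that owns a key (toy hash, not Greenplum's)."""
    return sum(str(key).encode()) % num_nodes


def distribute(rows, key_index, num_nodes=NUM_NODES):
    """Place each row on exactly one node by hashing one column."""
    nodes = defaultdict(list)
    for row in rows:
        nodes[node_for(row[key_index], num_nodes)].append(row)
    return nodes


def redistribute(nodes, key_index, num_nodes=NUM_NODES):
    """Re-place rows by another column; count rows that change node."""
    new_nodes = defaultdict(list)
    moved = 0
    for node_id, rows in nodes.items():
        for row in rows:
            target = node_for(row[key_index], num_nodes)
            if target != node_id:
                moved += 1  # this row would travel over the interconnect
            new_nodes[target].append(row)
    return new_nodes, moved


# Toy rows: (order_id, customer_id)
orders = [(i, i % 3) for i in range(12)]
by_order = distribute(orders, key_index=0)
by_customer, moved = redistribute(by_order, key_index=1)
print({n: len(r) for n, r in sorted(by_order.items())})
print(moved, "rows changed node during redistribution")
```

After the redistribution step, all rows that share a `customer_id` sit on the same node, so a per-customer aggregate could be computed locally on each node without touching another node's memory.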
PRODUCT DOCUMENTATION
Pivotal™ Greenplum Database®
Version 6.1
Pivotal Greenplum Database
Documentation
Rev: A02
© 2019 Pivotal Software, Inc.
Notice
Copyright
Copyright © 2019 Pivotal Software, Inc. All rights reserved.
Pivotal Software, Inc. believes the information in this publication is accurate as of its publication date. The
information is subject to change without notice. THE INFORMATION IN THIS PUBLICATION IS PROVIDED
"AS IS." PIVOTAL SOFTWARE, INC. ("Pivotal") MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY
KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS
IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Use, copying, and distribution of any Pivotal software described in this publication requires an applicable
software license.
All trademarks used herein are the property of Pivotal or their respective owners.
Revised November 2019 (6.1.0)
Contents
Chapter 3: Pivotal Greenplum 6.1 Release Notes
  Release 6.1.0
    New Features
    Resolved Issues
    Upgrading to Greenplum 6.1.0
  Migrating Data to Greenplum 6
  Known Issues and Limitations
Chapter 4: Installing and Upgrading Greenplum
  Platform Requirements
    Operating Systems
    Hardware and Network
    Storage
    Tools and Extensions Compatibility
    Hadoop Distributions
  Introduction to Greenplum
    The Greenplum Master
    The Segments
    The Interconnect
    ETL Hosts for Data Loading
    Greenplum Performance Monitoring
  Estimating Storage Capacity
    Calculating Usable Disk Capacity
    Calculating User Data Size
    Calculating Space Requirements for Metadata and Logs
  Configuring Your Systems
    Disabling SELinux and Firewall Software
    Recommended OS Parameters Settings
    Synchronizing System Clocks
    Creating the Greenplum Administrative User
    Next Steps
  Installing the Greenplum Database Software
    Installing Greenplum Database
    Enabling Passwordless SSH
    Confirming Your Installation
    About Your Greenplum Database Installation
    Next Steps
  Creating the Data Storage Areas
    Creating Data Storage Areas on the Master and Standby Master Hosts
    Creating Data Storage Areas on Segment Hosts
    Next Steps
  Validating Your Systems
    Validating Network Performance
    Validating Disk I/O and Memory Bandwidth
  Initializing a Greenplum Database System
    Overview
    Initializing Greenplum Database
    Setting Greenplum Environment Variables
    Next Steps
  Installing Optional Extensions
    Procedural Language, Machine Learning, and Geospatial Extensions
    Python Data Science Module Package
    R Data Science Library Package
    Greenplum Platform Extension Framework (PXF)
  Installing Additional Supplied Modules
  Configuring Timezone and Localization Settings
    Configuring the Timezone
    About Locale Support in Greenplum Database
    Character Set Support
    Setting the Character Set
    Character Set Conversion Between Server and Client
  Upgrading from an Earlier Greenplum 6 Release
    Upgrading from 6.x to a Newer 6.x Release
    Troubleshooting a Failed Upgrade
  Migrating Data from Greenplum 4.3 or 5
    Preparing the Greenplum 6 Cluster
    Preparing Greenplum 4.3 and 5 Databases for Backup
    Backing Up and Restoring a Database
    Completing the Migration
  Enabling iptables (Optional)
    Example iptables Rules
  Installation Management Utilities
  Greenplum Environment Variables
    Required Environment Variables
    Optional Environment Variables
  Example Ansible Playbook
Chapter 6: Greenplum Database Administrator Guide
  Greenplum Database Concepts
    About the Greenplum Architecture
    About Management and Monitoring Utilities
    About Concurrency Control in Greenplum Database
    About Parallel Data Loading
    About Redundancy and Failover in Greenplum Database
    About Database Statistics in Greenplum Database
  Managing a Greenplum System
    About the Greenplum Database Release Version Number
    Starting and Stopping Greenplum Database
    Accessing the Database
    Configuring the Greenplum Database System
    Enabling Compression
    Enabling High Availability and Data Consistency Features
    Backing Up and Restoring Databases
    Expanding a Greenplum System
    Migrating Data with gpcopy
    Monitoring a Greenplum System
    Routine System Maintenance Tasks
    Recommended Monitoring and Maintenance Tasks
  Managing Greenplum Database Access
    Configuring Client Authentication
    Managing Roles and Privileges
  Defining Database Objects
    Creating and Managing Databases
    Creating and Managing Tablespaces
    Creating and Managing Schemas
    Creating and Managing Tables
    Choosing the Table Storage Model
    Partitioning Large Tables
    Creating and Using Sequences
    Using Indexes in Greenplum Database
    Creating and Managing Views
  Distribution and Skew
    Local (Co-located) Joins
    Data Skew
    Processing Skew
  Inserting, Updating, and Deleting Data
    About Concurrency Control in Greenplum Database
    Inserting Rows
    Updating Existing Rows
    Deleting Rows
    Working With Transactions
    Global Deadlock Detector
    Vacuuming the Database
    Running Out of Locks
  Querying Data
    About Greenplum Query Processing
    About GPORCA
    Defining Queries
    WITH Queries (Common Table Expressions)
    Using Functions and Operators
    Working with JSON Data
    Working with XML Data
    Using Full Text Search
    Using Greenplum MapReduce
    Query Performance
    Managing Spill Files Generated by Queries
    Query Profiling
  Working with External Data
    Accessing External Data with PXF
    Defining External Tables
    Accessing External Data with Foreign Tables
    Using the Greenplum Parallel File Server (gpfdist)
  Loading and Unloading Data
    Loading Data Using an External Table
    Loading and Writing Non-HDFS Custom Data
    Handling Load Errors
    Loading Data with gpload
    Accessing External Data with PXF
    Transforming External Data with gpfdist and gpload
    Loading Data with COPY
    Running COPY in Single Row Error Isolation Mode
    Optimizing Data Load and Query Performance
    Unloading Data from Greenplum Database
    Formatting Data Files
    Example Custom Data Access Protocol
  Managing Performance
    Defining Database Performance
    Common Causes of Performance Issues
    Greenplum Database Memory Overview