[Fig. 1. Volcano's main modules. From top to bottom, the figure
shows: Execute (driver); Exchange and Choose-Plan; the operator
modules Hash One-to-One Match, Sort One-to-One Match, Sort, Hash
One-to-Many Match, Sort One-to-Many Match, Index Maintenance,
Scans, Functional Join, and Filter; Files & Records, Devices, and
Indices (B+-trees); and Physical I/O, Buffer Manager, and Memory
Manager.]
placement, and reading and writing disk pages, while the
higher level software determines the policies depending
on data semantics, importance, and access patterns. It is
surprising that database buffer managers derive replace-
ment decisions from observed reference behavior in spite
of the fact that this behavior is generated by higher level
database software and thus known and foreseeable in
advance within the same system, albeit in different
subcomponents.
Files are composed of records, clusters, and extents.
Since file operations are invoked very frequently in any
database system, all design decisions in the file module
have been made to provide basic functionality with the
highest attainable performance. A cluster, consisting of
one or more pages, is the unit of I/O and of buffering, as
discussed above. The cluster size is set for each file in-
dividually. Thus, different files on the same device can
have different cluster sizes. Disk space for files is allo-
cated in physically contiguous extents, because extents
allow very fast scanning without seeks and large-chunk
read-ahead and write-behind.
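To make these notions concrete, the per-file clustering and
extent bookkeeping might be declared in C roughly as follows;
all names and fields in this sketch are hypothetical, not
Volcano's actual declarations.

    /* Hypothetical sketch: the cluster size is a per-file property,
       and disk space is allocated in physically contiguous extents. */
    typedef struct Extent {
        long first_page;        /* first physical page on the device */
        int  page_count;        /* contiguous pages in this extent   */
    } Extent;

    typedef struct File {
        int     cluster_size;   /* pages per cluster; the unit of
                                   I/O and buffering for this file   */
        int     extent_count;   /* extents allocated so far          */
        Extent *extents;        /* physically contiguous extents     */
    } File;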
Records are identified by a record identifier (RID) and
can be accessed directly using the RID. For fast access to
a large set of records, Volcano supports not only individ-
ual file and record operations but also scans that support
read-next and append operations. There are two interfaces
to file scans: one is part of the file system and is described
momentarily; the other is part of the query processing
level and is described later. The first one has the standard
procedures for file scans, namely open, next, close, and
rewind. The next procedure returns the main memory ad-
dress of the next record. This address is guaranteed
(pinned) until the next operation is invoked on the scan.
Thus, getting the next record within the same cluster does
not require calling the buffer manager and can be per-
formed very efficiently.
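As an illustration, this first scan interface could be declared
along the following lines in C; the names are hypothetical (they
reuse the File type from the sketch above) and error handling is
omitted.

    /* Hypothetical sketch of the file-system scan interface. */
    typedef struct FileScan FileScan;       /* opaque scan state */

    FileScan *scan_open(File *file);
    void     *scan_next(FileScan *scan);    /* address of next record,
                                               pinned until the next
                                               operation on this scan */
    void      scan_rewind(FileScan *scan);
    void      scan_close(FileScan *scan);

    extern void process(void *record);      /* hypothetical consumer */

    void process_all(File *file)
    {
        FileScan *scan = scan_open(file);
        void *rec;
        while ((rec = scan_next(scan)) != NULL)
            process(rec);      /* rec stays valid until the next call */
        scan_close(scan);
    }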
For fast creation of files, scans support an append op-
eration. It allocates a new record slot and returns the new
slot’s main memory address. It is the caller’s responsibil-
ity to fill the provided record space with useful informa-
tion, i.e., the append routine is entirely oblivious to the
data and their representation.
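A corresponding append call, again with hypothetical names,
would hand the caller a pinned slot to fill:

    /* Hypothetical sketch: append allocates a record slot and
       returns its address; the caller fills in the bytes, so the
       routine never interprets the data. */
    #include <string.h>

    void *scan_append(FileScan *scan, int length);

    void load_record(FileScan *scan, const void *src, int length)
    {
        void *slot = scan_append(scan, length);
        memcpy(slot, src, length);  /* caller supplies the contents */
    }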
Scans also support optional predicates. The predicate
function is called by the next procedure with the predicate argument
and a record address. Selective scans are the first example
of support functions mentioned briefly in the introduction.
Instead of determining a qualification itself, the scan
mechanism relies on a predicate function imported from
a higher level.
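In C terms, such an imported qualification is naturally a
function pointer paired with a typeless argument; the following
hypothetical declarations, building on the earlier scan sketch,
illustrate the convention.

    /* Hypothetical sketch: the scan imports its qualification as a
       support function plus a typeless argument. */
    typedef int (*Predicate)(void *arg, void *record);

    FileScan *scan_open_pred(File *file, Predicate pred, void *arg);
    /* scan_next then returns only records for which
       pred(arg, record) is nonzero. */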
Support functions are passed to an operation as a func-
tion entry point and a typeless pointer that serves as a
predicate argument. Arguments to support functions can
be used in two ways, namely in compiled and interpreted
query execution. In compiled scans, i.e., when the pred-
icate evaluation function is available in machine code,
the argument can be used to pass a constant or a pointer
to several constants to the predicate function. For exam-
ple, if the predicate consists of comparing a record field
with a string, the comparison function is passed as pred-
icate function while the search string is passed as predi-
cate argument. In interpreted scans, i.e., when a general
interpreter is used to evaluate all predicates in a query,
the argument can be used to pass appropriate code to the
interpreter.
The interpreter’s entry point is given as predicate func-
tion. Thus, both interpreted and compiled scans are sup-
ported with a single simple and efficient mechanism. Vol-
cano’s use of support functions and their arguments is
another example of a mechanism that leaves a policy de-
cision, in this case whether to use compiled or interpreted
scans, open to be decided by higher level software.
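For example, a compiled predicate in the spirit of the string
comparison above might look as follows; the record layout, the
field name, and the search string are all hypothetical.

    /* Hypothetical compiled predicate: the comparison function is
       passed as the predicate, the search string as its argument. */
    #include <string.h>

    struct Emp { int id; char name[20]; };

    static int name_equals(void *arg, void *record)
    {
        struct Emp *emp = (struct Emp *) record;
        return strcmp(emp->name, (const char *) arg) == 0;
    }

    static FileScan *open_smith_scan(File *file)
    {
        return scan_open_pred(file, name_equals, "Smith");
    }

An interpreted scan would instead pass the interpreter's entry
point as the predicate function and the predicate's code as the
argument.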
Indices are currently implemented only in the form of
B+-trees with an interface similar to files. A leaf entry
consists of a key and information. The information part
typically is a RID, but it could include more or different
information. The key and the information can be of any
type; a comparison function must be provided to compare
keys. The comparison function uses an argument equiv-
alent to the one described for scan predicates. Permitting
any information in the leaves gives more choices in phys-
ical database design. It is another example of Volcano
providing a mechanism to allow a multitude of designs
and usage policies. B+-trees support scans similar to files,
including predicates and append operations for fast loading.
In addition, B+-tree scans allow seeking to a particular key
and setting lower and upper bounds.
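A key comparison support function and a seek call could be
sketched in C as follows; all names are hypothetical, and the
comparison argument plays the same role as the scan predicate
argument above.

    /* Hypothetical sketch: keys are typeless, so the B+-tree imports
       a comparison function (with its own argument) from above. */
    typedef int (*KeyCompare)(void *arg, const void *key1,
                              const void *key2);

    static int int_compare(void *arg, const void *a, const void *b)
    {
        int x = *(const int *) a, y = *(const int *) b;
        (void) arg;                     /* unused for integer keys */
        return (x > y) - (x < y);       /* <0, 0, >0 */
    }

    typedef struct Btree     Btree;     /* opaque index handle */
    typedef struct BtreeScan BtreeScan;

    /* Seek to a key; the scan can then be bounded and read forward. */
    BtreeScan *btree_seek(Btree *tree, const void *key,
                          KeyCompare cmp, void *cmp_arg);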
For intermediate results in query processing (later called
streams), Volcano uses special devices called virtual de-
vices. The difference between virtual and disk devices is
that data pages of virtual devices only exist in the buffer.
As soon as such data pages are unpinned, they disappear
and their contents are lost. Thus, Volcano uses the same
mechanisms and function calls for permanent and inter-
mediate data sets, easing implementation of new opera-
tors significantly.
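Because the calls are identical, an operator producing an
intermediate result differs from one writing a permanent file
only in the device it uses; in a hypothetical sketch, reusing
the earlier declarations:

    /* Hypothetical sketch: the same file and scan calls work on a
       virtual device, whose pages exist only in the buffer. */
    typedef struct Device Device;       /* opaque device handle */
    Device *device_open_virtual(void);
    File   *file_create(Device *dev, int cluster_size);

    void write_intermediate(void)
    {
        Device   *dev = device_open_virtual();
        File     *tmp = file_create(dev, 1);    /* 1-page clusters */
        FileScan *out = scan_open(tmp);
        /* ... fill via scan_append as for any permanent file ... */
        scan_close(out);
    }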
In summary, much of Volcano’s file system is conven-
tional in its goals but implemented in a flexible, efficient,
and compact way. The file system supports basic abstrac-
tions and operations, namely devices, files, records,
B+-trees, and scans. It provides mechanisms to access
these objects, leaving many policy decisions to higher
level software. High performance was a very important
goal in the design and implementation of these mecha-
nisms since performance studies and parallelization only
make sense if the underlying mechanisms are efficient.
Furthermore, research into implementation and perfor-
mance trade-offs for extensible database systems and new
data models is only relevant if an efficient evaluation plat-
form is used.