SAS PROC MIXED 混合线性模型解析

mixed

需积分: 50 48 浏览量更新于2023-05-14 1 收藏 849KB PDF 举报

身份认证购VIP最低享 7 折!

领优惠券(最高得80元）

"SAS PROC MIXED 是 SAS 环境下用于拟合混合线性模型的统计过程，它能够处理具有相关性和非恒定方差的数据。混合模型结合了固定效应和随机效应，适用于复杂数据结构的分析。" 在统计学中，混合模型是一种非常重要的工具，尤其在处理具有层次结构或非独立观测的数据时。SAS 的 PROC MIXED 过程提供了对这种模型的强大支持。混合线性模型扩展了传统的广义线性模型（GLM），不仅考虑了数据的均值，还考虑了其方差和协方差结构，使得模型更加灵活，能够适应多种系统和情境。 PROC MIXED 的主要假设包括： 1. 数据遵循正态分布（高斯分布）：这是线性模型的基础，意味着数据的误差项是服从正态分布的。 2. 数据的均值（期望值）与一组特定参数呈线性关系：这意味着模型可以表达数据的线性趋势。 3. 数据的方差和协方差与另一组参数有关，并且它们展示出 PROC MIXED 提供的结构之一：这允许模型处理不恒定的方差和相关的观测值，比如时间序列数据或者具有嵌套或交叉分类变量的数据。在 PROC MIXED 中，用户可以选择不同的协方差结构来适应数据的特定特性，例如，独立、同方差、自相关、空间依赖等。此外，该过程支持固定效应和随机效应的组合，其中固定效应是研究者感兴趣的参数，而随机效应则用于捕捉未观察到的变异或不确定性。在实际应用中，PROC MIXED 可以用于各种领域，如生物统计学、社会科学、工程学等。例如，在农业试验中，可能需要考虑不同地块之间的差异（随机效应）以及施肥量对作物产量的影响（固定效应）。在医学研究中，可以分析不同治疗组的效果，同时考虑患者个体差异带来的随机性。通过 PROC MIXED，用户可以进行模型选择、参数估计、假设检验以及预测。输出结果通常包括参数估计值、标准误差、显著性水平和置信区间，以及模型拟合度的指标，如残差图和随机效应的分布。 SAS PROC MIXED 是一个强大且灵活的工具，能够处理复杂的混合线性模型，为研究人员提供深入理解和分析数据的能力。它允许用户在处理非独立或有结构方差的数据时，做出统计推断，从而揭示隐藏在数据背后的模式和关系。

资源详情

资源推荐

SAS PROC MIXED 13

identifies the contrast in the table. A label is required for every contrast specified. Labels can be up to 20

characters and must be enclosed in single quotes.

fixed-effect

identifies an effect that appears in the MODEL statement. The keyword INTERCEPT can be used as an

effect when an intercept is fitted in the model. You do not need to include all effects that are in the

MODEL statement.

random-effect

identifies an effect that appears in the RANDOM statement. The first random effect must follow a vertical

bar (|); however, random effects do not have to be specified.

values

are constants that are elements of the L matrix associated with the fixed and random effects.

The rows of L' are specified in order and are separated by commas. The rows of the K' component of L' are

specified on the left side of the vertical bars (|). These rows test the fixed effects and are, therefore, checked for

estimability. The rows of the M' component of L' are specified on the right side of the vertical bars. They test the

random effects, and no estimability checking is necessary.

If PROC MIXED finds the fixed-effects portion of the specified contrast to be nonestimable (see the SINGULAR=

option), then it displays "Non-est" for the contrast entries.

The following CONTRAST statement reproduces the F-test for the effect A in the split-plot example (see Example

41.1):

contrast 'A broad'

A 1 -1 0 A*B .5 .5 -.5 -.5 0 0 ,

A 1 0 -1 A*B .5 .5 0 0 -.5 -.5 / df=6;

Note that no random effects are specified in the preceding contrast; thus, the inference space is broad. The resulting

F-test has two numerator degrees of freedom because L' has two rows. The denominator degrees of freedom is, by

default, the residual degrees of freedom (9), but the DF= option changes the denominator degrees of freedom to 6.

The following CONTRAST statement reproduces the F-test for A when Block and A*Block are considered fixed

effects (the narrow inference space):

contrast 'A narrow'

A 1 -1 0

A*B .5 .5 -.5 -.5 0 0 |

A*Block .25 .25 .25 .25

-.25 -.25 -.25 -.25

0 0 0 0 ,

A 1 0 -1

A*B .5 .5 0 0 -.5 -.5 |

A*Block .25 .25 .25 .25

0 0 0 0

-.25 -.25 -.25 -.25 ;

The preceding contrast does not contain coefficients for B and Block because they cancel out in estimated

differences between levels of A. Coefficients for B and Block are necessary when estimating the mean of one of the

levels of A in the narrow inference space (see Example 41.1

If the elements of L are not specified for an effect that contains a specified effect, then the elements of the specified

effect are automatically "filled in" over the levels of the higher-order effect. This feature is designed to preserve

estimability for cases when there are complex higher-order effects. The coefficients for the higher-order effect are

determined by equitably distributing the coefficients of the lower-level effect as in the construction of least squares

means. In addition, if the intercept is specified, it is distributed over all classification effects that are not contained

by any other specified effect. If an effect is not specified and does not contain any specified effects, then all of its

SAS PROC MIXED 14

coefficients in L are set to 0. You can override this behavior by specifying coefficients for the higher-order effect.

If too many values are specified for an effect, the extra ones are ignored; if too few are specified, the remaining ones

are set to 0. If no random effects are specified, the vertical bar can be omitted; otherwise, it must be present. If a

SUBJECT effect is used in the RANDOM statement, then the coefficients specified for the effects in the RANDOM

statement are equitably distributed across the levels of the SUBJECT effect. You can use the E

option to see exactly

what L matrix is used.

The SUBJECT

and GROUP options in the CONTRAST statement are useful for the case when a SUBJECT= or

GROUP= variable appears in the RANDOM statement, and you want to contrast different subjects or groups. By

default, CONTRAST statement coefficients on random effects are distributed equally across subjects and groups.

PROC MIXED handles missing level combinations of classification variables similarly to the way PROC GLM

does. Both procedures delete fixed-effects parameters corresponding to missing levels in order to preserve

estimability. However, PROC MIXED does not delete missing level combinations for random-effects parameters

because linear combinations of the random-effects parameters are always estimable. These conventions can affect

the way you specify your CONTRAST coefficients.

The CONTRAST statement computes the statistic

and approximates its distribution with an F-distribution. In this expression,

is an estimate of the generalized

inverse of the coefficient matrix in the mixed model equations. See the "Inference and Test Statistics" section for

more information on this F-statistic.

The numerator degrees of freedom in the F-approximation is rank(L), and the denominator degrees of freedom is

taken from the "Tests of Fixed Effects" table and corresponds to the final effect you list in the CONTRAST

statement. You can change the denominator degrees of freedom by using the DF=

option.

You can specify the following options in the CONTRAST statement after a slash (/).

CHISQ

requests that

-tests be performed in addition to any F-tests. A -statistic equals its corresponding F-

statistic times the associate numerator degrees of freedom, and this same degrees of freedom is used to

compute the p-value for the

-test. This p-value will always be less than that for the F-test, as it

effectively corresponds to an F-test with infinite denominator degrees of freedom.

DF=number

specifies the denominator degrees of freedom for the F-test. The default is the denominator degrees of

freedom taken from the "Tests of Fixed Effects" table and corresponds to the final effect you list in the

CONTRAST statement.

requests that the L matrix coefficients for the contrast be displayed. For ODS purposes, the label of this "L

Matrix Coefficients" table is "Coefficients".

SAS PROC MIXED 15

GROUP coeffs

GRP coeffs

sets up random-effect contrasts between different groups when a GROUP=

variable appears in the

RANDOM statement. By default, CONTRAST statement coefficients on random effects are distributed

equally across groups.

SINGULAR=number

tunes the estimability checking. If v is a vector, define ABS(v) to be the absolute value of the element of v

with the largest absolute value. If ABS(K'-K'T) is greater than C*number for any row of K' in the contrast,

then K is declared nonestimable. Here T is the Hermite form matrix (X'X)

X'X, and C is ABS(K') except

when it equals 0, and then C is 1. The value for number must be between 0 and 1; the default is 1E-4.

SUBJECT coeffs

SUB coeffs

sets up random-effect contrasts between different subjects when a SUBJECT=

variable appears on the

RANDOM statement. By default, CONTRAST statement coefficients on random effects are distributed

equally across subjects.

ESTIMATE Statement

ESTIMATE 'label' < fixed-effect values ...>

< | random-effect values ...> , ...< / options > ;

The ESTIMATE statement is exactly like a CONTRAST statement, except only one-row L matrices are permitted.

The actual estimate,

, is displayed along with its approximate standard error. An approximate t-test that

= 0 is also produced.

PROC MIXED selects the degrees of freedom to match those displayed in the "Tests of Fixed Effects" table for the

final effect you list in the ESTIMATE statement. You can modify the degrees of freedom using the DF=

option.

If PROC MIXED finds the fixed-effects portion of the specified estimate to be nonestimable, then it displays "Non-

est" for the estimate entries.

The following examples of ESTIMATE statements compute the mean of the first level of A in the split-plot example

(see Example 41.1

) for various inference spaces.

estimate 'A1 mean narrow' intercept 1

A 1 B .5 .5 A*B .5 .5 |

block .25 .25 .25 .25

A*Block .25 .25 .25 .25

0 0 0 0

0 0 0 0;

estimate 'A1 mean intermed' intercept 1

A 1 B .5 .5 A*B .5 .5 |

Block .25 .25 .25 .25;

estimate 'A1 mean broad' intercept 1

A 1 B .5 .5 A*B .5 .5;

The construction of the L vector for an ESTIMATE statement follows the same rules as listed under the

CONTRAST

statement.

You can specify the following options in the ESTIMATE statement after a slash (/).

ALPHA=number

SAS PROC MIXED 16

requests that a t-type confidence interval be constructed with confidence level 1-number. The value of

number must be between 0 and 1; the default is 0.05.

requests that t-type confidence limits be constructed. The confidence level is 0.95 by default; this can be

changed with the ALPHA=

option.

DF=number

specifies the degrees of freedom for the t-test and confidence limits. The default is the denominator degrees

of freedom taken from the "Tests of Fixed Effects" table and corresponds to the final effect you list in the

ESTIMATE statement.

DIVISOR=number

specifies a value by which to divide all coefficients so that fractional coefficients can be entered as integer

numerators.

requests that the L matrix coefficients be displayed. For ODS purposes, the label of this "L Matrix

Coefficients" table is "Coefficients".

GROUP coeffs

GRP coeffs

sets up random-effect contrasts between different groups when a GROUP=

variable appears in the

RANDOM statement. By default, ESTIMATE statement coefficients on random effects are distributed

equally across groups.

LOWER

LOWERTAILED

requests that the p-value for the t-test be based only on values less than the t-statistic. A two-tailed test is

the default. A lower-tailed confidence limit is also produced if you specify the CL

option.

SINGULAR=number

tunes the estimability checking as documented for the CONTRAST statement

SUBJECT coeffs

SUB coeffs

sets up random-effect contrasts between different subjects when a SUBJECT=

variable appears in the

RANDOM statement. By default, ESTIMATE statement coefficients on random effects are distributed

equally across subjects.

For example, the ESTIMATE statement in the following code from Example 41.5

constructs the difference

between the random slopes of the first two batches.

proc mixed data=rc;

class batch;

model y = month / s;

random int month / type=un sub=batch s;

estimate 'slope b1 - slope b2' | month 1 / subject 1 -1;

run;

UPPER

UPPERTAILED

requests that the p-value for the t-test be based only on values greater than the t-statistic. A two-tailed test is

the default. An upper-tailed confidence limit is also produced if you specify the CL

option.

剩余79页未读，继续阅读

loveseeking

粉丝: 0
资源: 9

SAS PROC MIXED 混合线性模型解析

SAS Mixed Procedure

sas for mixed models

HPmixed：高性能混合效应模型工具箱：测量科学研究所 SAS - 基于 REML 的线性混合效应模型拟合，具有简单的方差协方差结构-matlab开发

广义乘子变换相关的p叶函数新广义积分算子研究-JEMS-2014-EG-B.V.

NoSQL数据仓库建模及模拟 Université Toulouse le Mirail - Toulouse II，2016.

基于光流和纹理分析的人脸攻击检测方法研究-沙特国王大学学报.

销售大数据中分布式环境下的关联规则挖掘与一致性检测的研究及比较 - 2017年SCI文章.

认证协议内射性的语法判据及循环性质验证 - C.J.F.克雷默斯等 - 2005.

Process proc = Runtime.getRuntime().exec("java -jar ../../../../../lib/xxl-job-admin-2.4.0.jar");

am start --display 2 com.tencent.start.tv在这条命令的基础下选择是那个驱动出声音

am start --display 2 com.tencent.start.tv比如这个加上声卡选择

windows系统上用visual studio2022汇编使用循环计算1-2+3-4+5-6+...+99-100，调用Irvine32库，在屏幕上打印结果

编写C程序模拟实现单处理机系统中的进程调度算法，实现对多个进程的调度模拟

2台centos服务器 lvs配置

2台服务器 lvs配置

最新资源