探索广义线性混合模型：从线性模型到非线性预测

下载需积分: 9 | PDF格式 | 154KB | 更新于2024-07-23 | 190 浏览量 | 举报

"这篇文档是关于广义线性混合模型（GLMM）的介绍，主要由Stephen D. Kachman撰写，他来自University of Nebraska-Lincoln的生物计量学部门。文档探讨了如何构建和修改广义线性混合模型，以及这种模型在处理线性混合模型假设不成立时的优势。GLMM近年来受到了广泛关注，特别是在SAS的GLIMMIX宏程序出现后，使得更多用户能够使用。然而，在常规育种评估中，现有的软件包效率不高，需要使用FORTRAN或C语言进行专门实现，并且GLMM在处理某些问题时带来了额外的挑战。" **广义线性模型（GLM）基础** 广义线性模型是线性模型的一种扩展，它允许响应变量遵循非正态分布，并且可以具有非恒定的方差。在GLM中，响应变量\( Y \)通过一个链接函数\( g \)与线性预测值\( \eta \)关联，即\( g(E(Y)) = \eta \)，其中\( E(Y) \)是响应变量的期望值，\( \eta = X\beta \)是线性组合，\( X \)是设计矩阵，\( \beta \)是参数向量。 **线性混合模型（LMM）与GLMM的对比** 线性混合模型适用于分析具有随机效应的数据，如遗传评估中的家族数据。LMM假设响应变量服从正态分布且方差恒定。然而，实际应用中，这些假设可能不成立，比如在农业、生物统计学和医学研究等领域，响应变量可能是二项式、泊松、负二项式等分布，或者方差随均值变化。 **GLMM的核心概念** GLMM结合了线性混合模型和广义线性模型的特点，它允许随机效应的存在，并且可以处理非正态响应变量。随机效应通常涉及个体间的差异，例如个体的遗传效应或环境效应。链接函数在GLMM中仍然存在，但其作用是在非正态分布上操作，以适应不同类型的响应变量。 **GLIMMIX宏的出现** SAS的GLIMMIX宏提供了一个方便的工具，使得用户能够处理更复杂的模型，包括GLMM。这使得GLMM的应用范围扩大，但同时也强调了对专业编程技能的需求，因为现成的软件包在处理大型数据集和复杂模型时效率低下。 **GLMM的实施与挑战** 实现GLMM需要对算法有深入理解，包括最大似然估计、贝叶斯方法等。GLMM的计算需求较高，尤其是处理大型数据集和多层结构时。此外，还需要考虑模型的识别性、模型的诊断和验证等问题。 **结论** 广义线性混合模型是统计分析的一个强大工具，尤其是在生物学、农业科学和遗传学等领域。尽管GLMM提供了更灵活的建模框架，但其实施和计算上的挑战意味着需要专业的统计知识和编程能力。随着技术的发展，预计未来将有更多的工具和软件包支持GLMM的高效分析。

Fixed

Effects

h(η )

Inverse

Link

Linear

Predictor

Xβ +Zu

Components

Variance

Random

Effects

Mean

Observations

Figure 1: Symbolic representation of a generalized linear mixed model.

3 A Generalized Linear Mixed Model

In this section I will present a formulation of a generalized linear mixed model. It

differs from presentations such as [1] in that it focuses more on the inverse link function

rather than the link function to model the relationship between the linear predictor and

the conditional mean. Generalized linear mixed models also includes the nonlinear mixed

models of [4].

Figure 1 provides a symbolic representation of a generalized linear mixed model. As

in a linear mixed model, a generalized linear mixed model includes ﬁxed effects, β, with

(e.g., management effect); random effects, u ∼ N(0, G), (e.g., breeding values); design

matrices X and Z; and a vector of observations, y, (e.g., pregnancy status) for which

the conditional distribution given the random effects has mean, µ (e.g., mean pregnancy

rate), and covariance matrix, R, (e.g., variance of pregnancy status is µ(1 − µ)).

In addition, a generalized linear mixed model includes a linear predictor, η, and a link

and/or inverse link function. In addition, the conditional mean, µ, depends on the linear

predictor through an inverse link function, h(·), and the covariance matrix, R, depends on

µ through a variance function.

For example, the mean pregnancy rate, µ, depends on the effect of management

and the breeding value of the animal. The management effect and breeding value act

剩余14页未读，继续阅读

qq_16581121

粉丝: 1

探索广义线性混合模型：从线性模型到非线性预测

广义线性模型经典教材（英文版）

广义线性模型SAS程序

广义线性模型估计 Matlab（与 R 中相同）：具有与 R 中相同结果的广义线性模型估计。-matlab开发

平方损失下的在线/批量广义线性模型：使用平方损失下的广义线性模型进行在线（竞争）/批量预测-matlab开发

使用具有因子的广义线性模型 (GLM) 的通用函数：使用具有因子的广义线性模型 (GLM) 的通用函数-matlab开发

回归模型 线性及广义线性模型

逻辑与广义线性模型

广义线性模型.xmind

广义线性模型spss

python广义线性模型

最新资源

回归模型线性及广义线性模型