Sparse Canonical Correlation Analysis via Truncated L1-Norm: An Application to Brain Imaging Genetics


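This paper examines sparse canonical correlation analysis (SCCA) in brain imaging genetics, specifically for mining multivariate associations between genetic markers and neuroimaging quantitative traits. SCCA is popular because it identifies multivariate relationships and performs feature selection at the same time. Existing SCCA methods, however, typically rely on the L1-norm or its variants, which can limit the sparsity of the solution and the interpretability of the model. The L1-norm does induce sparsity, but the L0-norm is considered more desirable because it penalizes only the nonzero entries. It has remained unexplored in SCCA because L0-norm minimization is NP-hard, so no polynomial-time algorithm is known for it.

The study, published at the 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), addresses this with the truncated L1-norm. The authors, from the School of Automation at Northwestern Polytechnical University and the Indiana University School of Medicine, combine the L1-norm with a truncation operation to sidestep the computational difficulty of the L0-norm. The approach keeps the model parsimonious while uncovering sparse correlations between genetic and imaging features. They apply it to data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) to demonstrate its feasibility and effectiveness in practice. By reducing the influence of redundant features, the method yields results that are easier to interpret, which matters for understanding the interplay between brain measurements and genetic variation.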

2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Sparse Canonical Correlation Analysis via Truncated $\ell_1$-norm with Application to Brain Imaging Genetics

Lei Du∗, Tuo Zhang∗, Kefei Liu†, Xiaohui Yao†, Jingwen Yan†, Shannon L. Risacher†, Lei Guo∗, Andrew J. Saykin† and Li Shen†§, for the ADNI

∗School of Automation, Northwestern Polytechnical University, Xi'an, China 710072. Email: dulei@nwpu.edu.cn

†Indiana University School of Medicine, Indianapolis, USA 46202

§Corresponding author. Email: shenli@iu.edu

Abstract-Discovering bi-multivariate associations between genetic markers and neuroimaging quantitative traits is a major task in brain imaging genetics. Sparse Canonical Correlation Analysis (SCCA) is a popular technique in this area because it identifies bi-multivariate relationships and performs feature selection at the same time. The existing SCCA methods impose either the $\ell_1$-norm or its variants. The $\ell_0$-norm is more desirable, but it remains unexplored because $\ell_0$-norm minimization is NP-hard. In this paper, we impose the truncated $\ell_1$-norm to improve the performance of the $\ell_1$-norm based SCCA methods. In addition, we propose two efficient optimization algorithms and prove their convergence. The experimental results, compared with two benchmark methods, show that our method identifies better and more meaningful canonical loading patterns in both simulated and real imaging genetic analyses.

Keywords-Sparse Canonical Correlation Analysis, Truncated $\ell_1$-norm, Brain Imaging Genetics

I. INTRODUCTION

Brain imaging genetics has gained increasing attention recently [1], [2]. A major task of imaging genetics is to identify bi-multivariate associations between single nucleotide polymorphisms (SNPs) and imaging quantitative traits (QTs). Sparse canonical correlation analysis (SCCA), which combines bi-multivariate relationship discovery with feature selection, has become a popular technique in imaging genetic studies [3], [4], [5], [6], [7].

Witten et al. [3] introduced the $\ell_1$-norm (Lasso) to enforce sparsity, so that only a small proportion of the features is selected. Since then, many SCCA methods using the $\ell_1$-norm or its variants have been proposed [8]. There are two major concerns regarding them. First, the $\ell_0$-norm, which penalizes only the nonzero features, is the ideal constraint, but it is neither convex nor continuous [9]. Second, the $\ell_1$-norm constraint is not a stable feature selector and thus could incur estimation bias [10].

To overcome the problems above, the truncated $\ell_1$-norm penalty (TLP) [10], [11] has been proposed. The TLP is defined as $J_\tau(|x|) = \min\left(\frac{|x|}{\tau}, 1\right)$, with $\tau$ being a positive tuning parameter. It approximates the $\ell_0$-norm and permits desirable sparsity. In addition, the TLP can be equivalently transformed into a piecewise linear function, and thus is easy to handle.
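The piecewise linear form can be made explicit with a short worked identity. This is one standard way to write it, offered here as an illustration rather than as the exact form the authors use:

$$
J_\tau(|x|) \;=\; \min\!\left(\frac{|x|}{\tau},\, 1\right) \;=\; \frac{|x|}{\tau} \;-\; \max\!\left(\frac{|x|}{\tau} - 1,\; 0\right), \qquad \tau > 0 .
$$

Checking the two cases: if $|x| \le \tau$, the $\max$ term is zero and the right-hand side equals $|x|/\tau$; if $|x| > \tau$, the right-hand side equals $|x|/\tau - (|x|/\tau - 1) = 1$. Both terms on the right are convex and piecewise linear in $x$, so the penalty is a difference of two convex piecewise linear functions.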

In this paper, we propose the TLP based SCCA (TLP-SCCA), which incorporates the TLP into the CCA model. The TLP-SCCA has the following advantages [10]. First, the TLP acts as a tradeoff between the $\ell_0$ and $\ell_1$ functions. This means that it not only improves feature selection, but also can be solved efficiently. Second, it is an adaptive shrinkage method if $\tau$ is tuned appropriately. We propose two effective optimization algorithms, both using the alternating direction method of multipliers (ADMM) technique [12], and they are guaranteed to converge. The experimental results, compared with two popular $\ell_1$-norm based SCCA methods [3], [6], show that both TLP-SCCA algorithms exhibit cleaner canonical loading patterns than the $\ell_1$-SCCA.

II. THE TRUNCATED $\ell_1$-NORM PENALTY

In this paper, a boldface lowercase letter denotes a vector, and a boldface uppercase letter denotes a matrix. $\mathbf{X} \in \mathbb{R}^{n \times p}$ denotes the SNP data, and $\mathbf{Y} \in \mathbb{R}^{n \times q}$ is the QT data.

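The truncated $\ell_1$-norm is defined as follows [13]:

$$
\mathrm{TLP}(\mathbf{u}) = \sum_{i} J_\tau(|u_i|), \quad \text{where } J_\tau(|u_i|) = \min\!\left(\frac{|u_i|}{\tau},\, 1\right). \tag{1}
$$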
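As a minimal sketch of Eq. (1), the penalty can be evaluated in a few lines of NumPy. The function name `tlp` and the example vector are hypothetical and chosen only for illustration; the code assumes the definition above, i.e. a sum of `min(|u_i| / tau, 1)` over all entries.

```python
import numpy as np

def tlp(u, tau):
    """Truncated l1-norm penalty of Eq. (1): sum_i min(|u_i| / tau, 1).

    `tlp` is an illustrative name, not taken from the paper.
    """
    if tau <= 0:
        raise ValueError("tau must be positive")
    u = np.asarray(u, dtype=float)
    # Entries with |u_i| <= tau contribute |u_i| / tau (scaled l1 behaviour);
    # entries with |u_i| > tau contribute exactly 1 (l0-like behaviour).
    return float(np.minimum(np.abs(u) / tau, 1.0).sum())

# Hypothetical loading vector: one zero, one small, two large entries.
u = np.array([0.0, 0.05, -0.2, 1.5])
penalty = tlp(u, tau=0.1)
print(penalty)  # per-entry contributions are 0, 0.5, 1 and 1
```

With `tau = 0.1`, the small entry `0.05` is penalized in proportion to its magnitude, while `-0.2` and `1.5` each cost the same fixed amount regardless of size.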









Figure 1. Visualization of the $\ell_0$-norm and $\ell_1$-norm balls (left and right panels) and the TLP ball for two values of $\tau$ (middle).

The parameter $\tau$ is a threshold. Given an appropriate $\tau$, the TLP balances between the $\ell_0$-norm and the $\ell_1$-norm according to the magnitude of the coefficients. Fig. 1 presents the norm balls of the $\ell_0$-norm, the $\ell_1$-norm, and the TLP with different values of $\tau$.
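The role of $\tau$ as a threshold can be checked numerically. The sketch below is illustrative only: the helper `tlp`, the vector `u`, and the two values of `tau` are assumptions made for the example, not settings from the paper. It shows the two extreme regimes: when `tau` is below every nonzero magnitude the penalty equals the number of nonzero entries, and when `tau` is at least the largest magnitude it equals the $\ell_1$-norm divided by `tau`.

```python
import numpy as np

def tlp(u, tau):
    """Sum of min(|u_i| / tau, 1); illustrative helper, not from the paper."""
    u = np.asarray(u, dtype=float)
    return float(np.minimum(np.abs(u) / tau, 1.0).sum())

u = np.array([0.0, 0.3, -0.8, 2.0])   # hypothetical coefficient vector

l0 = int(np.count_nonzero(u))         # number of nonzero entries
l1 = float(np.abs(u).sum())           # sum of absolute values

# Small tau: every nonzero entry exceeds tau and is truncated to 1,
# so the penalty counts nonzeros, as the l0-norm does.
small_tau = 1e-3
tlp_small = tlp(u, small_tau)

# Large tau (>= max |u_i|): no entry is truncated,
# so the penalty is the l1-norm scaled by 1 / tau.
large_tau = 4.0
tlp_large = tlp(u, large_tau)

print(tlp_small, l0)
print(tlp_large, l1 / large_tau)
```

Values of `tau` between these extremes treat small coefficients like an $\ell_1$ penalty and large coefficients like an $\ell_0$ penalty, which is the balance described above.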
