ssgsea富集分析代码

SSGSEA（Single-sample Gene Set Enrichment Analysis）是一种基于基因集富集分析的方法，可以对单个样本进行基因表达谱的分析。以下是一个Python实现的SSGSEA富集分析代码示例： ```python import numpy as np from scipy.stats import norm def ssgsea(gene_exp, gene_sets, nperm=1000, weighted_score_type=1, permutation=True, min_size=1, max_size=5000, verbose=False, seed=None): """ :param gene_exp: array-like, shape (n_samples, n_features) Gene expression matrix (rows are samples and columns are features). :param gene_sets: dict Gene sets in the format of dictionary. Keys are pathway names and values are gene lists. :param nperm: int, optional The number of permutations for calculating the p-value. Default is 1000. :param weighted_score_type: int, optional The weighting score type. Default is 1. :param permutation: bool, optional Whether to do permutation for calculating the p-value. Default is True. :param min_size: int, optional The minimum number of genes in a gene set to be considered. Default is 1. :param max_size: int, optional The maximum number of genes in a gene set to be considered. Default is 5000. :param verbose: bool, optional Whether to print the progress. Default is False. :param seed: int, optional The seed for the random number generator. Default is None. :return: dict A dictionary of pathway names and enrichment scores. """ # Initialize the random number generator if seed is not None: np.random.seed(seed) # Prepare the gene expression matrix gene_exp = np.array(gene_exp) # Prepare the gene set list gene_sets = {k: v for k, v in gene_sets.items() if min_size <= len(v) <= max_size} # Compute the gene set scores pathways = {} for pathway, genes in gene_sets.items(): # Compute the gene set score for each sample gss = [] for i in range(gene_exp.shape[0]): # Get the gene expression values for the pathway genes pathway_exp = gene_exp[i, np.isin(gene_exp.columns, genes)] # Compute the gene set score if weighted_score_type == 0: gss.append(pathway_exp.sum()) elif weighted_score_type == 1: gss.append(pathway_exp.mean()) elif weighted_score_type == -1: gss.append(pathway_exp.abs().mean()) # Compute the enrichment score and p-value if permutation: null_gss = [] for i in range(nperm): # Shuffle the gene expression values shuffle_exp = gene_exp.apply(np.random.permutation, axis=1) # Compute the gene set score for each sample null_gss.append(shuffle_exp.apply(lambda x: x[np.isin(gene_exp.columns, genes)].mean(), axis=1)) null_gss = pd.concat(null_gss, axis=1) null_es = null_gss.apply(lambda x: (x > gss).mean() - (x < gss).mean()) es = (gss - null_es.mean()) / null_es.std() pval = (null_es < gss).mean() else: es = (gss - gss.mean()) / gss.std() pval = 1 - norm.cdf(es) pathways[pathway] = {'es': es, 'pval': pval} if verbose: print('%s: ES = %.3f, p-value = %.3f' % (pathway, es, pval)) return pathways ``` 该代码使用了NumPy和SciPy库进行计算。在使用时，需要将基因表达谱和基因集传递给`ssgsea`函数。此外，还可以设置其他参数，例如是否进行置换和置换次数等。函数返回一个包含富集分析结果的字典。

阅读全文

ssgsea富集分析代码

相关推荐

ssGSEA分析需要的28种免疫细胞信息

GSEApy:Python中的基因集富集分析

8种免疫细胞的参考基因集

单样本基因组富集分析(ssgsea)

基于大数据方法分析丹参酮IIA对乳腺癌免疫细胞浸润模式的影响.pdf

GSEApy: Python实现基因集富集分析的全面指南

R语言ssGSEA代码

gsva ssgsea 分析的异同

ssgsea基因集合打分

ssgsea 28免疫细胞

R语言ssGSEA包原理

ssGSEA评分与PCA评分差异

有没有其他类似于ssGSEA的算法

在执行gsva函数时，如何正确地将数据矩阵dat作为expr参数传递，并使用'kcdf'参数和'ssgsea'方法计算gene_set的相关值？

S变换+Sockwell R G , Mansinha L , Lowe R P . Localization of the complex spectrum: the S transformJ

大家在看

北航计算机组成 P5测试数据参考.zip

用L-Edit画PMOS版图的步骤-CMOS反相器版图设计

JESD47I中文版.docx

华为组播PIM-SM过程总结

HCNP-WLAN-CEWA(H12-321)题库.pdf

最新推荐

S变换+Sockwell R G , Mansinha L , Lowe R P . Localization of the complex spectrum: the S transformJ

2021科大讯飞车辆贷违预测大赛冠军源码+全部资料.zip

AI图像处理工具包-一键抠图、背景切换、旧照片修复、人像漫画化、视频卡通化（Python+OpenCV+Dlib+TensorFlow）.zip

基于java+springboot+vue+mysql的远程教育网站设计与实现.docx

springboot005学生心理咨询评估系统(源码+数据库+论文+PPT+包调试+一对一指导)

HTML挑战：30天技术学习之旅

【CodeBlocks精通指南】：一步到位安装wxWidgets库（新手必备）

andorid studio 配置ERROR: Cause: unable to find valid certification path to requested target

VC++实现文件顺序读写操作的技巧与实践

【大数据时代必备：Hadoop框架深度解析】：掌握核心组件，开启数据科学之旅