使用GPT-4改进癌症表型识别：一项性能比较研究

版权申诉

140 浏览量更新于2024-06-13 1 收藏 2.91MB PDF 举报

"这篇文档是关于使用GPT-4模型在识别电子健康记录中的癌症表型方面的研究。对比了GPT-4与GPT-3.5-turbo、Flan-T5以及spaCy的规则基础和机器学习基础方法的性能，探讨了大语言模型在医疗数据分析中的应用潜力。" 在当前的医疗领域，电子健康记录（EHRs）已成为存储和分析患者健康信息的重要工具。随着人工智能技术的发展，尤其是自然语言处理（NLP）的进步，这些数据可以被深入挖掘以识别复杂的疾病表型。GPT-4（Generative Pre-trained Transformer 4）是OpenAI推出的一种先进的大语言模型，它在理解和生成自然语言方面具有极高的能力。该研究由华盛顿大学的研究团队进行，他们比较了GPT-4、GPT-3.5-turbo、Flan-T5以及spaCy的两种方法（规则基础和基于机器学习）在识别癌症表型时的性能。GPT系列模型是基于Transformer架构的预训练模型，它们通过大量文本数据的训练，能够理解和生成连贯的自然语言文本。Flan-T5则是Google开发的一个大型语言模型，它在多种任务上展示了强大的泛化能力。而spaCy则是一个流行的Python库，用于处理和理解自然语言，它提供了规则基础和机器学习的NLP工具。研究中，作者可能对EHR数据进行了预处理，提取关键信息，并使用这些模型进行癌症表型的分类或识别。表型识别是医疗研究中的重要环节，它可以帮助医生更好地理解疾病的进展和患者反应，从而制定更有效的治疗策略。通过对不同模型的比较，研究人员旨在确定哪种方法在处理医疗文本时表现最佳，这对于未来在临床决策支持系统中的应用具有重要意义。关键词：生成式预训练变换器（GPT）、自然语言处理、大型语言模型、临床数据。这个研究突出了大语言模型在医疗领域的潜力，特别是对于从非结构化的EHR数据中提取有价值信息的能力。通过评估和比较不同的模型，研究者不仅能够评估当前技术的状态，还可能为未来的医疗NLP研究提供方向，进一步推动精准医学的发展。

organs affected by cancer recurrence. These phenotypes are important for informing treatment

decisions and assessing disease progression in non-small cell lung cancer patients.

We built the model framework using a clinical text dataset from Washington University

in St. Louis, Missouri, for a patient population diagnosed with non-small cell lung cancer. To

evaluate the effectiveness of GPT-4, we compared its results against 2 subject matter experts’

manual annotation. We also conducted a comparative analysis with GPT-3.5-turbo

, Flan-T5

(Flan-T5-xl, Flan-T5-xxl), and spaCy (medspaCy, scispaCy), currently frequently used rule-

based and machine learning approaches in clinical phenotype extraction. While Flan-T5 models

are LLMs, spacy models are two recent and hybrid frameworks that utilize statistical and

machine-learning methods in conjunction with rule-based NLP to identify clinical phenotypes.

We selected these baseline models based on their inherent capacity for rapid extraction, and their

ability to generate results without requiring training or additional fine-tuning.

Our comparison between scispaCy, medspaCy, Flan-T5-xl, Flan-T5-xxl, GPT-3.5-turbo

and GPT-4 aims to highlight the strengths and weaknesses of each approach for cancer

phenotype extraction from unstructured clinical text, providing valuable insights into their

effectiveness and potential use for cases in cancer phenotype extraction from EHR. In evaluating

these current approaches for phenotype extraction, we also note their limitations.

MATERIALS AND METHODS

To extract a detailed representation of specific lung cancer phenotypes, we used GPT-4,

available through Microsoft’s Azure OpenAI Service. We compared and evaluated the

performance of the current models by comparing true positives (recall) and false positives at the

patient-level. The following subsections discuss the datasets, annotation methods, and

methodologies used for extracted information, baseline comparison techniques, and evaluation

The copyright holder for this preprint (whichthis version posted April 6, 2024. ; https://doi.org/10.1101/2023.09.27.559788doi: bioRxiv preprint

剩余25页未读，继续阅读

百态老人

粉丝: 1w+
资源: 2万+

使用GPT-4改进癌症表型识别：一项性能比较研究

GPT-4.pdf

GPT-4 Guide.pdf

微软 GPT-4 报告 人工通用智能的星星之火GPT-4的早期实验.pdf

GPT-4和GPT-3客户端 g4f.webview.0.2.8.0.exe

海外ChatGPT、GPT-4如何赋能应用.pdf

微软研究院：人工通用智能的星星之火-GPT-4的早期实验.pdf

GPT-2 进行文本生成.pdf

人工智能行业点评：OpenAI访问限流，GPT-4算力测算.pdf

AI：GPT-4有什么不同.zip

Multimodal-GPT-add-baize.zip

最新资源

微软 GPT-4 报告人工通用智能的星星之火GPT-4的早期实验.pdf