从数据到洞察：生物信息学的再发现

需积分: 8 18 浏览量更新于2024-08-07 收藏 90KB PDF 举报

"Bioinformatics and discovery: induction beckons again - 学术论文" 本文由John F. Allen撰写，探讨了生物信息学在生命科学研究中的角色，特别是在基因组学、蛋白质组学和微阵列技术产生的海量数据背景下，是否可以通过计算机软件直接从观察到理解，而无需人类的猜测、想象或假设。在介绍部分，作者指出，近年来在生命科学领域，一种观点再次盛行，即存在一条直接从观察到理解的路径。这一路径认为，知识可以不经过人为的猜测和假说，直接从数据中稳定获取。随着信息技术的发展，我们现在可以立即获取大量的数据，个人计算机也能以极快的速度处理和分析这些数据。因此，有人认为，我们有望看到计算机程序从信息片段中推导出意义、相关性和含义，无论是核苷酸序列还是基因表达模式。文章引用了一篇《自然》杂志的社论，该社论讨论了生物学家越来越依赖计算机来完成他们的思考。社论的标题——“人类能否理解生物学现象？”——暗示着计算机可能已经能够替代人类进行复杂的数据解析和理论构建。然而，社论对生物学家持宽容态度，指出尽管计算机在数据分析方面表现出强大能力，但人类的直觉、创新思维和问题解决能力仍然至关重要。在生物信息学中，数据挖掘和机器学习算法是核心工具，它们被用来识别模式、关联和潜在的生物学机制。例如，这些工具可以用于发现基因与疾病之间的关联，或者预测蛋白质的功能。但是，虽然计算机可以快速处理大量数据，找出统计上的显著性，但解释这些发现、建立生物学模型以及验证假设仍然是人类科学家的重要工作。文章可能会进一步讨论，即使有了强大的计算能力，生物信息学仍面临挑战，如过度拟合、数据噪声和生物学复杂性。因此，人类科学家的介入是必要的，他们能够提供生物学背景知识，解读结果，并在必要时提出新的假设。此外，伦理和隐私问题也需要人类的判断来处理，因为生物信息学的应用可能涉及到个人健康信息。 "Bioinformatics and discovery: induction beckons again"这篇文章探讨了生物信息学在转化大量生命科学数据为理解过程中所扮演的角色，同时也强调了人类智慧在这一过程中的不可或缺性。尽管计算机软件提供了前所未有的分析能力，但理解和解释数据背后的生物学含义仍然需要人类的洞察力和创新思维。

Bioinformatics and discovery:

induction beckons again

John F. Allen

With the flood of information from genomics, proteomics,

and microarrays, what we really need now is the computer

software to tell us what it all means. Or do we?

Introduction

In the life sciences, there has recently been a strong

resurgence of the view that there is a direct route from

observation to understanding. By this route, knowledge can

flow securely from data without the human and fallible

intervention of guesswork, imagination or hypothesis. Infor-

mation technology now puts oceans of data at our immediate

disposal, and even the ubiquitous personal computer can

process and analyse these data at huge speed. Surely, the

thinking goes, we can now expect computer programs to

derive significance, relevance and meaning from chunks of

information, be they nucleotide sequences or gene expression

profiles. A Nature editorial,

(1)

for instance, discusses biolo-

gists' increasing reliance on computers to do their thinking for

them. The editorial is rather kind to the biologists. Its titleÐ

``Can biological phenomena be understood by humans?''Ð

provocatively implies that scientific discovery might well be

carried out, instead, by machine. In contrast with this view,

many are convinced that no purely logical process can turn

observation into understanding. We owe this conviction, first

and foremost, to the work of Karl Popper.

(2±4)

Here I argue that

Popper was correct, and outline the way in which I think his

philosophy applies to bioinformatics. I predict that even the

formidable combination of computing power with ease of

access to data cannot a produce a qualitative shift in the way

that we do science: the making of hypotheses remains an

indispensable component in the growth of knowledge.

The problem of induction

``Logical deduction'' is a process by which the truth of a general

statement entails the truth of a particular statement. For

example, if it is true that ``all men are mortal'', then we can

deduce from the statement ``Socrates is a man'' that ``Socrates

is mortal''. The reverse process, a logical route from the

particular to the general, has been called ``logical induction'',

but it has never been clear how this might work. The possibility

of logical induction was dismissed by the Scottish philosopher

David Hume, in the eighteenth century.

(5)

One of Hume's

concerns was the idea of causality Ð how can we know that

``a'' causes ``b'', when all we can say with certainty is that we

have observed that ``b'' follows ``a'' on a number of occasions?

How many times do we have to observe that ``b'' follows ``a'' in

order for us to be sure that ``a'' causes ``b''? Hume's answer is

that we never can be sure. And what are we doing when we

make predictions about future events? For example, why do

we believe that the sun will rise tomorrow? Admittedly, we have

seen it rise many times before, but extrapolation is always

uncertain, and we feel that ``knowledge'' must be more secure

than this. Hume believed that we can never really know that the

sun will rise tomorrow. Our expectation that it will, like our idea

of causality, has, according to Hume, no rational foundation.

Bertrand Russell put the consequences thus: ``It is im-

portant to discover whether there is any answer to Hume. If

not, it follows that there is no intellectual distinction between

sanity and insanity. The lunatic who believes that he is a

poached egg is to be condemned solely on the ground that he

is in a minority, ... or on the ground that the government does

not agree with him''.

(6)

Russell also pointed to the stark

consequences of having no rational basis for the resolution of

conflicting theories. Writing in 1944, Russell put it thus: ``The

growth of unreason throughout the nineteenth century and

what has passed of the twentieth is a natural sequel to Hume's

destruction of empiricism''.

(6)

Induction and verifiability

In the early twentieth century, logical positivists proposed that

there was an answer to Hume, and that there was indeed a

logical route to certain knowledge. This route was ``scientific

method''. Science, and science alone, could tell us whether ``a''

causes ``b'', and allow us to predict when the sun will rise.

According to the philosophy of logical positivism, a general

statement or theory can be arrived at by inductive reasoning.

Positivists also thought that such a theory, if it is verified by

observation or experiment, can be promoted to a ``law''.

Indeed, positivists required that a theory must be verifiable in

order to count as ``scientific''. Verifiability was the criterion of

what is, and is not, science. Thus, in the view of positivists,

104 BioEssays 23.1 BioEssays 23:104±107, ß 2001 John Wiley & Sons, Inc.

Plant Biochemistry, Lund University, Box 117, SE-221 00 Lund,

Sweden. E-mail: john.allen@plantbio.lu.se

Funding agencies: Crafoord Foundation; Swedish Natural Sciences

Research Foundation.

Commentary

下载后可阅读完整内容，剩余3页未读，立即下载

weixin_38689857

粉丝: 8
资源: 888

从数据到洞察：生物信息学的再发现

生物信息学和发现：归纳再次招手

Bioinformatics Data Skills Reproducible and Robust Research with Open azw3

Bioinformatics-Algorithms:Coursera 工作

Bioinformatics-2:基因组测序

Bioinformatics-1：在DNA中查找隐藏的消息

bioinformatics-labs:生物信息学课程的大学作业

Bioinformatics-Algorithms:编程练习 - 生物信息学算法

bioinformatics_algorithms:Coursera 生物信息学算法课程代码

Bioinformatics Learning Tutorial:互动式本科生物学教程-开源

bioinformatics_util：Morrison实验室的常用脚本和工具

最新资源