PAML 4.3版软件教程：最大似然法 phylogenetic analysis

版权申诉

102 浏览量更新于2024-07-07 收藏 363KB PDF 举报

身份认证购VIP最低享 7 折!

领优惠券(最高得80元）

资源详情

资源推荐

P A M L M A N U A L 8

1. Go to the PAML web site http://abacus.gene.ucl.ac.uk/software/paml.html and download the

lat

2. est archive and save on your hard disk. Unpack it using gzip, with a command like the

following (replace the version numbers and use the correct name for the archive file)

gzip –d paml4.tar.gz

tar xf paml4.tar

3. You can use ls to look at the files in the folder. Delete the Windows executables (.exe files) in the bin folder.

Then cd to the src/ folder to compile using make.

ls -lF bin (this should list the .exe files in the bin folder)

rm –r bin/*.exe

cd src

make

ls -lF

rm *.o

mv baseml basemlg codeml pamp evolver yn00 chi2 ../bin

cd ..

bin/codeml

4. Those commands compile the programs and generate executables called baseml, basemlg,

codeml, pamp, evolver, yn00, and chi2, which you can see with the ls command. Then

remove (rm) the intermediate object files *.o, and move (mv) the compiled executables into

bin/ folder in the PAML main folder (that is, ../bin from paml/src/). Then cd to the PAML

main folder and run codeml, using the default control file codeml.ctl. You can then print

out a copy of codeml.ctl and look at it (and the main result file mlc).

If the compilation (the make command) is unsuccessful, you might have to open and edit the file

Makefile before issuing the make command. For example, you can change cc to gcc and -fast to -

O3 or -O4. If that none of these works, look at the file readme.txt in the src/ folder for compiling

instructions. You can copy the compiling commands onto the command line. For example

cc –o baseml baseml.c tools.c –lm

cc –o codeml codeml.c tools.c -lm

would compile baseml and codeml using the C compiler cc. However, in this case code

optimization is not turned on. You should use compiler switches to optimize the code, say,

cc –o codeml –O3 codeml.c tools.c -lm

Finally, if your current folder is not on your search path, you will have to add ./ in front of the

executable file name even if the executable is in your current working folder; that is, use ./codeml

instead of codeml to run codeml.

Mac OS X

Since Mac OSX is UNIX, you should follow the instructions for UNIX above. Open a command

terminal (Applications-Utilities-Terminal) and then compile and run the programs from the terminal.

You cd to the paml/src/ folder and look at the readme.txt or Makefile files. See above. If you type

commands gcc or make and get a "Command not found" error, you will have to download the Apple

Developer ’s Toolkit at the Apple web site http://developer.apple.com/tools/. There are some notes

about running programs on MAC OS X or UNIX at the FAQ page.

I have stopped distributing executables for old MACs running OS 9 or earlier.

P A M L M A N U A L 9

Running a program

As indicated above, you run a program by typing its name from the command line. You should

know which folder your sequence file, tree file, and control file are, relative to your working folder.

If inexperienced, you may copy the executables to the folder containing your data files. Depending

on the model used, codeml may need a data file such as grantham.dat , dayhoff.dat ,

jones.dat

wag.dat

, m

tREV24.dat

, or

mtmam.dat

, so you should copy these files as well.

The programs produce result files, with names such as

rub

lnf

rst

, or

rates

. You should

not use these names for your own files as otherwise they will be overwritten.

Example data sets

The examples/ folder contains many example data sets. They were used in the original papers to

test the new methods, and I included them so that you could duplicate our results in the papers.

Sequence alignments, control files, and detailed readme files are included. They are intended to

help you get familiar with the input data formats and with interpretation of the results, and also to

help you discover bugs in the program. If you are interested in a particular analysis, get a copy of

the paper that described the method and analyze the example dataset to duplicate the published

results. This is particularly important because the manual, as it is written, describes the meanings of

the control variables used by the programs but does not clearly explain how to set up the control file

to conduct a particular analysis.

examples/HIVNSsites/: This folder contains example data files for the HIV-1 env V3 region

analyzed in Yang et al. (2000b). The data set is for demonstrating the NSsites models

described in that paper, that is, models of variable ω ratios among amino acid sites. Those

models are called the “random-sites ” models by Yang & Swanson (2002) since a priori we

do not know which sites might be highly conserved and which under positive selection.

They are also known as “fishing-expedition ” models. The included data set is the 10th data

set analyzed by Yang et al. (2000b) and the results are in table 12 of that paper. Look at the

readme file in that folder.

examples/lysin/: This folder contains the sperm lysin genes from 25 abalone species

analyzed by Yang, Swanson & Vacquier (2000a) and Yang and Swanson (2002). The data

set is for demonstrating both the “random-sites ” models (as in Yang, Swanson & Vacquier

(2000a)) and the “fixed-sites ” models (as in (Yang and Swanson 2002)). In the latter paper,

we used structural information to partition amino acid sites in the lysin into the “buried ” and

“exposed” classes and assigned and estimated different ω ratios for the two partitions. The

hypothesis is that the sites exposed on the surface are likely to be under positive selection.

Look at the readme file in that folder.

examples/lysozyme/: This folder contains the primate lysozyme c genes of Messier and

Stewart (1997), re-analyzed by Yang (1998). This is for demonstrating codon models that

assign different ω ratios for different branches in the tree, useful for testing positive

selection along lineages. Those models are sometimes called branch models or branch-

specific models. Both the “large ” and the “small ” data sets in Yang (1998) are included.

Those models require the user to label branches in the tree, and the readme file and included

tree file explain the format in great detail. See also the section “Tree file and

representations of tree topology ” later about specifying branch/node labels.

剩余49页未读，继续阅读

hyh15959933972

粉丝: 0
资源: 8万+

PAML 4.3版软件教程：最大似然法 phylogenetic analysis

PAML中文文档/计算分子进化

004.Python爬虫系列-web请求全过程剖析(重点)

程度副词.txt

JEP143C.pdf

工业相机CameraLink v2.0协议文档

时序预测-基于卷积神经网络CNN的数据时间序列预测Matlab程序 单变量

# openssl适用Android以及OpenHarmony的支持32位以及64位arm64-v8a armeabi-v7a

基于springboot房产销售系统设计与实现.docx

stopwords.txt

根据不同的地区和个人口味，月饼的制作方法也有很大的区别 下面提供几种经典月饼的制作步骤，包括广式月饼、苏式月饼和冰皮月饼

iotdb-1.3.2

简道云零代码新动能-企业零代码数字化创新实践案例集2.02022300页.pdf

超市商品库存信息管理系统-大一C语言课设

这是 MATLAB 中基于 OFDM 的发射机的实现

计算机网络习题以及答案3.docx

2000-2021年各省农业碳排放测算数据（计算过程+结果）（全新整理）

c语言数字图像处理-c-image-processing.zip

SNIA-SSS-PTS-2.0.2

轨道交通用钢轨铣磨车，全球前6强生产商排名及市场份额（by QYResearch）.docx

2000-2020年各省资本存量计算（永续盘存法，数据+程序+过程）(全新整理)

最新资源

时序预测-基于卷积神经网络CNN的数据时间序列预测Matlab程序单变量

根据不同的地区和个人口味，月饼的制作方法也有很大的区别下面提供几种经典月饼的制作步骤，包括广式月饼、苏式月饼和冰皮月饼