R语言 rphast包 phyloFit()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-9-28 00:02:01

phyloFit(rphast)
phyloFit()所属R语言包：rphast

                                    Fit a Phylogenetic model to an alignment...
                                       装一个系统发育模型对齐...

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

Fit a Phylogenetic model to an alignment
装一个系统发育模型比对

用法----------Usage----------

no.opt=c("backgd"),
init.backgd.from.data=ifelse(is.null(init.mod), TRUE, FALSE),
features=NULL, scale.only=FALSE,
scale.subtree=NULL, nrates=1, alpha=1, rate.constants=NULL,
selection=NULL, init.random=FALSE, init.parsimony=FALSE,
clock=FALSE, EM=FALSE, precision="HIGH", ninf.sites=50, quiet=FALSE,

参数----------Arguments----------

参数：msa
An alignment object.  May be altered if passed in as a pointer to C memory (see Note).
对齐对象。如果获得通过，可能会被改变为一个指针，指向C内存（见注）。

参数：tree
A character string containing a Newick formatted tree defining the topology.  Required if the number of species > 3, unless init.mod is specified.  The topology must be rooted, although the root is ignored if the substitution model is reversible.
一个字符串，其中包含将Newick格式的定义拓扑树。如果数> 3种，，除非init.mod指定需要。必须扎根的拓扑结构，虽然根的替代模型被忽略，如果是可逆的。

参数：subst.mod
The substitution model to use.  Some possible models include "REV", "JC69", "K80", "F81", "HKY85", "R2", "U2".  Run subst.mods() for a full list; some models are experimental.
使用的替代模型。一些可能的模式包括“REV”，“JC69”，“K80”，“F81”中，“HKY85”，“R2”，“U2”。运行subst.mods()的完整列表，一些模型的实验。

参数：init.mod
An object of class tm used to initialize the model.
类的对象tm用于初始化该模型。

参数：no.opt
A character vector indicating which parameters NOT to optimize (instead hold constant at their initial values).  By default, the equilibrium frequencies (backgd) are not optimized.  Other parameters that may be indicated here are "ratematrix" for the entire rate matrix, "kappa" for models with transition/transversion ratios, "branches" to hold all branch lengths constant, "ratevar" for rate variation parameters, "scale" for the tree scaling factor, and "scale_sub" for the subtree scaling factor.  This argument does NOT apply to parameters of a lineage-specific model created with add.alt.mod, though such parameters can be held constant by using appropriate arguments when the model is created.  See add.alt.mod for more details about lineage-specific models.
指示哪些参数的字符矢量NOT优化（而不是在它们的初始值保持恒定）。默认情况下，平衡的频率（backgd）未优化。其他参数，可以显示整个速率矩阵的的“ratematrix”，“KAPPA”的模型转换/颠换比，“分支”来保存所有分支长度不变，速度变化参数的的“ratevar”，“规模”树的比例因子，“scale_sub”的子树的缩放因子。这个说法并不适用于参数的血统特定的模型，创建add.alt.mod，虽然这样的参数可以保持不变创建模型时，使用适当的参数。见add.alt.mod更详细的谱系具体型号。

参数：init.backgd.from.data
A logical value; can be FALSE only if init.mod is provided.  If TRUE, use observed base frequencies in data to initialize equilibrium frequencies.  Otherwise use the values from init.mod.  By default uses init.mod values if provided.
逻辑值，可以是FALSE如果init.mod。“如果TRUE，使用观测到的基频数据初始化平衡频率。否则使用值init.mod。默认情况下使用init.mod值（如果提供）。

参数：features
An object of type feat.  If given, a separate model will be estimated for each feature type.
的对象类型feat。如果给定一个单独的模型来估计每个要素类型。

参数：scale.only
A logical value. If TRUE, estimate only the scale of the tree.  Branches will be held at initial values.  Useful in conjunction with init.mod.
逻辑值。如果TRUE，估计只有树的规模。分行将于初始值。有用的配合init.mod。

参数：scale.subtree
A character string giving the name of a node in a tree.  This option implies scale.only=TRUE.  If given, estimate separate scale factors for subtree beneath identified node and the rest of the tree.  The branch leading to the subtree is included in the subtree.
一个字符串给树中的节点的名称。此选项意味着scale.only = TRUE。如果给出，估计单独的比例因子确定节点和树的其余部分之下的子树。子树的子树的分支。

参数：nrates
An integer.  The number of rate categories to use. Specifying a value greater than one causes the discrete gamma model for rate variation to be used,  unless rate constants are specified.
一个整数。使用速率类别的数目。指定一个大于1的值，导致速率变化离散伽马模型被使用，除非指定速率常数。

参数：alpha
A numeric value > 0, for use with "nrates".  Initial value for alpha, the shape parameter of the gamma distribution.
一个数字值> 0，用于包含“nrates”。 α，伽玛分布的形状参数的初始值。

参数：rate.constants
A numeric vector.  Implies nrates =    length(rate.constants).  Also implies EM=TRUE.  Uses a non-parametric mixture model for rates, instead of a gamma distribution.  The weight associated with each rate will be estimated. alpha may still be used to initialize these weights.
一个数值向量。意味着nrates =    length(rate.constants)。也意味着EM=TRUE。使用非参数混合模型价格，而不是gamma分布。率与每个相关联的权重将被估计。 α可被用来初始化这些权重。

参数：selection
A numeric value.  If provided, use selection in the model. The value given will be the initial value for selection.  If NULL, selection will not be used unless init.mod is provided and indicates a model with selection.  selection scales the rate matrix by s/(1-exp(-s)).  Selection is applied after the rate matrix is scaled so that the expected number of substitutions per unit time is 1.  When using codon models, selection only scales nonsynonymous substitutions.
一个数值。如果提供的话，选择在模型中。给出的值将选择的初始值。如果NULL，选择将不会被使用，除非init.mod设置并指示与选择的一个模型。选择缩放率矩阵由s /（1张（ - ））。选择被施加后的速率矩阵被缩放，使得每单位时间的预期数量的替换为1。当使用密码子模型，选择只尺度非同义替换。

参数：init.random
A logical value.  If TRUE, parameters will be initialized randomly.
逻辑值。如果TRUE，参数将被初始化随机。

参数：init.parsimony
A logical value.  If TRUE, branch lengths will be estimated based on parsimony counts for the alignments. Currently only works for models of order0.
逻辑值。如果TRUE，分支的长度，将估计的基础上简约计数的路线。目前只适用于模型order0。

参数：clock
A logical value.  If TRUE, assume a molecular clock in estimation.
逻辑值。如果TRUE，假设一个分子钟估算。

参数：EM
A logical value.  If TRUE, the model is fit using EM rather than the default BFGS quasi-Newton algorithm.  Not available for all models/options.
逻辑值。如果TRUE，该模型是适合使用EM而不是默认的BFGS拟牛顿算法。不适用于所有型号/选项。

参数：precision
A character vector, one of "HIGH", "MED", or "LOW", denoting the level of precision to use in estimating model parameters. Affects convergence criteria for iterative algorithms: higher precision means more iterations and longer execution time.
一种字符矢量，“HIGH（高）”，“MED”，或“LOW”（低）的任意一个，表示电平的精度，使用估计的模型参数。影响迭代算法的收敛准则：更高的精度意味着更多的迭代次数和执行时间较长。

参数：ninf.sites
An integer.  Require at least this many "informative" sites in order to estimate a model.  An informative site as an alignment column with at least two non-gap and non-missing-data characers.
一个整数。至少需要这么多的“信息”的网站，以估计的模型。一个内容的网站与至少两个非间隙和非缺失数据characers的作为取向列。

参数：quiet
A logical value.  If TRUE, do not report progress to screen.
逻辑值。如果TRUE，不汇报工作进度的屏幕。

参数：bound
Defines boundaries for parameters (see Details below).
定义边界参数（详见下文）。

参数：log.file
If TRUE, write log of optimization to the screen.  If a character string, write log of optimization to the named file. Otherwise write no optimization log.
如果是TRUE，写log到屏幕上的优化。如果一个字符串，写入，优化登录到指定的文件。否则，没有优化log写。

值----------Value----------

An object of class tm (tree model), or (if several models are computed, as is possible with the features or windows options), a list of
tm（树模式），或（如果几个模型的计算，是可能的功能或窗口选项），列表中的对象的类

参数边界----------Parameter boundaries----------

Boundaries can be set for some parameters using the bound argument. The bound argument should be a vector of character strings, each element defines the boundaries for a single parameter.  The boundaries are best explained by example.  A value of c("scale[0,1]",    "scale_sub[1,]", "kappa[,3]") would imply to keep the scale between 0 and 1, the subtree scale between 1 and infinity, and kappa between 0 and 3.  The blank entries in the subtree_scale upper bound and kappa's lower bound indicate not to set this boundary, in which case the normal default boundary will be used for that parameter.  (Most parameters are defined between 0 and infinity).  Most of the parameters listed in the description of no.opt can also have their boundaries set in this way.
可以设置一些参数使用绑定的参数边界。绑定的参数应该是一个字符串向量，每个元素定义了一个单一的参数的边界。的界限的例子是最好的解释。 A值c("scale[0,1]",    "scale_sub[1,]", "kappa[,3]")将意味着在0和1之间，要保持比例1到无穷大，和kappa之间的子树的规模在0和3之间。在subtree_scale上限和kappa的低约束的空白项目表示不设边界，在这种情况下，正常的默认边界将被用于该参数。（0和无穷大之间的大多数参数的定义）。大多数参数在描述no.opt列出也可以以这种方式设置有其边界。

注意----------Note----------

If msa or features object are passed in as pointers to C memory, they may be altered by this function!  Use copy.msa(msa) or
如果MSA或对象特点，通过在C内存的指针，他们可能会被此函数所改变！使用copy.msa(msa)

（作者）----------Author(s)----------

Melissa J. Hubisz and Adam Siepel

实例----------Examples----------

exampleArchive <- system.file("extdata", "examples.zip", package="rphast")
files <- c("ENr334.maf", "ENr334.fa", "gencode.ENr334.gff", "rev.mod")
unzip(exampleArchive, files)
m <- read.msa("ENr334.maf")
mod <- phyloFit(m, tree="((hg18, (mm9, rn4)), canFam2)")
mod
phyloFit(m, init.mod=mod)
likelihood.msa(m, mod)
mod$likelihood
print(mod$likelihood, digits=10)
f <- read.feat("gencode.ENr334.gff")
mod <- phyloFit(m, tree="((hg18, (mm9, rn4)), canFam2)",
            features=f, quiet=TRUE)
names(mod)
mod$other
mod[["5'flank"]]
phyloFit(m, init.mod=mod$AR, nrates=3, alpha=4.0)
phyloFit(m, init.mod=mod$AR, rate.constants=c(10, 5, 1))
# background frequencies options[背景频率选项]

# this should use the background frequencies from the initial mod[这应该使用的背景频率从最初的MOD]
phyloFit(m, init.mod=mod$AR, quiet=TRUE)$backgd
mod$AR$backgd

# this should use the background frequencies from the data[这应该从数据中使用的背景频率]
phyloFit(m, init.mod=mod$AR, init.backgd.from.data=TRUE, quiet=TRUE)$backgd
mod$AR$backgd

# this should optimize the background frequencies[这应该优化后台频率]
phyloFit(m, init.mod=mod$AR, no.opt=NULL, quiet=TRUE)$backgd
mod$AR$backgd

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册