nullp(goseq)
nullp()所属R语言包:goseq
Probability Weighting Function
概率权重函数
译者:生物统计家园网 机器人LoveR
描述----------Description----------
Calculates a Probability Weighting Function for a set of genes based on a given set of biased data (usually gene length) and each genes status as differentially expressed or not.
计算基础上偏颇的数据(通常是基因长度)和每一个基因的地位差异表达或不给定的基因集的概率权重函数。
用法----------Usage----------
nullp(DEgenes, genome, id, bias.data=NULL,plot.fit=TRUE)
参数----------Arguments----------
参数:DEgenes
A named binary vector where 1 represents DE, 0 not DE and the names are gene IDs.
,其中1代表德,没有德的姓名命名的二进制向量基因标识。
参数:genome
A string identifying the genome that genes refer to. For a list of supported organisms run supportedGenomes.
字串genes指识别基因组。为支持生物体的列表执行supportedGenomes。
参数:id
A string identifying the gene identifier used by genes. For a list of supported gene IDs run supportedGeneIDs.
字符串确定genes使用基因标识。为支持基因标识的列表执行supportedGeneIDs。
参数:bias.data
A numeric vector containing the data on which the DE may depend. Usually this is the median transcript length of each gene in bp. If set to NULL nullp will attempt to fetch length using getlength.
一个数字向量的DE可能取决于数据。通常,这是每个基因在BP的中位数的成绩单长度。如果设置NULLnullp将尝试获取长度使用getlength的。
参数:plot.fit
Plot the PWF or not? Calls plotPWF with default values if TRUE.
绘制的法定公益金或不呢?调用plotPWF如果TRUE默认值。
Details
详情----------Details----------
It is essential that the entire analysis pipeline, from summarizing raw reads through to using goseq be done in just one gene identifier format. If your data is in a different format you will need to obtain the gene lengths and supply them to the nullp function using the bias.data arguement. Converting to a supported format from another format should be avoided whenever possible as this will almost always result in data loss.
至关重要的是,整个分析管道总结原料,通过使用读取goseq只是一个基因的标识符格式。如果你的数据是在不同的格式,您将需要获得该基因的长度,并提供他们nullp函数使用bias.dataarguement。支持的格式转换成另一种格式,应尽量避免,因为这几乎总是会导致数据丢失。
NAs are allowed in the bias.data vector if you do not have information about a certain gene. Setting a gene to NA is preferable to removing it from the analysis.
NAS在bias.data向量,如果你没有某个基因的信息。设置一个基因NA是最好从分析中删除。
If bias.data is left as NULL, nullp attempts to use getlength to fetch GO catgeory to gene identifier mappings.
如果bias.data留给NULL,nullp尝试使用getlength获取catgeory去找基因标识映射。
It is recommended you review the fit produced by the nullp function before proceeding by leaving plot.fit as TRUE.
建议你离开nullp函数产生适合,然后再进行检讨plot.fitTRUE。
值----------Value----------
A data frame with 3 columns, named "DEgenes", "bias.data" and "pwf" with the rownames set to the gene names. Each row corresponds to a gene with the DEgenes column specifying if the gene is DE (1 for DE, 0 for not DE), the bias.data column giving the numeric value of the DE bias being accounted for (usually the gene length or number of counts) and the pwf column giving the genes value on the probability weighting function. This object is usually passed to goseq to calculate enriched categories or plotPWF for further plotting.
一个有3列名为“DEgenes”,“bias.data”和“PWF”的基因名称设置的rownames的,数据框。指定的基因,如果DE(德1,不是去0)DEgenes列,每一行对应一个基因,bias.data的DE偏置数值列(通常是基因的长度或数量占计数)和的PWF列给基因的概率加权函数值。这个对象通常是通过goseq计算丰富的类别或plotPWF为进一步策划。
作者(S)----------Author(s)----------
Matthew D. Young <a href="mailto:myoung@wehi.edu.au">myoung@wehi.edu.au</a>
参考文献----------References----------
Genome Biology Date: Feb 2010 Vol: 11 Issue: 2 Pages: R14
参见----------See Also----------
supportedGenomes, supportedGeneIDs, goseq, getlength
supportedGenomes,supportedGeneIDs,goseq,getlength
举例----------Examples----------
data(genes)
pwf <- nullp(genes, 'hg19', 'ensGene')
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|