R语言 goseq包 getlength()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-2-25 20:53:16

getlength(goseq)
getlength()所属R语言包：goseq

                                       Retrieves Gene length data
                                       检索基因长度数据

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

Gets the length of each gene in a vector.
获取向量中的每一个基因的长度。

用法----------Usage----------

getlength(genes, genome, id)

参数----------Arguments----------

参数：genes
A vector or list of the genes for which length information is required.
基因向量或列表的长度信息是必需的。

参数：genome
A string identifying the genome that genes refer to.  For a list of supported organisms run supportedGenomes.
字串genes指识别基因组。为支持生物体的列表执行supportedGenomes。

参数：id
A string identifying the gene identifier used by genes.  For a list of supported gene IDs run supportedGeneIDs.
字符串确定genes使用基因标识。为支持基因标识的列表执行supportedGeneIDs。

Details

详情----------Details----------

Length data is obtained from data obtained from the UCSC genome browser for each combination of genome and id.  As fetching this data at runtime is time consuming, a local copy of the length information for common genomes and gene ID are included in the geneLenDataBase package.  This function uses this package to fetch the required data.
获得从每个genome和id的组合从UCSC基因组浏览器获得的数据长度的数据。由于在运行时获取这个数据是费时，geneLenDataBase包中包括一个共同的基因组和基因ID的长度信息的本地副本。这个函数使用这个包来获取所需的数据。

The length of a gene is taken to be the median length of all its mature, mRNA, transcripts.  It is always preferable to obtain length information directly for the gene ID used to summarize your count data, rather than converting IDs and then using the supplied databases.  Even when two genes have a one-to-one mapping between different identifier conventions (which is often not the case), they frequently refer to slightly different regions of the genome with different lengths.  It is therefore recommended that the user perform the full analysis in terms of only one gene ID, or manually obtain their own length data for the identifier used to bin reads by gene.
采取的是中位数，其成熟的mRNA转录基因的长度。它始终是最好获得直接用来总结计数数据，而不是转换ID，然后使用提供的数据库的基因ID的长度信息。甚至当两个基因之间不同的的标识符公约（这往往并非如此）有一个一对一的映射，他们经常是指略有不同区域，不同长度的基因组。因此，建议用户执行全面的分析，只有一个基因ID或手动获取自己的长度数据用于斌读通过基因标识。

值----------Value----------

Returns a vector of the gene lengths, in the same order as genes.  If length data is unavailable for a particular gene NA is returned in that position.  The returned vector is intended for use with the bias.data option of the nullp function.
返回的基因长度的向量，在相同的顺序为genes。如果长度的数据是一个特定的基因无法NA返回在那个位置。使用bias.datanullp功能选项，用于返回向量。

作者（S）----------Author(s)----------

Matthew D. Young <a href="mailto:myoung@wehi.edu.au">myoung@wehi.edu.au</a>

参见----------See Also----------

supportedGenomes, supportedGeneIDs, nullp, geneLenDataBase
supportedGenomes，supportedGeneIDs，nullp，geneLenDataBase

举例----------Examples----------

genes <- c("ENSG00000124208", "ENSG00000182463", "ENSG00000124201", "ENSG00000124205", "ENSG00000124207")
getlength(genes,'hg19','ensGene')

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册

R语言 goseq包 getlength()函数中文帮助文档(中英文对照)

浏览过的版块