exomecounts(exomeCopy)
exomecounts()所属R语言包:exomeCopy
Sample counts from 16 exome sequencing samples from 1000 Genomes Project
从1000基因组计划的16个外显子组测序样品的样品计数
译者:生物统计家园网 机器人LoveR
描述----------Description----------
This data set gives sample read counts in 1000 genomic ranges for 16 exome sequencing samples from the PUR population of the 1000 Genomes Project, along with the GC-content in the ranges. For instructions on how to prepare read count and covariate data, please see the example code in the man pages for subdivideGRanges and countBamInGRanges.
这组数据为样本读千人基因组计划PUR人口从16外显子组测序样品一起在GC含量范围,在1000基因组范围的计数。有关如何准备读计数和协数据说明,请参阅在subdivideGRanges和countBamInGRangesman页面的例子代码。
The genomic ranges are generated from small portion of the CCDS regions of chromosome 1 (hg19). The CCDS regions are subdivided evenly into ranges around 100bp using the subdivideGRanges function with default settings. Only ranges with positive counts across samples are retained. These regions were downloaded as a BED file from the UCSC Genome Browser (http://genome.ucsc.edu/cgi-bin/hgGateway). The mapping files for the exome sequencing data and descriptions of the experiments are available at the 1000 Genomes Project website (http://www.1000genomes.org/data). The directories used are listed in the file 1000Genomes_files.txt in the extdata directory.
生成的基因组范围从1号染色体(hg19)身心区域的一小部分。身心区域范围均匀分为左右subdivideGRanges函数使用默认设置的100个基点。只有跨阳性样本数的范围将被保留。这些区域被下载作为从床UCSC基因组浏览器(http://genome.ucsc.edu/cgi-bin/hgGateway)文件。外显子组测序数据和描述实验的映射文件是在1000基因组计划网站(http://www.1000genomes.org/data)。使用目录中列出的文件1000Genomes_files.txtextdata目录的。
The column names are the sample names from the 1000 Genomes Project. Library format is paired-end reads and sample counts reflect both sequenced reads counted in their respective genomic ranges.
列名是从1000基因组计划的样本名。库格式配对年底读取和样本数反映在各自的基因组范围内两个序列读取。
用法----------Usage----------
data(exomecounts)
格式----------Format----------
A RangedData object.
一个RangedData对象。
源----------Source----------
1000 Genomes Project and Consensus Coding Sequence Project
千人基因组计划和共识项目编码序列
参考文献----------References----------
population-scale sequencing. Nature 467, 1061-1073 (2010). http://dx.doi.org/10.1038/nature09534.
http://www.1000genomes.org/announcements/release-phase-1-exome-alignments-2011-07-19
Identifying a common protein-coding gene set for the human and mouse genomes. Genome research 19, 1316-1323 (2009). http://dx.doi.org/10.1101/gr.080531.108.
参见----------See Also----------
RangedData
RangedData
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|