peakreg(iSeq)
peakreg()所属R语言包:iSeq
Call and merge enriched genomic windows/bins.
打检测和合并丰富的基因组窗口/箱。
译者:生物统计家园网 机器人LoveR
描述----------Description----------
A function used to call and merge enriched bins using the posterior probability calculated by iSeq1 or iSeq2 functions at certain posterior probability and false discovery rate (FDR) cutoffs.
函数调用和使用后验概率计算在一定的后验概率和错误发现率(FDR)截止iSeq1或iSeq2职能合并丰富的垃圾箱。
用法----------Usage----------
peakreg(chrpos,count,pp,cutoff,method=c("ppcut","fdrcut"),maxgap=300)
参数----------Arguments----------
参数:chrpos
A n by 3 matrix or data frame. The rows correspond to genomic bins. The first column contains chromosome IDs; the second and third columns contain the start and end positions of the bin, respectively.
一个由3矩阵或数据框N。行对应的基因组箱。第一列包含染色体的ID,第二和第三列包含垃圾桶的开始和结束位置,分别。
参数:count
A n by 2 matrix containing the number of sequence tags in the bins specified by chrpos. The first column contains the tag counts for chain 1 (usually the forward chain), and the second column contains the tag counts for chain 2 (usually the reverse chain). See the document of the function 'mergetag' for the definition of chain 1 and 2. The function uses the information in 'count' to find the center of the enriched regions, where the true binding sites are usually located.
由2矩阵包含序列标签的在由chrpos指定的垃圾箱数量N。第一列包含链1(通常是正向链)标记计数,第二列包含2链(通常是反向链)标记计数。看到功能mergetag 1和第2链的定义的文件。在“计数”功能使用的信息找到富集的区域,其中真正的结合位点通常位于市中心。
参数:pp
A vector containing the posterior probabilities of bins in the enriched state returned by functions iSeq1 or iSeq2.
返回功能iSeq1或iSeq2一个向量,箱中的富集状态的后验概率。
参数:cutoff
The cutoff value (a scalar) used to call enriched bins. If use posterior probability as a criterion (method="ppcut"), a bin is said to be enriched if its pp is greater than the cutoff. If use FDR as a criterion (method="fdrcut"), bins are said to be enriched if the bin-based FDR is less than the cutoff. The FDR is calculated using a direct posterior probability approach (Newton et al., 2004).
临界值(标量)用于调用丰富的垃圾箱。如果使用后验概率,作为一个标准(方法=“ppcut)的bin被说成是丰富的,如果它的PP是大于截止。如果使用FDR=“fdrcut”)作为一个标准(方法,垃圾桶说要丰富,如果滨为基础的FDR是小于截止。FDR是使用直接的后验概率方法(牛顿等人,2004年)计算。
参数:method
'ppcut' or 'fdrcut'.
期ppcut或fdrcut。
参数:maxgap
The criterion used to merge enriched bins. If the genomic distance of adjacent bins is less than maxgap, the bins will be merged into the same enriched region.
该标准用于合并丰富的垃圾箱。如果相邻的垃圾桶基因距离是少比maxgap,垃圾箱将被合并到相同的富集区域。
值----------Value----------
A data frame with rows corresponding to enriched regions and columns corresponding to the following:
具有丰富的区域和列对应以下行相应的数据框:
参数:chr
Chromosome IDs.
染色体的ID。
参数:gstart
The start genomic position of the enriched region.
富集的区域开始基因组的位置。
参数:gend
The end genomic position of the enriched region.
富集的区域年底的基因组的位置。
参数:rstart
The row number for gstart in chrpos.
行数为gstart在chrpos的。
参数:rend
The row number for gend in chrpos.
行数为gend在chrpos的。
参数:peakpos
The inferred center (peak) of the enriched region.
推断的中心(峰值)的富集区。
参数:meanpp
The mean posterior probability of the merged regions/bins.
合并后的区域/箱平均后验概率。
参数:ct1
total tag counts for the region from gstart to gend for the chain corresponding to count[,1]; ct1=sum(count[rstart:rend,1])
从gstart区域以gend链相应的数[1]总标签数; CT1 = SUM(计数[RSTART:撕裂,1])
参数:ct2
total tag counts for the region from gstart to gend for the chain corresponding to count[,1]; ct2=sum(count[rstart:rend,2])
从gstart区域以gend链相应的数[1]总标签数; CT2 = SUM(计数[RSTART:撕裂,2)
参数:ct12
ct12 = ct1 + ct2
CT12 = CT1 + CT2
参数:sym
A parameter used to measure if the forward and reverse tag counts are symmetrical (or balanced) in enriched regions. The values range from 0.5 (perfect symmetry) to 0 (complete asymmetry).
使用一个参数来衡量,如果是对称的正向和反向的标签计数富集区域(或平衡)。值的范围从0.5(完美对称)0(完成不对称)。
作者(S)----------Author(s)----------
Qianxing Mo <a href="mailto:moq@mskcc.org">moq@mskcc.org</a>
参考文献----------References----------
data analysis. Biostatistics, Advance Access published September 13, 2011. doi:10.1093/biostatistics/kxr029
differential gene expression with a semiparametric hierarchical mixture method. Biostatistics 5 , 155-176.
参见----------See Also----------
iSeq1, iSeq2, mergetag,plotreg
iSeq1,iSeq2,mergetag,plotreg
举例----------Examples----------
data(nrsf)
chip = rbind(nrsf$chipFC1592,nrsf$chipFC1862,nrsf$chipFC2002)
mock = rbind(nrsf$mockFC1592,nrsf$mockFC1862,nrsf$mockFC2002)
tagct = mergetag(chip=chip,control=mock,maxlen=80,minlen=10,ntagcut=20)
tagct22 = tagct[tagct[,1]=="chr22",]
res1 = iSeq1(Y=tagct22[,1:4],gap=200,burnin=200,sampling=500,ctcut=0.95,a0=1,b0=1,
a1=5,b1=1, k0=3,mink=0,maxk=10,normsd=0.1,verbose=FALSE)
reg1 = peakreg(tagct22[,1:3],tagct22[,5:6]-tagct22[,7:8],res1$pp,0.5,
method="ppcut",maxgap=200)
reg2 = peakreg(tagct22[,1:3],tagct22[,5:6]-tagct22[,7:8],res1$pp,0.05,
method="fdrcut",maxgap=200)
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|