fragmentLenDetect(nucleR)
fragmentLenDetect()所属R语言包:nucleR
Fragments length detection from single-end sequencing samples
从单端测序样品片段长度检测
译者:生物统计家园网 机器人LoveR
描述----------Description----------
When using single-ended sequencing, the resulting partial sequences map only in one strand, causing a bias in the coverage profile if not corrected. The only way to correct this is knowing the average size of the real fragments. nucleR uses this information when preprocessing single-ended sequences. You can provide this information by your own (usually a 147bp length is a good aproximation) or you can use this method to automatically guess the size of the inserts.
当使用单端测序,导致部分序列映射只在一个链,如果不纠正,造成覆盖配置文件中的偏见。只有这样,才能纠正这种认识的真实片段的平均大小。 nucleR使用此信息在预处理时的单端序列。你可以提供自己的信息(通常为1 147bp的长度是良好的aproximation),或到自动猜测插入的大小,您可以使用此方法。
用法----------Usage----------
## S4 method for signature 'AlignedRead'
fragmentLenDetect(reads, samples=1000, window=1000, min.shift=1, max.shift=100, mc.cores=1, as.shift=FALSE)
## S4 method for signature 'RangedData'
fragmentLenDetect(reads, samples=1000, window=1000, min.shift=1, max.shift=100, mc.cores=1, as.shift=FALSE)
参数----------Arguments----------
参数:reads
Raw single-end reads (AlignedRead or RangedData format)
原单端读取(AlignedRead或RangedData格式)
参数:samples
Number of samples to perform the analysis (more = slower but more accurate)
执行分析(更多的样本数=速度较慢,但更准确)
参数:window
Analysis window. Usually there's no need to touch this parameter.
分析窗口。一般没有必要碰这个参数。
参数:min.shift, max.shift
Minimum and maximum shift to apply on the strands to detect the optimal fragment size. If the range is too big, the performance decreases.
申请链检测最佳片段大小,最小和最大的转变。如果幅度过大,性能下降。
参数:as.shift
If TRUE, returns the shift needed to align the middle of the reads in opposite strand. If FALSE, returns the mean inferred fragment length.
如果是TRUE,返回需要在对面链读取中间对齐的转变。如果为FALSE,返回的平均推断片段长度。
参数:mc.cores
If multicore support, maximum number of cores allowed to use.
如果多核的支持,允许的最大数量的核心使用。
Details
详情----------Details----------
This function shifts one strand downstream one base by one from min.shift to max.shift. In every step, the correlation on a random position of length window is checked between both strands. The maximum correlation is returned and averaged for samples repetitions.
这函数转移由min.shiftmax.shift之一的一个链下游的一个碱基。在每一步,长度window随机位置上的相关性检查两股。返回最大的相关性和samples重复的平均值。
The final returned length is the best shift detected plus the width of the reads. You can increase the performance of this function by reducing the samples value and/or narrowing the shift range. The window size has almost no impact on the performance, despite a to small value can give biased results.
最终返回的长度是最好的移位加读取宽度检测。你可以增加此功能的性能,减少samples值和/或缩小移位范围。 window大小对性能几乎没有影响,但一个小的价值,可以给偏颇的结果。
值----------Value----------
Inferred mean lenght of the inserts by default, or shift needed to align strands if as.shift=TRUE
推断意味着默认情况下,插入的长度,或转移需要对齐股如果as.shift=TRUE
作者(S)----------Author(s)----------
Oscar Flores <a href="mailtoflores@mmb.pcb.ub.es">oflores@mmb.pcb.ub.es</a>
举例----------Examples----------
#Create a sinthetic dataset, simulating single-end reads, for positive and negative strands[创建一个sinthetic集,模拟单端读取,正面和负面的股]
pos = syntheticNucMap(nuc.len=40, lin.len=130)$syn.reads #Positive strand reads[正股读取]
neg = IRanges(end=start(pos)+147, width=40) #Negative strand (shifted 147bp)[负链(移147bp)]
sim = RangedData(c(pos, neg), strand=c(rep("+", length(pos)), rep("-", length(neg))))
#Detect fragment lenght (we know by construction it is really 147)[检测片段长度(我们知道建设是真的147)]
fragmentLenDetect(sim, samples=50)
#The function restrict the sampling to speed up the example[功能限制采样加快范例]
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|