DEGseq(DEGseq)
DEGseq()所属R语言包:DEGseq
DEGseq: Identify Differentially Expressed Genes from RNA-seq data
DEGseq:差异表达基因的RNA-seq的数据
译者:生物统计家园网 机器人LoveR
描述----------Description----------
This function is used to identify differentially expressed genes from RNA-seq data. It takes uniquely mapped reads from RNA-seq data for the two samples with a gene annotation as input. So users should map the reads (obtained from sequencing libraries of the samples) to the corresponding genome in advance.
这个函数是用来从RNA-seq的数据,以确定差异表达基因。它采用独特的映射从两个样本作为输入的基因注释RNA-seq的数据读取。因此,用户应该读取(从测序的样本库中获得)映射到相应的基因组提前。
用法----------Usage----------
DEGseq(mapResultBatch1, mapResultBatch2, fileFormat="bed", readLength=32,
strandInfo=FALSE, refFlat, groupLabel1="group1", groupLabel2="group2",
method=c("LRT", "CTR", "FET", "MARS", "MATR", "FC"),
pValue=1e-3, zScore=4, qValue=1e-3, foldChange=4, thresholdKind=1,
outputDir="none", normalMethod=c("none", "loess", "median"),
depthKind=1, replicate1="none", replicate2="none",
replicateLabel1="replicate1", replicateLabel2="replicate2")
参数----------Arguments----------
参数:mapResultBatch1
vector containing uniquely mapping result files for technical replicates of sample1 (or replicate1 when method="CTR").
为技术向量,独特的映射结果文件复制SAMPLE1(或replicate1时method="CTR")。
参数:mapResultBatch2
vector containing uniquely mapping result files for technical replicates of sample2 (or replicate2 when method="CTR").
为技术向量,独特的映射结果的文件复制的sample2(或replicate2时method="CTR")。
参数:fileFormat
file format: "bed" or "eland". <br> example of "bed" format: chr12 7 38 readID 2 + <br> example of "eland" format: readID chr12.fa 7 U2 F <br> Note: The field separator character is TAB. And the files must follow the format as one of the examples.
文件格式:"bed"或"eland"。 "bed"格式:chr12 7 38 readID 2 +参考注:字段分隔符是"eland"readID chr12.fa 7 U2 FTAB格式参考范例的参考例子。和文件必须按照格式,作为一个例子。
参数:readLength
the length of the reads (only used if fileFormat="eland").
如果fileFormat="eland"读取长度(只用)。
参数:strandInfo
whether the strand information was retained during the cloning of the cDNAs.
是否被保留在克隆的cDNA链信息。
"TRUE" : retained,
"TRUE" :保留,
"FALSE": not retained.
"FALSE":不保留。
参数:refFlat
gene annotation file in UCSC refFlat format. <br> See http://genome.ucsc.edu/goldenPath/gbdDescriptionsOld.html#RefFlat.
基因UCSC的refFlat格式的注释文件。参考见http://genome.ucsc.edu/goldenPath/gbdDescriptionsOld.html#RefFlat。
参数:groupLabel1
label of group1 on the plots.
图group1的标签。
参数:groupLabel2
label of group2 on the plots.
Group2的标签上的图。
参数:method
method to identify differentially expressed genes. Possible methods are:
差异表达基因的方法来确定。可能的方法是:
"LRT": Likelihood Ratio Test (Marioni et al. 2008),
"LRT":似然比检验(Marioni等2008),
"CTR": Check whether the variation between two Technical Replicates can be explained by the random sampling model (Wang et al. 2009),
"CTR":检查是否可以通过随机抽样模型(Wang等2009)解释两个技术的变化复制,
"FET": Fisher's Exact Test (Joshua et al. 2009),
"FET":Fisher精确检验(约书亚等人2009),
"MARS": MA-plot-based method with Random Sampling model (Wang et al. 2009),
"MARS":主图基于随机抽样模型(Wang等2009)的方法,
"MATR": MA-plot-based method with Technical Replicates (Wang et al. 2009),
"MATR":主图方法与技术复制(王等,2009),
"FC" : Fold-Change threshold on MA-plot.
"FC" :倍数式更改上MA图的阈值。
参数:pValue
pValue threshold (for the methods: LRT, FET, MARS, MATR). <br> only used when thresholdKind=1.
pValue阈值(方法:LRT, FET, MARS, MATR)。参考只用时thresholdKind=1。
参数:zScore
zScore threshold (for the methods: MARS, MATR). <br> only used when thresholdKind=2.
zScore阈值(方法:MARS, MATR)。参考只用时thresholdKind=2。
参数:qValue
qValue threshold (for the methods: LRT, FET, MARS, MATR). <br> only used when thresholdKind=3 or thresholdKind=4.
qValue阈值(方法:LRT, FET, MARS, MATR)。参考只使用时thresholdKind=3或thresholdKind=4。
参数:thresholdKind
the kind of threshold. Possible kinds are:
阈值的一种。可能的类型:
1: pValue threshold,
1:pValue阈值,
2: zScore threshold,
2:zScore阈值,
3: qValue threshold (Benjamini et al. 1995),
3:qValue阈值(。Benjamini等,1995),
4: qValue threshold (Storey et al. 2003),
4:qValue阈值(层等2003),
5: qValue threshold (Storey et al. 2003) and Fold-Change threshold on MA-plot are both required (can be used only when method="MARS").
5:MA图qValue阈值(层等2003)和倍数变化阈值都是必需的(可用于只有当method="MARS")。
参数:foldChange
fold change threshold on MA-plot (for the method: FC).
主图倍阈值(方法:FC)。
参数:outputDir
the output directory.
输出目录。
参数:normalMethod
the normalization method: "none", "loess", "median" (Yang,Y.H. et al. 2002). <br> recommend: "none".
归一化法:"none", "loess", "median"(杨,YH等2002)。参考建议:"none"。
参数:depthKind
1: take the total number of reads uniquely mapped to genome as the depth for each replicate, <br> 0: take the total number of reads uniquely mapped to all annotated genes as the depth for each replicate. <br> We recommend taking depthKind=1, especially when the genes in annotation file are part of all genes.
1:唯一映射到基因组的深度为每个重复读取的总数,参考0:每个重复读取总数唯一映射到所有注释基因的深度。我们建议采取的参考depthKind=1,尤其是当注释文件中的基因是所有基因的一部分。
参数:replicate1
files containing uniquely mapped reads obtained from replicate batch1 (only used when method="MATR").
文件包含唯一映射读取复制BATCH1获得的(只用时method="MATR")。
参数:replicate2
files containing uniquely mapped reads obtained from replicate batch2 (only used when method="MATR").
文件包含唯一映射读取复制batch2获得的(只用时method="MATR")。
参数:replicateLabel1
label of replicate batch1 on the plots (only used when method="MATR").
标签上的图复制BATCH1(仅用于当method="MATR")。
参数:replicateLabel2
label of replicate batch2 on the plots (only used when method="MATR").
图复制batch2标签(仅用于当method="MATR")。
参考文献----------References----------
powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 57, 289-300.
Bioinformatics, 25, 1026-1032.
2-channel gene expression microarrays. BMC Genomics, 10, 221.
Genome Res., 18, 1509-1517.
slide systematic variation. Nucleic Acids Research, 30, e15.
参见----------See Also----------
DEGexp, getGeneExp, readGeneExp, kidneyChr21.bed, liverChr21.bed, refFlatChr21.
DEGexp,getGeneExp,readGeneExp,kidneyChr21.bed,liverChr21.bed,refFlatChr21。
举例----------Examples----------
kidneyR1L1 <- system.file("extdata", "kidneyChr21.bed.txt", package="DEGseq")
liverR1L2 <- system.file("extdata", "liverChr21.bed.txt", package="DEGseq")
refFlat <- system.file("extdata", "refFlatChr21.txt", package="DEGseq")
mapResultBatch1 <- c(kidneyR1L1) ## only use the data from kidneyR1L1 and liverR1L2[#只使用从kidneyR1L1和liverR1L2的数据]
mapResultBatch2 <- c(liverR1L2)
outputDir <- file.path(tempdir(), "DEGseqExample")
DEGseq(mapResultBatch1, mapResultBatch2, fileFormat="bed", refFlat=refFlat,
outputDir=outputDir, method="LRT")
cat("outputDir:", outputDir, "\n")
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|