GeneSelector(GeneSelector)
GeneSelector()所属R语言包:GeneSelector
Select promising candidate genes
选择有前途的候选基因
译者:生物统计家园网 机器人LoveR
描述----------Description----------
Given GeneRankings or AggregatedRankings obtained from several ranking procedures, the aim is to find is a unifying output. A threshold equal to the maximum rank/list position which is still relevant for the question of interest may be provided by the user, or the threshold can adaptively be determined via significance analysis in multiple testing procedures. Then, all genes are checked whether their ranks fall below this threshold consistenly in all ranking procedures used. If this holds, then the gene is selected.<br> A final order of the genes is defined by the following criteria
鉴于GeneRankings或AggregatedRankings从几个排名程序获得的,目的是要找到一个统一的输出。等于最高排名/列表中的位置,这仍然是利益的问题有关的阈值可以由用户提供,或自适应阈值可以通过多个测试程序的意义分析确定。然后,所有的基因检查,他们的队伍是否低于这个阈值consistenly中使用的所有排名程序。如果这样认为,那么被选中的基因。参考一个基因的最终命令是由下列条件定义
1. A user-defined ranking of the used ranking procedures, i.e.
1。一个用户定义的排名,排名程序,即
2. 'Selection', i.e. falling below the threshold.
2。 “选择”,即低于阈值的下降。
3. The obtained ranks. The rank from the most important ranking procedure is considered, then that from the second
3。获得行列。排名被认为是从最重要的排名过程,然后,从第二
用法----------Usage----------
GeneSelector(Rlist, ind = NULL, indstatistic = 1:length(Rlist),
threshold = c("user", "BH", "qvalue", "Bonferroni", "Holm",
"Hochberg", "SidakSS", "SidakSD", "BY"),
maxrank = NULL, maxpval = 0.05)
参数----------Arguments----------
参数:Rlist
A list of objects of class RepeatedRanking or AggregatedRanking, all based on the same data.
一个对象类列表RepeatedRanking或AggregatedRanking,都基于相同的数据。
参数:ind
Indices of genes to be considered. Defaults to all.
指数被认为是基因。默认的所有。
参数:indstatistic
An index vector defining the importance of the elements of Rlist. For instance, if RList consists of five elements, then indstatistic=c(2,4,1,3,5) would give most importance to the second element.
索引向量定义的Rlist元素的重要性。举例来说,如果RList五个要素组成,那么indstatistic=c(2,4,1,3,5)会给最看重的第二个元素。
参数:threshold
Determination of the threshold (s. description). Can be either "user", in which case the threshold is specified via maxrank, or an acronym for one of the following multiple testing procedures (s. help file for mt.rawp2adjp in the package multtest for detailed information and references):
阈值的测定(S.描述)。可以是"user",在这种情况下,阈值指定通过maxrank,或一个缩写为以下多个测试程序(S.mt.rawp2adjp包<帮助文件X>的详细信息和参考):
"BH"Benjamini-Hochberg procedure.
“波黑”Benjamini Hochberg过程。
"qvalue"The q-value of Storey and Tibshirani (2003): "Statistical significance for genomewide studies". PNAS of the USA, 100, 9440-9445.
“qvalue”Q值层和Tibshirani(2003):“统计意义的基因组研究”。国家科学院院刊的美国,100,9440-9445。
"Bonferroni"Bonferroni procedure.
“邦弗朗尼”邦弗朗尼程序。
"Holm"Holm procedure.
“霍尔姆”的霍尔姆过程。
"SidakSS"Sidak single-step procedure.
“SidakSS”Sidak单步骤。
"SidakSD"Sidak step-down procedure.
“SidakSD”Sidak降压过程。
"BY"Benjamini-Yekutieli procedure. In the latter case, the p-values of the element of Rlist attributed most importance (s. indstatistic) are adjusted and the number of p-values falling below maxpval is used as threshold rank. If the most important statistic provides no p-values, then those of the second most are used (if available), and so on.
“BY”Benjamini-Yekutieli程序。在后一种情况下的P-值的Rlist元素最重要归咎于(S.indstatistic)调整和P-值低于maxpval下降使用阈值排名。如果最重要的统计数据提供无P值,然后第二个最被使用(如果可用),并依此类推。
参数:maxrank
A positive integer specifying a user-defined threshold.
一个正整数,指定一个用户定义的阈值。
参数:maxpval
Specified if threshold is not user.
指定threshold如果不user。
值----------Value----------
An object of class GeneSelectorOutput.
对象类GeneSelectorOutput。
作者(S)----------Author(s)----------
Martin Slawski <br>
Anne-Laure Boulesteix
参见----------See Also----------
GeneRanking, AggregatedRanking
GeneRanking,AggregatedRanking
举例----------Examples----------
## Load toy gene expression data[#加载玩具基因表达数据]
data(toydata)
### class labels[##类的标签]
yy <- toydata[1,]
### gene expression[##基因表达]
xx <- toydata[-1,]
### Get Rankings from five different statistics[#获取来自五个不同的统计排名]
ordinaryT <- RankingTstat(xx, yy, type="unpaired")
baldilongT <- RankingBaldiLong(xx, yy, type="unpaired")
samT <- RankingSam(xx, yy, type="unpaired")
wilc <- RankingWilcoxon(xx, yy, type="unpaired")
wilcebam <- RankingWilcEbam(xx, yy, type="unpaired")
### form a list[##形成一个列表]
LL <- list(ordinaryT, baldilongT, samT, wilc, wilcebam)
### order statistics (assign importance)[#顺序统计(分配的重要性)]
ordstat <- c(3,4,2,1,5)
### start GeneSelector, threshold set to rank 50[##启动GeneSelector的,阈值设置排名50]
gk50 <- GeneSelector(LL, indstatistic=ordstat, maxrank=50)
### start GeneSelector, using adaptive threshold based on p-values,[##启动GeneSelector的,使用基于自适应阈值,p值,]
### here using the multiple testing procedure of Hochberg[#这里使用Hochberg多个测试程序]
gkpval <- GeneSelector(LL, indstatistic=ordstat, threshold = "BH", maxpval=0.05)
### show results[#显示结果]
show(gkpval)
str(gkpval)
toplist(gkpval)
### which genes have been selected ?[#基因已被选中?]
SelectedGenes(gkpval)
### Detailed information about gene 4[基因4#的详细信息]
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|