找回密码
 注册
查看: 584|回复: 0

R语言 PREDA包 SODEGIRpreprocessingGE()函数中文帮助文档(中英文对照)

[复制链接]
发表于 2012-2-26 11:25:56 | 显示全部楼层 |阅读模式
SODEGIRpreprocessingGE(PREDA)
SODEGIRpreprocessingGE()所属R语言包:PREDA

                                         Wrapper function for gene expression data preprocessing for SODEGIR analysis
                                         包装功能的基因表达数据预处理SODEGIR分析

                                         译者:生物统计家园网 机器人LoveR

描述----------Description----------

Wrapper function for gene expression data preprocessing for SODEGIR analysis
包装功能的基因表达数据预处理SODEGIR分析


用法----------Usage----------


SODEGIRpreprocessingGE(SampleInfoFile = NULL, CELfiles_dir = NULL,
AffyBatchInput = NULL, custom_cdfname, arrayNameColumn = NULL,
sampleNameColumn = NULL, classColumn,
referenceGroupLabel, statisticType, optionalAnnotations = NULL,
retain.chrs = NULL, reference_position_type = "median",
testedTail = "both", singleSampleOutput = TRUE,
varianceAll=FALSE)



参数----------Arguments----------

参数:SampleInfoFile
Path to sample info file  
样品info文件的路径


参数:CELfiles_dir
Path to directory containing raw CEL data files for Affymetrix arrays  
Affymetrix公司的阵列包含原料CEL的数据文件目录的路径


参数:AffyBatchInput
Alternatively input raw data can be provided as an AffyBatch object. In this case sample classes will be inferred from phenodata contained in AffyBatch object. In particular classColumn parameter will refer to the column in pData(AffyBatchInput) object.  
另外,输入原始数据,可以提供作为AffyBatch对象。在这种情况下,样本类将推断从载phenodata AffyBatch对象。尤其classColumn参数将参照PDATA(AffyBatchInput)对象中的列。


参数:custom_cdfname
Specify the cdf library to be used for data preprocessing  
指定用于数据预处理的CDF库


参数:arrayNameColumn
Column of sampleinfo file containing the name of raw data (CEL) files  
sampleinfo文件名称(CEL)的原始数据列包含文件


参数:sampleNameColumn
Column of sampleinfo file containing the name to be used for samples labels  
sampleinfo文件列载样品标签中使用的名称


参数:classColumn
Column of sampleinfo file containing the label of sample classes. If input raw data are provided as an AffyBatch object, this parameter refers intead to the column in pData(AffyBatchInput) object.  
sampleinfo文件包含的样本类的标签栏。如果输入的原始数据提供作为AffyBatch对象,这个参数是指intead PDATA(AffyBatchInput)对象中列。


参数:referenceGroupLabel
Specify which class label is used for the reference sample used in computing statistics for differential expression.  
指定用于计算统计差异表达的参考样本的类标签。


参数:statisticType
Stastistic for differential expression that is computed on input data. Possible values are "tstatistic", "SAM" (SAM statistical score for differential expression), "FC" (Fold Change), "FCmedian" (fold change computed on medians)  
差异表达stastistic对输入数据进行计算。可能的值是“tstatistic”,“萨姆”(SAM差异表达的统计得分),“财委会”(倍数更改),“FCmedian”(中位数计算倍数变化)


参数:optionalAnnotations
Character vector to select additional annotations fields to be included into the GenomicAnnotations object.  
被列入到GenomicAnnotations对象的特征向量来选择额外的注释字段。


参数:retain.chrs
Numeric vector, containing the list of chromosomes selected for the output GenomicAnnotations object. E.g. set retain.chrs=1:22 to limit the GenomicAnnotations object to chromosomes from 1 to 22. This might be ueseful to limit GenomiAnnotations objects to autosomic chromosomes.  
数字向量,列表中包含的为输出GenomicAnnotations对象选择的染色体。例如设置retain.chrs = 1时22分,以限制GenomicAnnotations对象从1到22条染色体。这限制GenomiAnnotations的对象autosomic染色体可能是ueseful。


参数:reference_position_type
Specify which genomic coordinate must be used as reference position for PREDA analysis. Possible values are "start", "end", "median", "strand.start" or "strand.end".  "strand.start" is strand specific start: i.e. start on positive strand but end on negative strand. "strand.end" is strand specific end.  
指定的基因坐标必须用于参考普雷达分析位置。可能的值是“开始”,“结束”,“中位数”,“strand.start”或“strand.end”。 “strand.start”是钢绞线的具体启动:即正股负链上月底开始。 “strand.end”是链特定的结束。


参数:testedTail
Specify what tail of the distribution will be tested for significantly extreme values in PREDA analysis. Possible values are "both", "upper" or "lower".  
指定将普雷达分析测试显着极端值分布的尾部。可能的值是“既”,“上”或“低”。


参数:singleSampleOutput
Logical, if TRUE a statistic comparing each sample with the reference group is computed.  
逻辑,如果每个样品与参照组比较TRUE统计计算。


参数:varianceAll
This parameter affect the computation only when singleSampleOutput is TRUE.  varianceAll is itself a logical parameter. If TRUE, all pathological (e.g. tumor) samples and all normal (reference) samples are used to estimate  variance in the comparison of individual pathological samples  to the normal reference, as described in the original SODEGIR apper by Bicciato et al. (Nucleic Acids Res. 2009).  The original SODEGIR statistic for Gene Expression was based on the SAM score. Therefore in the current PREDA version the varianceAll=TRUE parameter can be used only for SAM statistic: when singleSampleOutput is TRUE and a different statisticType is used, the variance is actually computed using only the normal (reference) samples.  If FALSE (default value), the computation of statistics for single sample VS reference comparisons only take into account the variance in the reference group of samples.  
这个参数影响的计算,只有当singleSampleOutput为TRUE。 varianceAll本身就是一个逻辑参数。如果是TRUE,所有病变(如肿瘤)的样品和一切正常(参考)样本方差估计在个别的病理标本比较正常的参考,在的原始SODEGIR冲击片雷管描述由Bicciato等。 (核酸氨基酸水库2009年)。基因表达的原始SODEGIR统计是基于SAM的评分。因此,在当前普雷达版本的varianceAll = true参数,可以只用萨姆统计:为TRUE时singleSampleOutput,和使用不同statisticType,方差计算使用正常(参考)样本。如果为FALSE(默认值),单一样本VS参考比较的统计数据的计算只考虑在参考样本组的差异。


Details

详情----------Details----------

Preprocess raw (CEL) files for Affymetrix gene expression arrays using user defined CDF libraries and RMA normalization.
原料预处理(CEL)的文件Affymetrix基因表达阵列使用用户定义民防库和RMA标准化。

Then statistics for differential expression are computed comparing each sample with the reference group.
然后计算每个样品与参照组比较差异表达的统计数字。

Then annotations are retrieved from the corresponding annotation library.
注释,然后从相应的注释库检索。

Please note this function is a user-friendly preprocessing function for Affy gene expression microarrays. Step by step preprocessing functions can be used with any other platform.
请注意:此功能是用户友好的预处理功能,为Affy基因表达芯片。一步一步预处理功能,可用于任何其他平台。


值----------Value----------

A DataForPREDA object is returned.
返回一个DataForPREDA对象。


作者(S)----------Author(s)----------



Francesco Ferrari




参考文献----------References----------

Mangano, Francesco Ferrari, Luca Beltrame, Ingrid Cifola, Clelia Peano, Aldo Solari, and Cristina Battaglia. A computational procedure to identify significant overlap of differentially expressed and genomic imbalanced regions in cancer datasets. Nucleic Acids Res, 37(15):5057-70, August 2009.

参见----------See Also----------

preprocessingGE, DataForPREDA
preprocessingGE,DataForPREDA


举例----------Examples----------


  ## Not run: [#无法运行:]
require(PREDAsampledata)

CELfilesPath <- system.file("sampledata", "GeneExpression",
package = "PREDAsampledata")

infofile <- file.path(CELfilesPath , "sampleinfoGE_PREDA.txt")

SODEGIRGEDataForPREDA<-SODEGIRpreprocessingGE(SampleInfoFile=
infofile,
CELfiles_dir=CELfilesPath,
custom_cdfname="gahgu133plus2",
arrayNameColumn=1,
sampleNameColumn=2,
classColumn="Class",
referenceGroupLabel="normal",
statisticType="tstatistic",
optionalAnnotations=c("SYMBOL", "ENTREZID"),
retain.chrs=1:22
)


  
## End(Not run)[#结束(不运行)]

转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。


注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
回复

使用道具 举报

您需要登录后才可以回帖 登录 | 注册

本版积分规则

手机版|小黑屋|生物统计家园 网站价格

GMT+8, 2025-2-1 03:51 , Processed in 0.023040 second(s), 16 queries .

Powered by Discuz! X3.5

© 2001-2024 Discuz! Team.

快速回复 返回顶部 返回列表