R语言 snpStats包 read.pedfile()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-2-26 14:46:20

read.pedfile(snpStats)
read.pedfile()所属R语言包：snpStats

                                       Read a pedfile as "SnpMatrix" object
                                       阅读作为“SnpMatrix”对象pedfile，

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

Reads diallelic data in linkage "pedfile" format, with one line of data per sample (subject) containing six mandatory fields followed by pairs of fields, one pair for each locus,  giving the two alleles observed.
读取在联动“pedfile的”格式的数据，每样本的数据，包含六个必须填写其次是对领域，对每个座位，给两个等位基因观察（主题）一行diallelic。

用法----------Usage----------

read.pedfile(file, n, snps, which, split = "\t| +", sep = ".", na.strings = "0", lex.order = FALSE)

参数----------Arguments----------

参数：file
The input pedfile. This may be (but need not be) gzipped
输入pedfile。这可能是（但不必）gzip压缩

参数：n
(Optional) The number of lines of data to be read. If not supplied the pedfile is read once and rewound to determine how many lines it contains
（可选），数据线，要读。如果不提供pedfile读取一次倒带，以确定它包含多少行

参数：snps
(Optional) Either a character vector giving the names of the loci, or a single character variable giving the name of a locus information file from which these can be read. This file is assumed to be white-space delimited with one line per locus and no header line. If this argument is not supplied, locus names are generated as a numerical sequence, prefixed by locus and a separator character
（可选）字符向量给予的位点，或一个单一的字符变量的轨迹可以读取这些信息文件的名称的名称。这个文件被认为是同一条线每个座位和没有标题行白色的空间分隔。如果这种说法是不提供的，轨迹名称生成一个数值序列，前缀locus和分隔符

参数：which
(Optional) If locus names are to be read from a file, this argument should specify which column contains the names. If not supplied, the first column giving unique locus names is used
（可选）如果轨迹的名字将被从文件中读取，这种说法应指定哪一列包含的名称。如果没有提供，用于提供独特的轨迹名称的第一列

参数：split
A "regexp" specifying how the input pedfile will be split into fields. The default value specifies either a TAB character or one or more spaces
一个“正则表达式”指定输入pedfile如何将分裂成领域。默认值指定一个制表符或一个或多个空格

参数：sep
The separator character used in constructing row and column names of the output SnpMatrix object
用于兴建行和输出SnpMatrix对象的列名的分隔符

参数：na.strings
One or more strings to be set to NA. Any field taking one  of these values will be set to NA
设置一个或多个字符串NA。任何领域采取下列值之一，将设置NA

参数：lex.order
If TRUE, then alleles will be allocated to internal 1 and 2 values in lexographic order. Otherwise they are converted in the order in which they are encountered when reading the file (the default setting)
如果属实，那么等位基因将被分配到内部1和2lexographic顺序值。否则，它们将转换中，他们遇到的读取文件（默认设置）时的顺序

Details

详情----------Details----------

Row names for the output SnpMatrix object and for the accompanying subject description dataframe are taken as the pedigree identifiers, when these provide the required unique identifiers. When these are duplicated, an attempt is made to use the pedigree-member identifiers instead but, when these too are duplicated, row names are obtained by concatenating, with a separator character, the pedigree and pedigree-member identifiers.
输出SnpMatrix对象和随附的主题描述dataframe的行名谱系标识符，当这些提供所需的唯一标识符。当这些被复制，企图血统成员的标识符，而不是使用，但，当这些重复行名称通过连接，分隔符，谱系和血统成员的标识符。

值----------Value----------

A list, comprising
一个列表，包括

参数：genotypes
The output genotype data as an object of class "SnpMatrix". If either the pedigree or pedigree-member identifiers in the ped file are not duplicated, these are used for the row names of the output object. Otherwise these two fields are concatenated, separated by sep
输出作为一个对象类"SnpMatrix"基因型数据。如果ped文件中的系谱或系谱成员的标识符不重复，这些都是用于输出对象的行名。否则，这两个领域连接在一起，分开sep

参数：fam
A dataframe containing the first six fields in the pedfile. The row names will correspond with those of the SnpMatrix
一个dataframe第一在pedfile 6领域。该行的名称将对应与SnpMatrix的

参数：map
A dataframe giving the alleles at each locus. If locus names were obtained from a dataframe read from an existing file, then the allele information is simply appended to this frame. Otherwise a new dataframe is created. The row names will correspond with the column names of the SnpMatrix
一个dataframe给每个座位的等位基因。如果从现有的文件读dataframe取得的轨迹名称，然后等位基因信息只是追加到这个帧。否则创建一个新的dataframe。行名称符合SnpMatrix 列名

注意----------Note----------

This function is written entirely in R and may not be particularly fast. However, it imposes no restrictions on the allele codes recognized.
此功能完全被写入R和可能不是特别快。然而，对认可的等位基因编码没有任何限制。

Homozygous genotypes may be represented in the input file either (a) by coding both alleles to the same value, or (b) setting the second allele to "missing" (as specified by the missing.allele argument). No special provision is made to read XSnpMatrix objects; such data should first be read as a SnpMatrix and then coerced to an XSnpMatrix using new or as.
在输入文件：（一）两个等位基因编码为相同的值，或（二）“失踪”（如通过missing.allele参数指定）设置第二个等位基因纯合子基因型可派。没有特殊的规定是由读取XSnpMatrix对象，这样的数据，首先应读为1 SnpMatrix然后强迫XSnpMatrix用new或as。

作者（S）----------Author(s)----------

David Clayton <a href="mailto:david.clayton@cimr.cam.ac.uk">david.clayton@cimr.cam.ac.uk</a>

参见----------See Also----------

SnpMatrix-class, XSnpMatrix-class
SnpMatrix-class，XSnpMatrix-class

举例----------Examples----------

##[＃]
## No example supplied yet[＃没有例子还提供]
##[＃]

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册

R语言 snpStats包 read.pedfile()函数中文帮助文档(中英文对照)

浏览过的版块