R语言 VariantAnnotation包 PolyPhenDb-class()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-2-26 15:57:02

PolyPhenDb-class(VariantAnnotation)
PolyPhenDb-class()所属R语言包：VariantAnnotation

                                    PolyPhenDb objects
                                       PolyPhenDb对象

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

The PolyPhenDb class is a container for storing a connection to a PolyPhen  sqlite database.
PolyPhenDb类是一个用于存储连接到PolyPhen SQLite数据库的容器。

Details

详情----------Details----------

PolyPhen (Polymorphism Phenotyping) is a tool which predicts the possible  impact of an amino acid substitution on the structure and function of a  human protein by applying empirical rules to the sequence, phylogenetic  and structural information characterizing the substitution.
PolyPhen（表型多态性）是一个工具，它预测的氨基酸替代的运用经验规则的序列，进化和结构信息表征的替代人类蛋白的结构和功能上可能造成的影响。

PolyPhen makes its predictions using UniProt features, PSIC profiles scores  derived from multiple alignment and matches to PDP or PQS structural  databases. The procedure can be roughly outlined in the following  steps, see the references for complete details,
PolyPhen使得其预测使用UniProt功能，从多个对齐PSIC型材所得分数和比赛的PDP或PQS的结构数据库。这个过程大致可以概述以下步骤，请参阅完整的详细信息参考，

sequence-based characterization of substitution site
基于序列表征替代网站

calculation of PSIC profile scores for two amino acid variants
PSIC轮廓分数计算两个氨基酸变异

calculation of structural parameters and contacts
计算结构参数和接触

prediction
预测

PolyPhen uses empirically derived rules to predict that a non-synonymous  SNP is
PolyPhen使用经验得出的规则来预测一个非同义SNP是

probably damaging : it is with high confidence supposed to affect protein function or structure
可能是破坏性的：它是应该影响蛋白质的功能或结构的高信心

possibly damaging : it is supposed to affect protein function or structure
可能是破坏性的：它应该是影响蛋白质的功能或结构

benign : most likely lacking any phenotypic effect
良性：最有可能没有任何表型的影响

unknown : when in some rare cases, the lack of data do not allow PolyPhen to make a prediction
未知：缺乏数据时，在某些罕见的情况下，不容许PolyPhen作出预测

方法----------Methods----------

In the code below, x is a PolyPhenDb object.
在下面的代码，x是PolyPhenDb对象。

metadata(x): Returns x's metadata in a data frame.
metadata(x)：返回x的一个数据框中的元数据。

cols(x): Returns the names of the cols that can be used to subset the data columns. For column descriptions see ?PolyPhenDbColumns.
cols(x)：返回可以使用的数据列的子集cols的名字。列说明?PolyPhenDbColumns。

keys(x): Returns the names of the keys that can be used to subset the data rows. The keys values are the rsid's.
keys(x)：返回keys可以用于数据行的一个子集的名称。 keys值是RSID的。

select(x, keys = NULL, cols = NULL, ...): Returns a subset of data defined by the character vectors keys  and cols. If no keys are supplied, all rows are returned. If no cols are supplied, all columns are returned. See ?PolyPhenDbColumns for column descriptions.
select(x, keys = NULL, cols = NULL, ...)：返回一个数据子集定义的字符向量keys和cols。如果没有keys提供，返回所有的行。如果没有cols提供的，所有列返回。看到?PolyPhenDbColumns列的说明。

duplicateRSID(x): Returns a named list of duplicate rsid groups. The names are the  keys, the list elements are the rsid's that have been  reported as having identical chromosome position and alleles and  therefore translating into the same amino acid residue substitution.
duplicateRSID(x)：返回重复RSID群体的命名列表。名字是keys，列表中的元素是RSID已报道具有相同的染色体上的位置和等位基因，因此翻译成相同的氨基酸残基替代。

作者（S）----------Author(s)----------

Valerie Obenchain <vobencha@fhcrc.org>

参考文献----------References----------

http://genetics.bwh.harvard.edu/pph2/dokuwiki/
Kondrashov AS, Sunyaev SR. Nat Methods 7(4):248-249 (2010).
survey. Nucleic Acids Res 30(17):3894-3900 (2002).
Kuznetsov EN. PSIC: profile extraction from sequence alignments with position-specific counts of independent observations. Protein Eng 12(5):387-394 (1999).

参见----------See Also----------

?PolyPhenDbColumns
?PolyPhenDbColumns

举例----------Examples----------

library(PolyPhen.Hsapiens.dbSNP131)

## metadata[＃元数据]
metadata(PolyPhen.Hsapiens.dbSNP131)

## available rsid's [＃可用RSID的]
head(keys(PolyPhen.Hsapiens.dbSNP131))

## column descriptions found at ?PolyPhenDbColumns[＃列说明？PolyPhenDbColumns]
cols(PolyPhen.Hsapiens.dbSNP131)

## subset on keys and cols [＃键和cols的子集]
subst <- c("AA1", "AA2", "PREDICTION")
rsids <- c("rs2142947", "rs4995127", "rs3026284")
select(PolyPhen.Hsapiens.dbSNP131, keys=rsids, cols=subst)

## retrieve substitution scores [＃检索替代分数]
subst <- c("IDPMAX", "IDPSNP", "IDQMIN")
select(PolyPhen.Hsapiens.dbSNP131, keys=rsids, cols=subst)

## retrieve the PolyPhen-2 classifiers [＃检索PolyPhen-2分类。]
subst <- c("PPH2CLASS", "PPH2PROB", "PPH2FPR", "PPH2TPR", "PPH2FDR")
select(PolyPhen.Hsapiens.dbSNP131, keys=rsids, cols=subst)

## duplicate groups of rsid's[＃重复RSID的群体。]
duplicateRSID(PolyPhen.Hsapiens.dbSNP131, c("rs71225486", "rs1063796"))

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册