ConsensusSequence(DECIPHER)
ConsensusSequence()所属R语言包:DECIPHER
Create A Consensus Sequence
创建一个共识序列
译者:生物统计家园网 机器人LoveR
描述----------Description----------
Forms a consensus sequence representing a set of sequences.
形成了一个共识序列,代表一组序列。
用法----------Usage----------
ConsensusSequence(myDNAStringSet,
threshold = 0.05,
ambiguity = TRUE,
noConsensusChar = "N",
minInformation = 0.75,
ignoreNonBases = FALSE,
includeTerminalGaps = FALSE,
verbose = TRUE)
参数----------Arguments----------
参数:myDNAStringSet
A DNAStringSet object of aligned sequences.
一个DNAStringSet对象的排列顺序。
参数:threshold
Maximum fraction of sequence information that may be lost in forming the consensus.
最高分数的序列信息,可能会失去形成共识。
参数:ambiguity
Logical specifying whether to consider ambiguity as split between their respective nucleotides. Degeneracy codes are specified in the IUPAC_CODE_MAP.
逻辑指定是否要考虑他们各自的核苷酸之间的分裂歧义。退化代码指定在IUPAC_CODE_MAP。
参数:noConsensusChar
Single character from the DNA_ALPHABET giving the base to use when there is no consensus in a position.
单字符的DNA_ALPHABET给碱基使用时,有没有共识,在一个位置。
参数:minInformation
Minimum fraction of information required to form consensus in each position.
形成共识,在每一个位置的信息的最低分数。
参数:ignoreNonBases
Logical specifying whether to count gap ("-") or mask ("+") characters towards the consensus.
逻辑指定是否算差距(“ - ”)或面罩(“+”)字符对共识。
参数:includeTerminalGaps
Logical specifying whether or not to include terminal gaps ("-" characters on each end of the sequence) into the formation of consensus.
逻辑指定是否包括终端的差距(“ - ”每个序列结束字符),将形成共识。
参数:verbose
Logical indicating whether to print the elapsed time upon completion.
逻辑表示是否打印完成后的运行时间。
Details
详情----------Details----------
Two key parameters control the degree of consensus. The default threshold (0.05) indicates that at least 95% of sequence information will be represented by the consensus sequence. The default minInformation (0.75) specifies that at least 75% of sequences must contain the information in the consensus, otherwise the noConsensusChar is used.
两个关键参数的控制程度的共识。默认threshold(0.05)表明,至少有95%的序列信息,将由代表的共识序列。默认minInformation(0.75)的规定,至少有75%的序列必须包含在协商一致的信息,否则noConsensusChar使用。
If ambiguity = TRUE (the default) then degeneracy codes are split between their respective bases according to the IUPAC_CODE_MAP. For example, an "R" would count as half an "A" and half a "G". If ambiguity = FALSE then degeneracy codes are not considered in forming the consensus. If includeNonBases = TRUE (the default) then gap ("-") and mask ("+") characters are counted towards the consensus, otherwise they are omitted from development of the consensus.
如果ambiguity = TRUE(默认),然后简码分裂根据各自的碱基之间IUPAC_CODE_MAP。例如,一个“R”算作半的“A”和半年的“G”。如果ambiguity = FALSE然后简码没有形成共识。如果includeNonBases = TRUE(默认),那么差距(“ - ”)和掩码(“+”)字符对计算的共识,否则,他们被省略,从发展的共识。
值----------Value----------
A DNAStringSet with a single consensus sequence.
一个DNAStringSet与一个单一的共识序列。
作者(S)----------Author(s)----------
Erik Wright <a href="mailto ECIPHER@cae.wisc.edu">DECIPHER@cae.wisc.edu</a>
参见----------See Also----------
IdConsensus, Seqs2DB
IdConsensus,Seqs2DB
举例----------Examples----------
dna <- DNAStringSet(c("ANGCT-","-ACCT-"))
ConsensusSequence(dna)
# returns "ANSCT-"[回报“ANSCT”]
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|