R TraMineR package: help documentation for the seqici() function

Posted on 2012-10-1 11:38:58
seqici(TraMineR)
R package providing seqici(): TraMineR

                                        Complexity index of individual sequences

                                         Translator: 生物统计家园网, robot LoveR

----------Description----------

Computes the complexity index, a composite measure of sequence complexity. The index uses the number of transitions in the sequence as a measure of the complexity induced by the state ordering, and the longitudinal entropy as a measure of the complexity induced by the state distribution in the sequence.


----------Usage----------


seqici(seqdata, with.missing=FALSE)



----------Arguments----------

Argument: seqdata
a sequence object as returned by the seqdef function.


Argument: with.missing
if set to TRUE, missing status (gaps in sequences) is handled as an additional state when computing the state distribution and the number of transitions in the sequence.


----------Details----------

The complexity index C(s) of a sequence s is

    C(s) = \sqrt{\frac{q(s)}{q_{max}} \, \frac{h(s)}{h_{max}}}

where q(s) is the number of transitions in the sequence, q_{max} the maximum number of transitions, h(s) the within-sequence (longitudinal) entropy, and h_{max} the theoretical maximum entropy, which is h_{max} = -\log(1/|A|) = \log|A|, with |A| the size of the alphabet.

The index C(s) is the geometric mean of its two normalized components. The minimum value of 0 can only be reached by a sequence made of a single distinct state, which therefore contains 0 transitions and has an entropy of 0. The maximum value of 1 is reached when the following two conditions are both fulfilled: i) each state in the alphabet is present in the sequence and the total durations are uniform, that is, each equal to l/a, where l is the sequence length and a the number of states in the alphabet; and ii) the number of transitions in the sequence is equal to l-1, that is, the length l_d of the DSS (the sequence of distinct successive states) is equal to the sequence length l.
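To make the definition above concrete, here is a minimal base-R sketch that computes the index by hand for a single sequence. It is an illustration only, not TraMineR's implementation: the function name `complexity_index` and its `alphabet` argument are hypothetical, the sequence is assumed to be a plain character vector of length at least 2 with no missing values, and q_{max} is taken to be l-1 as implied by condition ii).

```r
# Hypothetical helper, not TraMineR's own code: complexity index of ONE
# sequence given as a character vector with no missing values.
complexity_index <- function(s, alphabet) {
  l <- length(s)
  stopifnot(l >= 2, length(alphabet) >= 2, all(s %in% alphabet))

  # q(s): number of positions where the state differs from the previous one
  q <- sum(s[-1] != s[-l])
  q_max <- l - 1                    # assumption: maximum number of transitions

  # h(s): entropy of the state distribution within the sequence
  p <- as.numeric(table(factor(s, levels = alphabet))) / l
  p <- p[p > 0]                     # treat 0 * log(0) as 0
  h <- -sum(p * log(p))
  h_max <- log(length(alphabet))    # equals -log(1/|A|)

  # geometric mean of the two normalized components
  sqrt((q / q_max) * (h / h_max))
}

complexity_index(c("A", "A", "A", "A"), c("A", "B"))  # single state: 0
complexity_index(c("A", "A", "B", "B"), c("A", "B"))  # sqrt(1/3), about 0.577
complexity_index(c("A", "B", "A", "B"), c("A", "B"))  # both conditions met: 1
```

The three calls cover the boundary cases described above: a single-state sequence gives 0, a sequence with uniform durations but only one transition falls strictly between 0 and 1, and a sequence that alternates at every position with uniform durations reaches 1. For real data, seqici() on a seqdef object remains the function to use.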


----------Value----------

a vector of length equal to the number of sequences in seqdata, containing the complexity index value of each sequence.


----------Author(s)----------


Alexis Gabadinho



----------References----------


N.S. (2010). "Indice de complexité pour le tri et la comparaison de séquences catégorielles", In Extraction et gestion des connaissances (EGC 2010), Revue des nouvelles technologies de

----------See Also----------

seqient, seqST


----------Examples----------


## Load the package that provides seqdef(), seqici() and the example data
library(TraMineR)

## Creating a sequence object from the mvad data set
data(mvad)
mvad.labels <- c("employment", "further education", "higher education",
                    "joblessness", "school", "training")
mvad.scodes <- c("EM","FE","HE","JL","SC","TR")
mvad.seq <- seqdef(mvad, 15:86, states=mvad.scodes, labels=mvad.labels)

## Computing the complexity index of each sequence
mvad.ci <- seqici(mvad.seq)
summary(mvad.ci)
hist(mvad.ci)

## Example using with.missing argument
data(ex1)
ex1.seq <- seqdef(ex1, 1:13)
seqici(ex1.seq)
seqici(ex1.seq, with.missing=TRUE)

When reposting, please credit the source: 生物统计家园网 (http://www.biostatistic.net).

