suda2(sdcMicro)
suda2()所属R语言包:sdcMicro
Suda2: Detecting Special Uniques
Suda2:检测特种唯一的
译者:生物统计家园网 机器人LoveR
描述----------Description----------
SUDA risk measure for data from (stratified) simple random sampling.
数据(分层)简单随机抽样的的SUDA风险措施。
用法----------Usage----------
suda2(data,variables=NULL,missing=-999,DisFraction=0.01)
参数----------Arguments----------
参数:data
object of class “data.frame”
对象类的“数据框”
参数:variables
Categorical (key) variables. Either the column names or and index of the variables to be used for risk measurement.
分类(关键)变量。要么的列名或变量的指数要用于风险计量。
参数:missing
Missing value coding in the given data set.
在给定的数据集缺失值编码。
参数:DisFraction
It is the sampling fraction for the simple random sampling, and the common sampling fraction for stratified sampling. By default, it's set to 0.01.
这是简单随机抽样,和常见的抽样比为分层抽样,抽样比为。默认情况下,它被设置为0.01。
Details
详细信息----------Details----------
Suda 2 is a recursive algorithm for finding Minimal Sample Uniques. The algorithm generates all possible variable subsets of defined categorical key variables and scans them for unique patterns in the subsets of variables. The lower the amount of variables needed to receive uniqueness, the higher the risk of the corresponding observation.
苏达是一个递归算法寻找最小的样品唯一。该算法生成所有可能的变量子集的定义分类的关键变量,变量的子集扫描他们的独特模式。量越低的变量需要接收的独特性,相应的观察的风险越高。
值----------Value----------
参数:ContributionPercent
The contribution of each key variable to the SUDA score, calculated for each row.
每一个关键变量的贡献,以的SUDA得分,计算出每一行。
参数:score
The suda score.
苏达得分。
参数:disscore
The dis suda score
显示苏达得分
(作者)----------Author(s)----------
Alexander Kowarik based on the C++ code from the
Organisation For Economic Co-Operation And Development.
For the C++ code:
This work is being supported by the International Household
Survey Network and funded by a DGF Grant provided by
the World Bank to the PARIS21 Secretariat at the Organisation for Economic
Co-operation and Development (OECD). This work builds on previous work which
is elsewhere acknowledged.
参考文献----------References----------
A Measure of Disclosure Risk for Microdata. Journal of the Royal Statistical Society: Series B (Statistical Methodology), Vol. 64 (4), pp 855–867.
SUDA: A Program for Detecting Special Uniques, Using DIS to Modify the Classification of Special Uniques
A recursive search algorithm for statistical disclosure assessment. Data Min Knowl Disc 16:165 – 196
实例----------Examples----------
data(testdata)
data_suda2 <- suda2(testdata,variables=c("urbrur","roof","walls","water","sex"))
data_suda2
summary(data_suda2)
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|