R语言 RRF包 rrfImpute()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-9-28 21:08:28

rrfImpute(RRF)
rrfImpute()所属R语言包：RRF

                                    Missing Value Imputations by RRF
                                       缺失值的估算RRF

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

Impute missing values in predictor data using proximity from RRF.
填补缺失值，从RRF的预测数据接近。

用法----------Usage----------

## Default S3 method:[默认方法]
rrfImpute(x, y, iter=5, ntree=300, ...)
## S3 method for class 'formula'[类formula的方法]
rrfImpute(x, data, ..., subset)

参数----------Arguments----------

参数：x
A data frame or matrix of predictors, some containing NAs, or a formula.
一个数据框或矩阵的预测，一些含NA的，或者一个公式。

参数：y
Response vector (NA's not allowed).
响应矢量（NA的不允许的）。

参数：data
A data frame containing the predictors and response.
一个数据框包含的预测和响应。

参数：iter
Number of iterations to run the imputation.
迭代次数运行归集。

参数：ntree
Number of trees to grow in each iteration of RRF.
树木生长在每次迭代的快速反应部队的数目。

参数：...
Other arguments to be passed to RRF.
其他参数传递给RRF。

参数：subset
A logical vector indicating which observations to use.
逻辑向量，观测使用。

Details

详细信息----------Details----------

The algorithm starts by imputing NAs using na.roughfix.  Then RRF is called with the completed data.  The proximity matrix from the RRF is used to update the imputation of the NAs.  For continuous predictors, the imputed value is the weighted average of the non-missing obervations, where the weights are the proximities.  For categorical predictors, the imputed value is the category with the largest average proximity.  This process is iterated iter times.
该算法开始归咎于使用NAna.roughfix的。 RRF被称为完整的数据。快速反应部队的相似性矩阵用于更新归集的NA的。对于连续的预测，估算值是加权平均的非的失踪obervations，其中权重的近似。对于分类预测，估算值是平均接近最大的类别。反复进行该处理iter倍。

Note: Imputation has not (yet) been implemented for the unsupervised case.  Also, Breiman (2003) notes that the OOB estimate of error from RRF tend to be optimistic when run on the data matrix with imputed values.
注：插补（还）没有在无人监督的情况下实施。此外，的布雷曼博士（2003年）指出，OOB错误估计RRF上运行时，数据矩阵估算值倾向于乐观。

值----------Value----------

A data frame or matrix containing the completed data matrix, where NAs are imputed using proximity from RRF.  The first column contains the response.
一个数据框或矩阵包含完整的数据矩阵，其中NA的打杀从RRF使用接近。第一列包含的响应。

（作者）----------Author(s)----------

Andy Liaw

参考文献----------References----------

Random Forest V4.0. http://oz.berkeley.edu/users/breiman/Using_random_forests_v4.0.pdf

参见----------See Also----------

na.roughfix.
na.roughfix。

实例----------Examples----------

data(iris)
iris.na <- iris
set.seed(111)
## artificially drop some data values.[＃人为删除一些数据值。]
for (i in 1:4) iris.na[sample(150, sample(20)), i] <- NA
set.seed(222)
iris.imputed <- rrfImpute(Species ~ ., iris.na)
set.seed(333)
iris.rf <- RRF(Species ~ ., iris.imputed)
print(iris.rf)

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册

R语言 RRF包 rrfImpute()函数中文帮助文档(中英文对照)

浏览过的版块