R语言 rminer包 holdout()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-9-27 00:01:37

holdout(rminer)
holdout()所属R语言包：rminer

                                       Computes indexes for holdout data split into training and test sets.
                                       计算到训练和测试集分割为维持数据的索引。

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

Computes indexes for holdout data split into training and test sets.
计算到训练和测试集分割为维持数据的索引。

用法----------Usage----------

参数----------Arguments----------

参数：y
desired target: numeric vector; or factor – then a stratified holdout is applied (i.e. the proportions of the classes are the same for each set).
所需的目标：数值向量;或因子 - 然后分层截留被施加的（即的类的比例为每一组是相同的）。

参数：ratio
split ratio (in percentage – sets the training set size; or in total number of examples – sets the test set size).
分流比（以百分比 - 设置训练集的大小，或在实施例的总数 - 设置测试集的大小）。

参数：internalsplit
if TRUE then the training data is further split into training and validation sets. The same ratio parameter is used for the internal split.
如果TRUE然后训练数据被进一步划分为训练和验证集。同样的ratio参数被用于内部分割。

参数：mode
sampling mode. Options are:
取样模式。选项有：

random – standard randomized holdout;
random  - 标准随机坚持自己的;

order – static mode, where the first examples are used for training and the later ones for testing (useful for time series data);
order - 静态模式下，第一个例子是用于培训和后来的测试（时间序列数据非常有用）;

incremental – incremental retraining mode (e.g. useful for spam detection), similar to order except that ratio=batch size and iter is used. In each iteration, the training set size grows, while the test set size is equal to ratio except for the last batch (where it may be smaller).
incremental - 增量再培训模式（如有用的垃圾邮件检测），类似order除了ratio=批量大小和iter使用的。在每一次迭代中，训练集规模的增长，而测试集的大小等于ratio除了最后一批（在那里它可能会更小）。

参数：iter
iteration of the incremental retraining mode (only used when mode=="incremental", typically iter is set within a cycle, see the example below).
迭代增量的再培训模式（仅用于当mode=="incremental"，通常iter设置在一个周期内，见下面的例子）。

Details

详细信息----------Details----------

Computes indexes for holdout data split into training and test sets (if y is a factor then a stratified holdout is applied).
计算指标维持数据分割成训练集和测试集（如果y是一个因素，然后分层抵抗的应用）。

值----------Value----------

A list with the components:
的组件列表：

$tr – numeric vector with the training examples indexes;
$的TR  - 数字矢量的训练例子指标;

$ts -- numeric vector with the test examples indexes;
TS  - 数字矢量的测试例子指标;

$itr -- numeric vector with the internal training examples indexes;
ITR  - 内部培训的例子指标的数值向量;

$val -- numeric vector with the internal validation examples indexes; </ul>
VAL  - 数字矢量的内部验证的例子指标; </ ul>

（作者）----------Author(s)----------

Paulo Cortez <a href="http://www3.dsi.uminho.pt/pcortez">http://www3.dsi.uminho.pt/pcortez</a>

参考文献----------References----------

See <code>fit</code>.

参见----------See Also----------

fit, predict.fit, mining, mgraph, mmetric, savemining, Importance.
fit，predict.fit，mining，mgraph，mmetric，savemining，Importance。

实例----------Examples----------

### simple examples:[＃＃简单的例子：]
H=holdout(1:10,ratio=2,internal=TRUE,mode="order")
print(H)
H=holdout(1:10,ratio=2/3,internal=TRUE,mode="order")
print(H)
H=holdout(1:10,ratio=2/3,internal=TRUE,mode="random")
print(H)
H=holdout(1:10,ratio=2/3,internal=TRUE,mode="random")
print(H)

### classification example[＃＃分类示例]
data(iris)
# random stratified holdout[随机分层抵抗的]
H=holdout(iris$Species,ratio=2/3,internal=TRUE)
print(summary(iris[H$itr,]))
print(summary(iris[H$val,]))
print(summary(iris[H$tr,]))
print(summary(iris[H$ts,]))
M=fit(Species~.,iris[H$tr,],model="dt") # training data only[训练数据只]
P=predict(M,iris[H$ts,]) # test data[测试数据]
print(mmetric(iris$Species[H$ts],P,"CONF"))

### regression example with incremental training[＃＃回归与增量训练的例子]
ts=c(1,4,7,2,5,8,3,6,9,4,7,10,5,8,11,6,9)
d=CasesSeries(ts,c(1,2,3))
for(b in 1:3) # iterations[迭代]
{
H=holdout(d$y,ratio=4,mode="incremental",iter=b)
print(H)
M=fit(y~.,d[H$tr,],model="mlpe",search=2)
P=predict(M,d[H$ts,])
cat("batch :",b,"TR size:",length(H$tr),"TS size:",
   length(H$ts),"mae:",mmetric(d$y[H$ts],P,"MAE"),"\n")
}

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册