R语言 TWIX包 tune.TWIX()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-10-1 13:14:41

tune.TWIX(TWIX)
tune.TWIX()所属R语言包：TWIX

                                    Parameter tuning via K-fold cross-validation.
                                       通过K-折交叉验证的参数整定。

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

This function tunes hyperparameters: minbuck, cp for TWIX or rpart, mtry for randomForest.
这个函数的曲调的的超minbuck，CP TWIX或rpart，mtry randomForest。

用法----------Usage----------

tune.TWIX(formula, data = NULL, minbuck = seq(5,20,by=5),
      maxdepth=30, Devmin=0.0, splitf="deviance",
      method="deviance", topn.method="single",
      topN=1, tol=0.25, cp=0.0, xval=10, runs = 1,
      trace.plot=FALSE, score=1, predict="best",
      cluster=NULL, seed.cluster=NULL, multicore=FALSE)

tune.cp.TWIX(formula, data = NULL, cp=seq(0.01,0.1,length=10),
      minbuck=1, maxdepth=30, Devmin=0.0, splitf="deviance",
      method="deviance", topn.method="single",
      topN=1, tol=0, xval=10, runs = 1, trace.plot=FALSE,
      score=1, predict="best", cluster=NULL,
      seed.cluster=NULL, multicore=FALSE)

tune.rpart(formula, data = NULL, minbuck=seq(5,20,by=5),
      parms=list(split="information"), maxdepth=30,
      cp=0.0, xval=10, runs=1, trace.plot=FALSE,
      cluster=NULL, seed.cluster=NULL, multicore=FALSE)

tune.cp.rpart(formula, data = NULL, cp=seq(0,0.3,0.05),
      minbuck=0, parms=list(split="information"),
      maxdepth=30, xval=10, runs=10, trace.plot=FALSE,
      cluster=NULL, seed.cluster=NULL, multicore=FALSE)

tune.RF(formula, data = NULL, mtry = 2

ncol(data)-1),
 ntree=500, replace=TRUE, xval=10, runs = 1,
 trace.plot = FALSE, cluster=NULL, seed.cluster=NULL,
 multicore=FALSE)

参数----------Arguments----------

参数：formula
formula of the form y ~ x1 + x2 + ..., where y must be a factor and x1,x2,... are numeric or factor.
公式的形式y ~ x1 + x2 + ...，其中y必须是一个因素，x1,x2,...都是数字或因素。

参数：data
an optional data frame containing the variables in the model.
一个可选的数据框包含在模型中的变量。

参数：minbuck
the sampling space for parameter minbuck.
样本空间的参数minbuck。

参数：maxdepth
set the maximum depth of the final tree.
设定最终树的最大深度。

参数：xval
number of cross-validations.
一些交叉验证。

参数：runs
number of runs.
的运行次数。

参数：splitf
kind of the splitting function to be used. It can be one of "deviance"(default) or "p-adj". If set to "p-adj", the p-value adjusted classification tree will be performed.
种的分离功能被使用。它可以是之一"deviance"（默认）或"p-adj"。如果设置为"p-adj"，p值调整分类树将被执行。

参数：Devmin
the minimum improvement on entropy by splitting. If "splitf" set to "p-adj", "Devmin" will be the significance level alpha.
最低的改进熵的分裂。如果"splitf"设置为"p-adj"，"Devmin"将是显着性水平α的。

参数：method
Which split points will be used? This can be "deviance" (default), "grid" or "local". If the method is set to: "local" - the program uses the local maxima of the split function (entropy), "deviance" - all values of the entropy, "grid" - grid points.
将使用哪一个分割点呢？这可能是"deviance"（默认），"grid"或"local"。如果method设置为： "local" - 该程序使用split函数的局部最大值（熵），参考"deviance"“ - 所有的熵值， "grid" - 网格点。

参数：topn.method
one of "complete"(default) or "single". A specification of the consideration of the split points. If set to "complete" it uses split points from all variables, else it uses split points per variable.
"complete"（默认）或"single"。一个规范的分割点的代价。如果设置为"complete"使用分割点，从所有的变量，否则它使用的分割点，每个变量。

参数：topN
integer vector. How many splits will be selected and at which level?
整数向量。多少拆分将被选中，在哪一级？

参数：tol
parameter, which will be used, if topn.method is set to "single".
参数，该参数将被使用，如果topn.method设置为"single"。

参数：cp
complexity parameter. See rpart.control.
复杂的参数。见rpart.control。

参数：score
Specifies the method for model selection. See TWIX.
指定模型选择的方法。见TWIX。

参数：predict
a string that specifies what predicion mehod will be used. Possible values are "best" for a best single-tree prediction and "ensemble" for prediction based on all trees (see bagg).
字符串指定了将用于predicion mehod，。可能的值是“最好”的最佳单树预测，预测的“合奏”的基础上所有的树木（见bagg）。

参数：cluster
the name of the snow cluster, if parallel computing will be used
雪聚类的名称，如果将被用于并行计算

参数：seed.cluster
an integer to be supplied to set.seed, or NULL not to set reproducible seeds.
要被提供给set.seed，或NULL不设置重现种子的整数。

参数：multicore
a logical value for parallel execution with package multicore.
一逻辑值，包multicore的并行执行。

参数：parms
optional parameters for the splitting function. See rpart.
可选参数的分裂功能。见rpart。

参数：mtry
Number of variables randomly sampled as candidates at each split. See randomForest.
随机取样，每个分割的候选人数的变量。见randomForest。

参数：ntree
Number of trees to grow. See randomForest.
树木生长的数量。见randomForest。

参数：replace
a logical indicating whether sampling of observations is done with or without replacement.
逻辑观测的采样是否完成或没有更换。

参数：trace.plot
Should trace plot be ploted?
跟踪图ploted？

参见----------See Also----------

plot.TWIX, TWIX
plot.TWIX，TWIX

实例----------Examples----------

library(mlbench)
data(PimaIndiansDiabetes2)
Pima <- na.omit(PimaIndiansDiabetes2)

tune.TWIX(diabetes~.,data=Pima,minbuck=c(5,10,15,20,25),runs=5)

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册