R语言 RTextTools包 cross_validate()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-9-28 22:41:47

cross_validate(RTextTools)
cross_validate()所属R语言包：RTextTools

                                       used for cross-validation of various algorithms.
                                       使用各种算法进行交叉验证。

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

Performs n-fold cross-validation of specified algorithm.
执行交叉验证指定的算法n倍。

用法----------Usage----------

cross_validate(container, nfold, algorithm = c("SVM", "SLDA", "BOOSTING",
"BAGGING", "RF", "GLMNET", "TREE", "NNET", "MAXENT"), seed = NA,
method = "C-classification", cross = 0, cost = 100, kernel = "radial",
maxitboost = 100, maxitglm = 500, size = 1, maxitnnet = 1000, MaxNWts = 10000,
rang = 0.1, decay = 5e-04, ntree = 200, l1_regularizer = 0, l2_regularizer = 0,
use_sgd = FALSE, set_heldout = 0, verbose = FALSE)

参数----------Arguments----------

参数：container
Class of type matrix_container-class generated by the create_container function.
类类型matrix_container-class：create_container功能所产生的。

参数：nfold
Number of folds to perform for cross-validation.
进行交叉验证的折数。

参数：algorithm
A string specifying which algorithm to use. Use print_algorithms to see a list of options.
一个字符串，指定要使用的算法。使用print_algorithms看到一个选项列表。

参数：seed
Random seed number used to replicate cross-validation results.
随机种子数，用于复制交叉验证结果。

参数：method
Method parameter for SVM implentation. See e1071 documentation for more details.
SVM软件实现的方法的参数。 e1071文档了解更多信息。

参数：cross
Cross parameter for SVM implentation. See e1071 documentation for more details.
SVM软件实现交叉参数。 e1071文档了解更多信息。

参数：cost
Cost parameter for SVM implentation. See e1071 documentation for more details.
SVM软件实现的成本参数。 e1071文档了解更多信息。

参数：kernel
Kernel parameter for SVM implentation. See e1071 documentation for more details.
SVM软件实现的内核参数。 e1071文档了解更多信息。

参数：maxitboost
Maximum iterations parameter for boosting implentation. See caTools documentation for more details.
最大迭代参数，为促进软件实现。 caTools文档了解更多信息。

参数：maxitglm
Maximum iterations parameter for glmnet implentation. See glmnet documentation for more details.
glmnet软件实现的最大迭代参数。 glmnet文档了解更多信息。

参数：size
Size parameter for neural networks implentation. See nnet documentation for more details.
尺寸参数神经网络的软件实现。 nnet文档了解更多信息。

参数：maxitnnet
Maximum iterations for neural networks implentation. See nnet documentation for more details.
神经网络软件实现的最大迭代次数。 nnet文档了解更多信息。

参数：MaxNWts
Maximum number of weights parameter for neural networks implentation. See nnet documentation for more details.
最大的权重参数的神经网络软件实现。 nnet文档了解更多信息。

参数：rang
Range parameter for neural networks implentation. See nnet documentation for more details.
神经网络软件实现的范围参数。 nnet文档了解更多信息。

参数：decay
Decay parameter for neural networks implentation. See nnet documentation for more details.
衰减参数神经网络的软件实现。 nnet文档了解更多信息。

参数：ntree
Number of trees parameter for RandomForests implentation. See randomForest documentation for more details.
数的树木参数RandomForests软件实现。 randomForest文档了解更多信息。

参数：l1_regularizer
An numeric turning on L1 regularization and setting the regularization parameter. A value of 0 will disable L1 regularization. See maxent documentation for more details.
numeric打开L1正规化和正规化参数设置。值为0将禁用L1正规化。 maxent文档了解更多信息。

参数：l2_regularizer
An numeric turning on L2 regularization and setting the regularization parameter. A value of 0 will disable L2 regularization. See maxent documentation for more details.
numeric打开L2正规化和正则化参数。值为0将禁用L2正规化。 maxent文档了解更多信息。

参数：use_sgd
A logical indicating that SGD parameter estimation should be used. Defaults to FALSE. See maxent documentation for more details.
Alogical这表明的SGD参数估计应该使用。默认为FALSE的。 maxent文档了解更多信息。

参数：set_heldout
An integer specifying the number of documents to hold out. Sets a held-out subset of your data to test against and prevent overfitting. See maxent documentation for more details.
integer指定的文件数量保持。您的数据，以测试和防止过度拟合设置一个持有出来的子集。 maxent文档了解更多信息。

参数：verbose
A logical specifying whether to provide descriptive output about the training process. Defaults to FALSE, or no output. See maxent documentation for more details.
Alogical指定是否提供描述性的训练过程中的输出。默认为FALSE，或无输出。 maxent文档了解更多信息。

（作者）----------Author(s)----------

Loren Collingwood <lorenc2@uw.edu>, Timothy P. Jurka <tpjurka@ucdavis.edu>

实例----------Examples----------

library(RTextTools)
data(NYTimes)
data <- NYTimes[sample(1:3100,size=100,replace=FALSE),]
matrix <- create_matrix(cbind(data["Title"],data["Subject"]), language="english",
removeNumbers=TRUE, stemWords=FALSE, weighting=weightTfIdf)
container <- create_container(matrix,data$Topic.Code,trainSize=1:75, testSize=76:100,
virgin=FALSE)
svm <- cross_validate(container,2,algorithm="SVM")
maxent <- cross_validate(container,2,algorithm="MAXENT")

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册