R语言 RWeka包 evaluate_Weka_classifier()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-9-29 20:55:17

evaluate_Weka_classifier(RWeka)
evaluate_Weka_classifier()所属R语言包：RWeka

                                    Model Statistics for R/Weka Classifiers
                                       R / Weka中分类器的模型统计

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

Compute model performance statistics for a fitted Weka classifier.
Weka中分类的拟合计算模型的性能统计数据。

用法----------Usage----------

evaluate_Weka_classifier(object, newdata = NULL, cost = NULL,
                     numFolds = 0, complexity = FALSE,
                     class = FALSE, seed = NULL, ...)

参数----------Arguments----------

参数：object
a Weka_classifier object.
一个Weka_classifier对象。

参数：newdata
an optional data frame in which to look for variables with which to evaluate.  If omitted or NULL, the training instances are used.
一个可选的数据框中的变量，以评估。如果省略或NULL，培训实例。

参数：cost
a square matrix of (mis)classification costs.
方阵（MIS）的分类成本。

参数：numFolds
the number of folds to use in cross-validation.
交叉验证中使用的倍数的数目。

参数：complexity
option to include entropy-based statistics.
选项，包括基于熵的统计数据。

参数：class
option to include class statistics.
选项，包括类统计数据。

参数：seed
optional seed for cross-validation.
可选的种子进行交叉验证。

参数：...
further arguments passed to other methods (see details).
进一步的参数传递给其他方法（见详情）。

Details

详细信息----------Details----------

The function computes and extracts a non-redundant set of performance  statistics that is suitable for model interpretation. By default the  statistics are computed on the training data.
该函数计算并提取一个非冗余的性能统计数据是合适的模型解释。默认情况下，计算统计数据的培训资料。

Currently argument ... only supports the logical variable normalize which tells Weka to normalize the cost matrix so that the cost of a correct classification is zero.
目前参数...仅支持逻辑变量normalize它告诉Weka中标准化的成本矩阵，以便正确分类的成本是零。

Note that if the class variable is numeric only a subset of the statistics are available. Arguments complexity and class are then not applicable and therefore ignored.
请注意，如果类变量是数字只有一小部分的统计数据是可用的。参数complexity和class然后不适用，因此被忽略。

值----------Value----------

An object of class Weka_classifier_evaluation, a list of the following components:
类的一个对象Weka_classifier_evaluation，以下组件的列表：

参数：string
character, concatenation of the string representations of the performance statistics.
字符，串联的性能统计数据的字符串表示形式。

参数：details
vector, base statistics, e.g., the percentage of instances correctly classified, etc.
矢量，基部统计，例如，实例百分比正确分类，等等。

参数：detailsComplexity
vector, entropy-based statistics (if selected).
矢量，基于熵的统计数据（如果选择）。

参数：detailsClass
matrix, class statistics, e.g., the true positive rate, etc., for each level of the response variable (if selected).
矩阵，类统计数据，例如，真正的阳性率等，为每个级别的响应变量（如果选择）。

参数：confusionMatrix
table, cross-classification of true and predicted classes.
表，交叉分类的真实和预测类。

参考文献----------References----------

Data Mining: Practical Machine Learning Tools and Techniques. 2nd Edition, Morgan Kaufmann, San Francisco.

实例----------Examples----------

## Use some example data.[＃使用一些示例数据。]
w <- read.arff(system.file("arff","weather.nominal.arff",
         package = "RWeka"))

## Identify a decision tree.[确定决策树。]
m <- J48(play~., data = w)
m

## Use 10 fold cross-validation.[使用10倍交叉验证。]
e <- evaluate_Weka_classifier(m,
                           cost = matrix(c(0,2,1,0), ncol = 2),
                           numFolds = 10, complexity = TRUE,
                           seed = 123, class = TRUE)
e
summary(e)
e$details

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册