scattMiss(VIM)
scattMiss() belongs to R package: VIM
Scatterplot with information about missing/imputed values
Translator: 生物统计家园网 robot LoveR
----------Description----------
In addition to a standard scatterplot, lines are plotted for the missing values in one variable. If there are imputed values, they are highlighted.
----------Usage----------
scattMiss(x, delimiter = NULL, side = 1, col = c("skyblue","red",
"orange","lightgrey"), alpha = NULL, lty = c("dashed","dotted"),
lwd = par("lwd"), quantiles = c(0.5, 0.975), inEllipse = FALSE,
zeros = FALSE, xlim = NULL, ylim = NULL, main = NULL, sub = NULL,
xlab = NULL, ylab = NULL, interactive = TRUE, ...)
----------Arguments----------
Argument: x
a matrix or data.frame with two columns.
Argument: delimiter
a character vector used to distinguish variables from the imputation indices of imputed variables (x therefore needs to have colnames). If given, it is used to determine the corresponding imputation index for any imputed variable (a logical vector indicating which values of the variable have been imputed). If such imputation indices are found, they are used for highlighting, and the colors are adjusted according to the colors given for imputed variables (see col).
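The naming convention that delimiter relies on can be illustrated without VIM. The following is a minimal sketch in base R, assuming that each indicator column is named by appending the delimiter (here "_imp", as in the Examples) to the variable name. The toy data frame and the lookup logic are hypothetical and are not taken from VIM's internal code.

```r
# Toy data: two variables plus logical imputation indicators.
# Column names follow the assumed "<variable><delimiter>" convention.
x <- data.frame(
  Air.Temp     = c(27.1, 26.8, 25.0, 24.9),
  Humidity     = c(79.6, 81.2, 88.0, 84.3),
  Air.Temp_imp = c(FALSE, FALSE, TRUE, FALSE),
  Humidity_imp = c(FALSE, TRUE, FALSE, FALSE)
)

delimiter <- "_imp"

# Split the columns into variables and indicator columns.
is_index  <- endsWith(colnames(x), delimiter)
vars      <- colnames(x)[!is_index]
index_for <- paste0(vars, delimiter)

# For each variable, look up the logical vector marking imputed values.
imputed <- lapply(setNames(index_for, vars), function(nm) x[[nm]])
print(imputed)
```

Because the match is made on column names, a matrix or data.frame without colnames cannot carry this information.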
Argument: side
if side=1, a rug representation and vertical lines are plotted for the missing/imputed values in the second variable; if side=2, a rug representation and horizontal lines are plotted for the missing/imputed values in the first variable.
Argument: col
a vector of length four giving the colors to be used in the plot. The first color is used for the scatterplot, and the second and third colors for the rug representation of missing and imputed values, respectively. The second color is also used for the lines for missing values. Imputed values are highlighted with the third color, and the fourth color is used for the ellipses (see "Details"). If only one color is supplied, it is used for the scatterplot, the rug representation and the lines, whereas the default color is used for the ellipses. If a vector of length two is supplied, the default color is used for the ellipses as well.
Argument: alpha
a numeric value between 0 and 1 giving the level of transparency of the colors, or NULL. This can be used to prevent overplotting.
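To see what a transparency level does to a set of colors, the sketch below applies an alpha value to the default colors from the Usage section with adjustcolor() from grDevices. This only illustrates the idea; whether scattMiss applies alpha in the same way internally is an assumption.

```r
# Default colours from the Usage section.
cols <- c("skyblue", "red", "orange", "lightgrey")

# adjustcolor() scales the alpha channel of each colour;
# values closer to 0 are more transparent.
alpha <- 0.5
cols_alpha <- adjustcolor(cols, alpha.f = alpha)
print(cols_alpha)
```

With many overlapping points, semi-transparent colors make dense regions appear darker than sparse ones.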
Argument: lty
a vector of length two giving the line types for the lines and ellipses. If a single value is supplied, it is used for both.
Argument: lwd
a vector of length two giving the line widths for the lines and ellipses. If a single value is supplied, it is used for both.
Argument: quantiles
a vector giving the quantiles of the chi-square distribution to be used for the tolerance ellipses, or NULL to suppress plotting ellipses (see "Details").
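The link between chi-square quantiles and tolerance ellipses can be sketched in base R. The example assumes the classical sample mean and covariance as location and scatter estimates; the estimator that scattMiss actually uses is not stated on this page and may differ. The simulated data are hypothetical.

```r
set.seed(1)
# Hypothetical bivariate sample standing in for the two plotted variables.
xy <- cbind(x = rnorm(200), y = rnorm(200))

center    <- colMeans(xy)
sigma     <- cov(xy)
quantiles <- c(0.5, 0.975)

# Squared Mahalanobis radius of each ellipse: chi-square quantile with 2 df.
r2 <- qchisq(quantiles, df = 2)

# Points on the unit circle, mapped through the Cholesky factor of sigma.
theta  <- seq(0, 2 * pi, length.out = 100)
circle <- cbind(cos(theta), sin(theta))
R <- chol(sigma)  # upper triangular, t(R) %*% R equals sigma
ellipses <- lapply(r2, function(r) {
  sweep(sqrt(r) * circle %*% R, 2, center, "+")
})

# Share of observations falling inside each ellipse.
d2 <- mahalanobis(xy, center, sigma)
coverage <- sapply(r2, function(r) mean(d2 <= r))
print(coverage)
```

Each element of ellipses is a two-column matrix of boundary points that could be drawn with lines(). For roughly normal data, the coverage values should lie near the requested quantiles.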
Argument: inEllipse
a logical indicating whether lines should be plotted only inside the largest ellipse. Ignored if quantiles is NULL or if there are imputed values.
Argument: zeros
a logical vector of length two indicating whether the variables are semi-continuous, i.e., contain a considerable number of zeros. If TRUE, only the non-zero observations are used for computing the tolerance ellipses. If a single logical is supplied, it is recycled. Ignored if quantiles is NULL.
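The effect of excluding zeros before estimating location can be sketched in base R. The variables age and income are hypothetical, and the filtering step is an assumed reading of "only the non-zero observations are used", not VIM's own code.

```r
set.seed(2)
# Hypothetical semi-continuous variable: many exact zeros, positive otherwise.
n      <- 300
income <- ifelse(runif(n) < 0.4, 0, rlnorm(n))
age    <- rnorm(n, mean = 40, sd = 10)
xy     <- cbind(age = age, income = income)

zeros <- c(FALSE, TRUE)  # only the second variable is semi-continuous

# Keep a row only if every variable flagged in `zeros` is non-zero.
keep <- rep(TRUE, n)
for (j in which(zeros)) keep <- keep & xy[, j] != 0

# Compare the centre estimated from all rows with the non-zero rows only.
center_all <- colMeans(xy)
center_nz  <- colMeans(xy[keep, , drop = FALSE])
print(rbind(all = center_all, nonzero = center_nz))
```

Leaving the zeros in pulls the estimated center toward zero, so an ellipse based on all rows would describe neither the zero nor the non-zero part of the data well.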
Argument: xlim, ylim
axis limits.
Argument: main, sub
main title and subtitle.
Argument: xlab, ylab
axis labels.
Argument: interactive
a logical indicating whether the side argument can be changed interactively (see "Details").
Argument: ...
further graphical parameters to be passed down (see par).
----------Details----------
Information about missing values in one variable is included as vertical or horizontal lines, as determined by the side argument. The lines are drawn at the observed x- or y-value of the other variable. Imputed values are additionally highlighted in the scatterplot. Percentage coverage ellipses can also be drawn to give an indication of the shape of the bivariate data distribution.
If interactive is TRUE, clicking in the bottom margin redraws the plot with information about missing/imputed values in the first variable and clicking in the left margin redraws the plot with information about missing/imputed values in the second variable. Clicking anywhere else in the plot quits the interactive session.
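In scripts or batch rendering, where no mouse clicks are available, the interactive redraw can be switched off and side set explicitly. The call below is a sketch that uses only arguments listed in the Usage section and assumes the VIM package is installed; it is skipped otherwise.

```r
# Runs only if the VIM package is installed.
if (requireNamespace("VIM", quietly = TRUE)) {
  data(tao, package = "VIM")
  x <- tao[, c("Air.Temp", "Humidity")]

  # side = 2: rug and horizontal lines for missing values in the first variable.
  # interactive = FALSE: draw once and return without waiting for clicks.
  VIM::scattMiss(x, side = 2, interactive = FALSE)
}
```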
----------Note----------
The argument zeros was introduced in version 1.4. As a result, some of the argument positions have changed.
----------Author(s)----------
Andreas Alfons, modifications by Bernd Prantner
----------References----------
Exploring incomplete data using visualization tools. Journal of Advances in Data Analysis and Classification, Online first. DOI: 10.1007/s11634-011-0102-y.
----------See Also----------
marginplot
----------Examples----------
library(VIM)  # provides scattMiss() and kNN()
data(tao, package = "VIM")
## for missing values
scattMiss(tao[, c("Air.Temp", "Humidity")])
## for imputed values
scattMiss(kNN(tao[, c("Air.Temp", "Humidity")]), delimiter = "_imp")