overfittingDiagnostics(BayesPeak)
overfittingDiagnostics()所属R语言包:BayesPeak
Overfitting diagnostic functions
过学习诊断功能
译者:生物统计家园网 机器人LoveR
描述----------Description----------
Three functions that provide diagnostic plots and tools to mitigate the effects of overfitting.
三大功能,提供诊断图和工具,以减轻过度拟合的影响。
用法----------Usage----------
plot.overfitdiag(x, whatX = "lambda1", whatY = "score",
logX = TRUE, logY = FALSE,
main = "Overfitting diagnostic", ...)
identify.overfitdiag(x, whatX = "lambda1", whatY = "score",
logX = TRUE, logY = FALSE,
main = "Overfitting diagnostic", ...)
region.overfitdiag(x, whatX = "lambda1", whatY = "score",
logX = TRUE, logY = FALSE,
main = "Overfitting diagnostic", ...)
参数----------Arguments----------
参数:x
Raw output from the bayespeak function.
从原料输出bayespeak功能。
参数:whatX, whatY
Character. The quantities to plot on the X and Y axes. Common choices would be "lambda1", "score", "calls". Any choice in names(raw.output$QC) is, in theory, acceptable (except for "chr" and "status", which do not correspond to numeric quantities).
字符。绘制的X和Y轴的数量。常见的选择将是"lambda1","score","calls"。任何选择names(raw.output$QC)“是,在理论上,可以接受的(除了"chr"和"status",这不符合数字量)。
参数:logX, logY
Logical. If TRUE, the quantity on the corresponding axis undergoes a log transformation before being plotted.
逻辑。如果是TRUE,相应的轴的数量经历了之前被绘制数转换。
参数:main
Title of plot (corresponds to main argument in plot function).
图的标题(相当于main功能plot参数)。
参数:...
Further arguments.
进一步的论据。
plot.overfitdiag passes these through to plot.
plot.overfitdiag通过plot的这些。
identify.overfitdiag passes these through to identify.
identify.overfitdiag通过identify的这些。
region.overfitdiag passes these through to plot.overfitdiag.
region.overfitdiag通过plot.overfitdiag的这些。
Details
详情----------Details----------
These three functions are used to investigate the prevalence of overfitting in a data set, and to aid selection of sensible criteria for performing overfitting corrections.
这三种功能是用于调查的数据集过拟合的患病率,并帮助选择合理的标准执行过拟合改正。
plot.overfitdiag provides a scatterplot of the key parameters associated with jobs. Please see section 9 of the vignette for an description of how to interpret this information.
plot.overfitdiag提供与工作相关的关键参数的散点图。如何解释此信息的描述请参阅暗角9条。
identify.overfitdiag is used after a call plot.overfitdiag, with the same arguments, to find out which job was plotted at a particular location. The interface is operated in the same manner as identify - left-click on the plot to label the job closest to that point, and right-click on the plot to end this process.
identify.overfitdiag呼叫plot.overfitdiag用同样的参数,找出它的工作是在一个特定的位置绘制,使用后。接口在相同的方式运作identify - 左击图标签最接近这一点的工作,并在图上单击右键结束这个进程。
region.overfitdiag is used to define an overfit region on the plot, and return the jobs in the region. The function is used in the same manner as locator - left-click on the plot to define the vertices of a polygon, and then right click anywhere to close the polygon (there is no need to left-click on the first vertex again). The area selected will be filled in with red hatching. The function then returns the IDs of the jobs in the hatched area. Typically, this output will be used as an exclude.jobs argument in summarize.peaks
region.overfitdiag用来定义图过拟合的区域,并返回在该区域的工作。该函数用于在相同的方式locator - 左键单击来定义一个多边形的顶点上的图,然后用鼠标点击任何地方关闭的多边形(也没有必要在左边点击第一个顶点再次)。选定的区域将被填充在与红色孵化。的功能,然后返回在阴影区域作业的ID。通常情况下,此输出将作为一个exclude.jobs参数summarize.peaks
值----------Value----------
All three functions output to the active graphical device. In addition, identify.overfitdiag and region.overfitdiag return integer vectors corresponding to the jobs selected on the plot.
所有这三个函数的输出,以积极的图形设备。此外,identify.overfitdiag和region.overfitdiag返回相应的整数向量积选定的工作。
作者(S)----------Author(s)----------
Jonathan Cairns
举例----------Examples----------
data(raw.output)
plot.overfitdiag(raw.output)
##recreate figures in vignette[#重新暗角数字]
plot.overfitdiag(raw.output, whatX="calls", logX = TRUE, whatY = "lambda1", logY = TRUE)
plot.overfitdiag(raw.output, whatX="calls", logX = TRUE, whatY = "score", logY = TRUE)
## Not run: [#无法运行:]
##identify particular jobs in the plot[#确定图特别工作]
plot.overfitdiag(raw.output, whatX="calls", logX = TRUE, whatY = "score", logY = TRUE)
identify.overfitdiag(raw.output, whatX="calls", logX = TRUE, whatY = "score", logY = TRUE)
##define an overfit region[#定义一个过拟合区域]
##left-click to define the polygon vertices, right-click to close the polygon[#左单击“定义的多边形顶点,右键单击要关闭的多边形]
sel <- region.overfitdiag(raw.output, whatX="calls", logX = TRUE, whatY = "score", logY = TRUE)
output <- summarize.peaks(raw.output, exclude.jobs = sel)
## End(Not run)[#结束(不运行)]
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|