pipeline(flowPhyto)
pipeline()所属R语言包:flowPhyto
Run the SeaFlow Pipeline
运行SeaFlow管道
译者:生物统计家园网 机器人LoveR
描述----------Description----------
run the pipeline
运行管道
用法----------Usage----------
pipeline(cruise.name='', repo=REPO.PATH, range=NULL, steps=1:4, pct=.97,
clust.concat.ct=3, resample.size=300, resamp.concat.max=10,
filter.width=1.5, filter.notch=1, filter.edge=1,
classify.func =2, classify.varnames=CHANNEL.CLMNS.SM, classify.numc=0, classify.noise=0,
map.margin=2,
concat.sds=!is.na(match(1,steps)), load.to.db=FALSE, preplot=FALSE, cleanup=TRUE,
input.path=paste(repo, '/', cruise.name, sep=''),
output.path=input.path, log.dir=output.path,
def.path=paste(input.path,'/', 'pop.def.tab',sep=''), parallel=TRUE, submit.cmd='qsub')
参数----------Arguments----------
参数:cruise.name
Simplified cruise name (same name as the subdirectory in the seaflow data dir).
简化邮轮名称(在seaflow数据目录的子目录名称相同)。
参数:steps
Which steps of the pipeline to run. step 1 is filter, step 2 is classify, step 3 is census and consensus, step 4 is summarize. 1:2 will do step 1 to 2, etc.
哪些步骤运行的管道。第1步是过滤器,第2步是分类,第3步是普查,并达成共识,第4步是总结。 1:2将执行步骤1至2等
参数:pct
percentage completion (number of indicator files created vs input files) each job step should go to.
完成百分比(与输入文件创建的指标文件数)每个作业步骤应该去。
参数:clust.concat.ct
Number of event file to concatenate at a time during the clustering/classification step.
事件文件的数量,连接在聚类/分类步骤一次。
参数:map.margin
Margin in latitude/longitude around the map plots.
周围的图图的经度,纬度/保证金。
参数:resample.size
Minimum number of events in a population.
在人口活动的最低数量。
参数:resamp.concat.max
Maximum number of allowable event files to concatenate to generate statistics from.
允许事件文件的最大数量来连接生成统计。
参数:filter.notch
the location of the x=y (by default) point to create the notch in the gated filter
X = Y的位置(默认)创建选通滤波器陷波点
参数:filter.width
the margin of error for particle alignment determination in the filter step.
颗粒过滤器的步骤对齐决心保证金的错误。
参数:filter.edge
location of the boundary layer between water/air. Particles located at the boundary layer scatter light that can be detected by the position detectors for the filter step.
水/空气之间的边界层的位置。在位于边界层散射光滤波器的阶跃的位置探测器,可以通过检测的颗粒。
参数:classify.func
Choose the clustering method, either flowClust (func = 1) or flowMeans (func = 2, by default) function
聚类分析方法的选择,要么flowClust(FUNC = 1)或flowMeans(FUNC = 2,默认情况下)功能
参数:classify.varnames
A character vector specifying the variables (columns) to be included in clustering when choosing flowMeans.
指定一个字符向量聚类包括在选择flowMeans时必须在变量(列)。
参数:classify.numc
Number of clusters when choosing flowMeans. If set to 0 (default) the value matches the number of populations defined in pop.def table . If set to NA, the optimal number of clusters will be estimated automatically.
的聚类在选择flowMeans时数。如果设置为0(默认)值pop.def表中定义的人口数量相匹配。如果设置为NA,最优簇数目将自动估计。
参数:classify.noise
Set up the noise threshold for phytoplankton cells. Only cells with chlorophyll value higher than the noise will be clustered
成立浮游植物单元的噪声阈值。叶绿素值较高的噪声只有单元将聚集
参数:concat.sds
Determines if the sds files in the individual julian day directories should be concatenated together into sds.tab
如果在个别儒略日目录SDS文件确定应连接在一起成sds.tab
参数:load.to.db
Load the sds and stat files to the database.
SDS和stat文件加载到数据库中。
参数:preplot
Preplot the level 2 analysis plots to 'output.path'.
Preplot的第2级分析output.path“的图。
参数:cleanup
Cleanup the submission and (non error reporting) R CMD BATCH log files.
清理的意见书(非错误报告)规CMD批处理log文件。
参数:input.path
Path to the directory with input data (raw evt or opp files.
输入数据(EVT原料或OPP文件的目录的路径。
参数:output.path
Path to the directory where you wish to output data.
你要输出数据的目录的路径。
参数:log.dir
Path to the directory where log file will be written.
将被写入log文件的目录的路径。
参数:def.path
Path to the file that defines how to gate & cluster the events into populations.
路径的文件,它定义如何门及聚类成群体事件。
参数:parallel
Boolean indicating if the job should be run in parallel using qsub (vs in serial)
布尔值,表示如果作业应当运行在并行使用qsub(与串行)
参数:repo
Full path to your SeaFlow repository
您SeaFlow库的完整路径
参数:range
A named, two-integer vector specifying the start and end (inclusive) range for subsetting the input files used in each analysis step (with the exception of summarize). Values should be a (evt/opp) file numbers and names should be strings corresponding to the year_julianday directory names. The nv() function is useful for creating this vector.
命名,两个整数向量指定的子集在每个分析步骤(异常总结)用于输入文件的开始和结束(含)的范围。值应该是(EVT / OPP)的文件编号和名称应该是字符串相应的year_julianday目录名。 NV()函数是用于创建这个向量。
参数:submit.cmd
the command used to deploy an R CMD BATCH system call to a cluster. Must be used in conjunction with parallel=TRUE.
用来部署一个R CMD批处理系统命令调用到聚类。必须使用结合并行= TRUE。
举例----------Examples----------
example.cruise.name <- 'seaflow_cruise'
temp.out.dir <- '.' #path.expand('~')[path.expand(~)]
output.path <- paste(temp.out.dir,'/',example.cruise.name,sep='')
seaflow.path <- system.file("extdata", example.cruise.name, package="flowPhyto")
file.copy(from=seaflow.path, to=temp.out.dir, recursive=TRUE)
pipeline(repo= temp.out.dir, cruise.name='seaflow_cruise', steps=4, parallel=FALSE)
unlink(example.cruise.name, recursive=TRUE)
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|