R语言 genoset包 boundingIndices2()函数中文帮助文档(中英文对照)

loveR · 发表于 2012-2-25 19:34:26

boundingIndices2(genoset)
boundingIndices2()所属R语言包：genoset

                                    Find indices of features bounding a set of chromsome ranges/genes...
                                       包围了一套染色体范围/基因功能指数......

                                       译者：生物统计家园网机器人LoveR

描述----------Description----------

Find indices of features bounding a set of chromsome ranges/genes
包围了一套染色体范围/基因功能指数

用法----------Usage----------

参数----------Arguments----------

参数：starts
numeric or integer, first base position of each query range
数字或整数，每个查询范围的基础地位

参数：stops
numeric or integer, last base position of each query range
数字或整数，每个查询范围的最后一个碱基的地位

参数：positions
Base positions in which to search
相应的职位，在其中搜索

参数：offset
integer, value to add to all returned indices. For the case where positions represents a portion of some larger array (e.g. a chr in a genome)
整数，值添加到所有返回的指数。对于此案的立场表示一些较大的阵列的一部分（例如，在一个基因组CHR）

Details

详情----------Details----------

This function is similar to findOverlaps but it guarantees at least two features will be covered. This is useful in the case of finding features corresponding to a set of genes. Some genes will fall entirely between two features and thus would not return any ranges with findOverlaps. Specifically, this function will find the indices of the features (first and last) bounding the ends of a range/gene (start and stop) such that first <= start <= stop <= last. Equality is necessary so that multiple conversions between indices and genomic positions will not expand with each conversion. This function uses findIntervals, which is for k queries and n features is O(k * log(n)) generally and ~O(k) for sorted queries. Therefore will be dramatically faster for sets of query genes that are sorted by start position within each chromosome.  This should give performance for k genes and n features that is ~O(k) for starts and O(k * log(n)) for stops and ~O(k * log(n)) overall.  Ranges/genes that are outside the range of feature positions will be given the indices of the corresponding first or last index rather than 0 or n + 1 so
这种功能类似findOverlaps的，但它保证至少有两个功能将被覆盖。这是在寻找相应的一组基因功能的情况下非常有用。有些基因会完全落在两个功能之间，因此不会返回与findOverlaps任何范围。具体来说，这个函数会发现功能指数（第一个和最后一个）边界两端范围/基因（启动和停止），这样第一页<= START <=停止<=最后。平等是必要的，因此，指数和基因组的位置之间的多次转换会不会扩大相互转换。此功能使用findIntervals，k查询和n功能是为O（K *log（n））一般~O（K）排序查询。因此将极大地加快查询的基因集在每个染色体的位置开始排序。这应该给ķ基因和N的功能是~O（K）为启动和O的性能（K *log（n））的站吧~O（K *log（N））整体。范围/基因特征位置的范围以外的，将给予相应的第一个或最后一个指数的指数，而不是0或N + 1

值----------Value----------

integer matrix of 2 columms for start and stop index of range in data
2 columms启动和停止数据索引范围的整数矩阵

作者（S）----------Author(s)----------

Peter M. Haverty

举例----------Examples----------

转载请注明:出自生物统计家园网(http://www.biostatistic.net)。

注：
注1：为了方便大家学习，本文档为生物统计家园网机器人LoveR翻译而成，仅供个人R语言学习参考使用，生物统计家园保留版权。
注2：由于是机器人自动翻译，难免有不准确之处，使用时仔细对照中、英文内容进行反复理解，可以帮助R语言的学习。
注3：如遇到不准确之处，请在本贴的后面进行回帖，我们会逐渐进行修订。

账号		自动登录	找回密码
密码			注册