icuSetCollate(base)
icuSetCollate()所属R语言包:base
Setup Collation by ICU
设置排序规则由加护病房
译者:生物统计家园网 机器人LoveR
描述----------Description----------
Controls the way collation is done by ICU (an optional part of the R build).
控制的方式,由ICU(的R版本的可选部分)进行整理。
用法----------Usage----------
icuSetCollate(...)
参数----------Arguments----------
参数:...
Named arguments, see "Details".
命名参数,请参阅“详细资料”。
Details
详情----------Details----------
Optionally, R can be built to collate character strings by ICU (http://site.icu-project.org). For such systems, icuSetCollate can be used to tune the way collation is done. On other builds calling this function does nothing, with a warning.
(可选),R可内置整理字符串ICU(http://site.icu-project.org)。对于这样的系统,icuSetCollate可以用来调整的整理完成的方式。其他版本上调用这个函数什么也不做,与警告。
Possible arguments are
可能的参数
locale: A character string such as "da_DK" giving the country whose collation rules are to be used. If present, this
locale:"da_DK"给国家,要使用的排序规则,如字符串。如果存在,此
case_first: "upper", "lower" or "default", asking for upper- or lower-case characters to be sorted first. The default is usually lower-case first, but not in
case_first:"upper","lower"或"default",大写或小写字符的要求,首先进行排序。默认的通常是小写的第一个,但不是在
alternate_handling: Controls the handling of "variable" characters (mainly punctuation and symbols). Possible values are "non_ignorable" (primary strength) and
alternate_handling:变的字符(主要是标点和符号)控制处理。可能的值是"non_ignorable"(主要力量)和
strength: Which components should be used? Possible values "primary", "secondary", "tertiary"
strength:应使用哪些组件?可能的值"primary","secondary","tertiary"
french_collation: In a French locale the way accents affect collation is from right to left, whereas in most other locales it is from left to right. Possible values "on", "off"
french_collation:在法语语言环境口音影响整理的方式是由右至左,而在大多数其他语言环境,它是由左到右。可能的值"on","off"
normalization: Should strings be normalized? Possible values are "on" and "off" (default). This affects the
normalization:字符串被归?可能的值是"on"和"off"(默认)。这会影响
case_level: An additional level between secondary and tertiary, used to distinguish large and small Japanese Kana
case_level:二级和三级之间的水平,用来区分大型和小型的日文假名
hiragana_quaternary: Possible values "on" (sort
hiragana_quaternary:可能的值"on"(排序
Only the first three are likely to be of interest except to those with a detailed understanding of collation and specialized requirements.
只有前三个都可能有兴趣的整理和专业要求的详细了解除外。
Some examples are case_level="on", strength="primary" to ignore accent differences and alternate_handling="shifted" to ignore space and punctuation characters.
一些例子case_level="on", strength="primary"忽略口音的差异和alternate_handling="shifted"忽略空格和标点字符。
Note that these settings have no effect if collation is set to the C locale, unless locale is specified.
请注意,这些设置整理设置C语言环境,如果没有任何效果,除非locale指定。
注意----------Note----------
As from R 2.9.0, ICU is used by default wherever it is available: this include Mac OS >= 10.4 and many Linux installations.
从R 2.9.0,ICU是默认使用所有可用:这包括Mac OS中> = 10.4和许多Linux安装。
参见----------See Also----------
Comparison, sort
相比之下,sort
The ICU user guide chapter on collation (http://userguide.icu-project.org/collation).
ICU的用户指南整理章(http://userguide.icu-project.org/collation)。
举例----------Examples----------
## these examples depend on having ICU available, and on the locale[#这些例子依赖于加护病房提供,并在语言环境]
x <- c("Aarhus", "aarhus", "safe", "test", "Zoo")
sort(x)
icuSetCollate(case_first="upper"); sort(x)
icuSetCollate(case_first="lower"); sort(x)
icuSetCollate(locale="da_DK", case_first="default"); sort(x)
icuSetCollate(locale="et_EE"); sort(x)
转载请注明:出自 生物统计家园网(http://www.biostatistic.net)。
注:
注1:为了方便大家学习,本文档为生物统计家园网机器人LoveR翻译而成,仅供个人R语言学习参考使用,生物统计家园保留版权。
注2:由于是机器人自动翻译,难免有不准确之处,使用时仔细对照中、英文内容进行反复理解,可以帮助R语言的学习。
注3:如遇到不准确之处,请在本贴的后面进行回帖,我们会逐渐进行修订。
|