Enhanced regular expressions: Oniguruma ... by bd7lx ... 31 months 26 days ago ... 880 views
Oniguruma
The name was coined in Japan; this is the regular expression library that future versions of Ruby will use.
Oniguruma, a Japanese name meaning roughly ghost wheel or demon wheel. (It is commonly misspelled by non-Japanese; remember you can't spell Oniguruma without "guru.")
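+ (I don't dare write out "wheel" in Chinese, for fear of getting the post banned.)
It can already be installed on Ruby 1.8 as a gem. Its main features are support for multibyte languages, better speed, and support for the regular expression syntax of other languages.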
http://oniguruma.rubyforge.org/oniguruma/
Supported character encodings:
ASCII, UTF-8, UTF-16BE, UTF-16LE, UTF-32BE, UTF-32LE,
EUC-JP, EUC-TW, EUC-KR, EUC-CN,
Shift_JIS, Big5, GB18030, KOI8-R, CP1251,
ISO-8859-1, ISO-8859-2, ISO-8859-3, ISO-8859-4, ISO-8859-5,
ISO-8859-6, ISO-8859-7, ISO-8859-8, ISO-8859-9, ISO-8859-10,
ISO-8859-11, ISO-8859-13, ISO-8859-14, ISO-8859-15, ISO-8859-16
(GB18030 encoding was contributed by KUBO Takehiro)
(CP1251 encoding was contributed by Byte)
It is fair to call this a genuinely mature project; Ruby 1.9 will include the library.
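A minimal sketch of what the multibyte support looks like in practice, assuming Ruby 1.9 or later, where the built-in Regexp is based on Oniguruma (Onigmo, a fork, from Ruby 2.0), so no gem is needed. The sample string and variable names are made up for illustration; on Ruby 1.8 with the gem the API is different and is not shown here.

```ruby
# encoding: utf-8
# Illustrative only: the sample text and variable names are assumptions.
text = "価格: 1200円"

# Named capture group: pull out the digits that precede the yen character.
m = /(?<amount>\d+)円/.match(text)
amount = m[:amount]

# Lookbehind: digits that directly follow ": ".
after_colon = text[/(?<=: )\d+/]

# The dot matches one whole character, not one byte of a multibyte character.
first_char = text[/\A./]

# Unicode script property: collect every Han (CJK ideograph) character.
han = text.scan(/\p{Han}/)
```

Named groups, lookbehind, and `\p{...}` properties are the kind of syntax the old Ruby 1.8 engine lacked, which is why they make a reasonable quick check that the Oniguruma-style engine is in use.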
The Ruby Way, 2nd edition, chapter 3, section 3.13, describes Oniguruma in considerable detail and spares no ink on it; it has effectively become the Regexp engine.