Oracle 学习笔记: 正则表达式

全部 Linux 数据库敏捷编程数据结构软件测试项目管理 Oracle 编程综合互联网 Erlang MySQL

浏览 6062 次

锁定老帖子主题：Oracle 学习笔记: 正则表达式精华帖 (0) :: 良好帖 (0) :: 新手帖 (0) :: 隐藏帖 (0)
作者	正文
logicgate 等级: 性别: 文章: 1624 积分: 282 来自: 新力吐皮	发表时间：2009-05-31 最后修改：2009-06-18 相关推荐: stlport 5.1.6 MinGW 编译安装 STLPort5.1.0 VC6 STLport-5.1.4 编译，安装 STLport-5.1.0源码 STLPort 5.1.5 编译更多相关推荐数据库 Oracle 10g数据库内建了符合IEEE POSIX (Portable Operating System for Unix)标准的正则表达式。熟练使用正则表达式，可以写出简洁，强大的SQL语句。正则表达式有几个优点优于常见的LIKE操作符和INSTR、SUBSTR及REPLACE 函数的。这些传统的SQL 函数不便于进行模式匹配。只有LIKE 操作符通过使用%和_字符匹配，但LIKE不支持表达式的重复、复杂的更替、字符范围、字符列表和POSIX 字符类等等。元字符（Meta Character）： ^ 使表达式定位至一行的开头 $ 使表达式定位至一行的末尾 * 匹配 0 次或更多次 ? 匹配 0 次或 1 次 + 匹配 1 次或更多次 {m} 正好匹配 m 次 {m,} 至少匹配 m 次 {m, n} 至少匹配 m 次但不超过 n 次 [:alpha:] 字母字符 [:lower:] 小写字母字符 [:upper:] 大写字母字符 [:digit:] 数字 [:alnum:] 字母数字字符 [:space:] 空白字符（禁止打印），如回车符、换行符、竖直制表符和换页符[:punct:] 标点字符 [:cntrl:] 控制字符（禁止打印） [:print:] 可打印字符 \| 分隔替换选项，通常与分组操作符 () 一起使用 ( ) 将子表达式分组为一个替换单元、量词单元或后向引用单元 [char] 字符列表 Oracle 10g提供了四个regexp function: REGEXP_LIKE , REGEXP_REPLACE , REGEXP_INSTR , REGEXP_SUBSTR 。 REGEXP_LIKE：比较一个字符串是否与正则表达式匹配 (srcstr, pattern [, match_option]) REGEXP_INSTR：在字符串中查找正则表达式，并且返回匹配的位置 (srcstr, pattern [, position [, occurrence [, return_option [, match_option]]]]) REGEXP_SUBSTR：返回与正则表达式匹配的子字符串 (srcstr, pattern [, position [, occurrence [, match_option]]]) REGEXP_REPLACE：搜索并且替换匹配的正则表达式 (srcstr, pattern [, replacestr [, position [, occurrence [, match_option]]]]) 其中各参数的含义为: srcstr: 被查找的字符数据。 pattern: 正则表达式。 occurrence: 出现的次数。默认为1。 position: 开始位置 return_option: 默认值为0，返回该模式的起始位置；值为1则返回符合匹配条件的下一个字符的起始位置。 replacestr: 用来替换匹配模式的字符串。 match_option: 匹配方式选项。缺省为c。 c：case sensitive I：case insensitive n：(.)匹配任何字符（包括newline) m：字符串存在换行的时候被作为多行处理下面通过一些具体的例子来说明如何使用这四个函数。首先创建一个测试数据表， SQL> create table person ( 2 first_name varchar(20), 3 last_name varchar(20), 4 email varchar(100), 5 zip varchar(6)); Table created. SQL> insert into person values ('Steven', 'Chen', 'steven@hp.com', '123456'); 1 row created. SQL> insert into person values ('James', 'Li', 'jamesli@sun.com' \|\| chr(10) \|\| 'lijames@oracle.com', '1b3d5f'); 1 row created. SQL> commit; Commit complete. SQL> select * from person; FIRST_NAME LAST_NAME EMAIL ZIP ---------- ---------- -------------------- ------ Steven Chen steven@hp.com 123456 James Li jamesli@sun.com 1b3d5f lijames@oracle.com 1。REGEXP_LIKE SQL> select zip as invalid_zip from person where regexp_like(zip, '[^[:digit:]]'); INVALID_ZIP -------------------- 1b3d5f SQL> select first_name from person where regexp_like(first_name, '^S.n$'); FIRST_NAME ---------- Steven SQL> select first_name from person where regexp_like(first_name, '^s.n$'); no rows selected SQL> select first_name from person where regexp_like(first_name, '^s.n$', 'c'); no rows selected SQL> select first_name from person where regexp_like(first_name, '^s.n$', 'i'); FIRST_NAME ---------- Steven SQL> select email from person where regexp_like(email, '^james.com$'); no rows selected SQL> select email from person where regexp_like(email, '^james.com$', 'n'); EMAIL -------------------- jamesli@sun.com lijames@oracle.com SQL> select email from person where regexp_like(email, '^li.com$'); no rows selected SQL> select email from person where regexp_like(email, '^li.com$', 'm'); EMAIL -------------------- jamesli@sun.com lijames@oracle.com 注意上面分别测试了不同的match_option对结果的影响。 2。REGEXP_INSTR 查找zip中第一个非数字字符的位置 SQL> select regexp_instr(zip, '[^[:digit:]]') as position from person; POSITION ---------- 0 2 从第三个字符开始，查找zip中第二个非数字字符的位置 SQL> select regexp_instr(zip, '[^[:digit:]]', 3, 2) as position from person; POSITION ---------- 0 6 从第三个字符开始，查找zip中第二个非数字字符的下一个字符位置 SQL> select regexp_instr(zip, '[^[:digit:]]', 3, 2, 1) as position from person; POSITION ---------- 0 7 3。REGEXP_SUBSTR SQL> select regexp_substr(zip, '[^[:digit:]]') as zip from person; ZIP ------------------ b SQL> select regexp_substr(zip, '[^[:digit:]]', 3, 2) as zip from person; ZIP ------------ f 4。REGEXP_REPLACE 把zip中所有非数字字符替换为0 SQL> update person set zip=regexp_replace(zip, '[^[:digit:]]', '0') 2 where regexp_like(zip, '[^[:digit:]]'); 1 row updated. SQL> select zip from person; ZIP ------ 123456 103050 后向引用（backreference）：后向引用是一个很有用的特性。它能够把子表达式的匹配部分保存在临时缓冲区中，供以后重用。缓冲区从左至右进行编号，并利用 `\digit` 符号进行访问。子表达式用一组圆括号来显示。利用后向引用可以实现较复杂的替换功能。 SQL> select regexp_replace('Steven Chen', '(.) (.)', '\2, \1') as reversed_name from dual; REVERSED_NAME -------------------- Chen, Steven 在DDL中也可以正则表达式，比如Constraint, index, view SQL> alter table person add constraint constraint_zip check (regexp_like(zip, '^[[:digit:]]+$')); SQL> create index person_idx on person(regexp_substr(last_name, '^[[:upper:]]')); 声明：ITeye文章版权属于作者，受法律保护。没有作者书面许可不得转载。推荐链接
返回顶楼

论坛首页 → 综合技术版

跳转论坛: