Construct
Matches
Characters
x |
The character x |
\\ |
The backslash character |
\0n |
The character with octal value 0n (0 <= n <= 7) |
\0nn |
The character with octal value 0nn (0 <= n <= 7) |
\0mnn |
The character with octal value 0mnn (0 <= m <= 3, 0 <= n <= 7) |
\xhh |
The character with hexadecimal value 0xhh |
\uhhhh |
The character with hexadecimal value 0xhhhh |
\t |
The tab character ('\u0009') |
\n |
The newline (line feed) character ('\u000A') |
\r |
The carriage-return character ('\u000D') |
\f |
The form-feed character ('\u000C') |
\a |
The alert (bell) character ('\u0007') |
\e |
The escape character ('\u001B') |
\cx |
The control character corresponding to x |
Character classes
[abc] |
a, b, or c (simple class) |
[^abc] |
Any character except a, b, or c (negation) |
[a-zA-Z] |
a through z or A through Z, inclusive (range) |
[a-d[m-p]] |
a through d, or m through p: [a-dm-p] (union) |
[a-z&&[def]] |
d, e, or f (intersection) |
[a-z&&[^bc]] |
a through z, except for b and c: [ad-z] (subtraction) |
[a-z&&[^m-p]] |
a through z, and not m through p: [a-lq-z] (subtraction) |
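The union, intersection, and subtraction rows above can be checked directly. The following is a minimal sketch; the class name CharClassDemo and its helper method are illustrative and not part of java.util.regex.

```java
import java.util.regex.Pattern;

// Illustrative helper; only Pattern.matches comes from the JDK.
final class CharClassDemo {
    // True if the entire input is matched by the given character class.
    static boolean matches(String charClass, String input) {
        return Pattern.matches(charClass, input);
    }

    public static void main(String[] args) {
        System.out.println(matches("[a-d[m-p]]", "n"));    // union: true
        System.out.println(matches("[a-d[m-p]]", "e"));    // union: false
        System.out.println(matches("[a-z&&[def]]", "e"));  // intersection: true
        System.out.println(matches("[a-z&&[^bc]]", "b"));  // subtraction: false
        System.out.println(matches("[a-z&&[^m-p]]", "q")); // subtraction: true
    }
}
```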
Predefined character classes
. |
Any character (may or may not match line terminators) |
\d |
A digit: [0-9] |
\D |
A non-digit: [^0-9] |
\s |
A whitespace character: [ \t\n\x0B\f\r] |
\S |
A non-whitespace character: [^\s] |
\w |
A word character: [a-zA-Z_0-9] |
\W |
A non-word character: [^\w] |
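As a rough illustration of the predefined classes, the sketch below strips non-digits with \D, splits on whitespace with \s+, and shows that "." skips a line feed unless the s (DOTALL) flag is on. The class and method names are assumptions made for this example.

```java
import java.util.regex.Pattern;

// Illustrative helper; class and method names are not part of the JDK.
final class PredefinedClassDemo {
    // Removes every non-digit (\D) character.
    static String digitsOnly(String s) {
        return s.replaceAll("\\D", "");
    }

    // Splits on runs of whitespace (\s+) after trimming both ends.
    static String[] tokens(String s) {
        return s.trim().split("\\s+");
    }

    // True if "." (prefixed by optional inline flags) matches the whole input.
    static boolean dotMatches(String flags, String input) {
        return Pattern.matches(flags + ".", input);
    }

    public static void main(String[] args) {
        System.out.println(digitsOnly("a1 b22\tc333"));       // 122333
        System.out.println(tokens("  a1 b22\tc333 ").length); // 3
        System.out.println(dotMatches("", "\n"));             // false
        System.out.println(dotMatches("(?s)", "\n"));         // true
    }
}
```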
POSIX character classes (US-ASCII only)
\p{Lower} |
A lower-case alphabetic character: [a-z] |
\p{Upper} |
An upper-case alphabetic character: [A-Z] |
\p{ASCII} |
All ASCII: [\x00-\x7F] |
\p{Alpha} |
An alphabetic character: [\p{Lower}\p{Upper}] |
\p{Digit} |
A decimal digit: [0-9] |
\p{Alnum} |
An alphanumeric character: [\p{Alpha}\p{Digit}] |
\p{Punct} |
Punctuation: One of !"#$%&'()*+,-./:;<=>?@[\]^_`{|}~ |
\p{Graph} |
A visible character: [\p{Alnum}\p{Punct}] |
\p{Print} |
A printable character: [\p{Graph}\x20] |
\p{Blank} |
A space or a tab: [ \t] |
\p{Cntrl} |
A control character: [\x00-\x1F\x7F] |
\p{XDigit} |
A hexadecimal digit: [0-9a-fA-F] |
\p{Space} |
A whitespace character: [ \t\n\x0B\f\r] |
java.lang.Character classes (simple java character type)
\p{javaLowerCase} |
Equivalent to java.lang.Character.isLowerCase() |
\p{javaUpperCase} |
Equivalent to java.lang.Character.isUpperCase() |
\p{javaWhitespace} |
Equivalent to java.lang.Character.isWhitespace() |
\p{javaMirrored} |
Equivalent to java.lang.Character.isMirrored() |
Classes for Unicode blocks and categories
\p{InGreek} |
A character in the Greek block (simple block) |
\p{Lu} |
An uppercase letter (simple category) |
\p{Sc} |
A currency symbol |
\P{InGreek} |
Any character except one in the Greek block (negation) |
[\p{L}&&[^\p{Lu}]] |
Any letter except an uppercase letter (subtraction) |
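The block and category classes above can be tried out with single-character inputs. This is a small sketch, assuming the hypothetical helper class UnicodeClassDemo; the Greek letter is written as a Unicode escape to avoid source-encoding issues.

```java
import java.util.regex.Pattern;

// Illustrative helper; only Pattern.matches comes from the JDK.
final class UnicodeClassDemo {
    // True if the single-character input belongs to the given class.
    static boolean is(String charClass, String ch) {
        return Pattern.matches(charClass, ch);
    }

    public static void main(String[] args) {
        System.out.println(is("\\p{InGreek}", "\u03B1"));      // Greek small alpha: true
        System.out.println(is("\\P{InGreek}", "a"));           // not in the Greek block: true
        System.out.println(is("\\p{Lu}", "A"));                // uppercase letter: true
        System.out.println(is("\\p{Sc}", "$"));                // currency symbol: true
        System.out.println(is("[\\p{L}&&[^\\p{Lu}]]", "a"));   // letter, not uppercase: true
        System.out.println(is("[\\p{L}&&[^\\p{Lu}]]", "A"));   // false
    }
}
```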
Boundary matchers
^ |
The beginning of a line |
$ |
The end of a line |
\b |
A word boundary |
\B |
A non-word boundary |
\A |
The beginning of the input |
\G |
The end of the previous match |
\Z |
The end of the input but for the final terminator, if any |
\z |
The end of the input |
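The sketch below illustrates a few of the boundary matchers: \b around a word, the difference between \Z and \z when the input ends with a line terminator, and ^ with and without the m (MULTILINE) flag. The class BoundaryDemo and its methods are names invented for this example.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Illustrative helper; class and method names are not part of the JDK.
final class BoundaryDemo {
    // Counts non-overlapping matches of regex in input.
    static int count(String regex, String input) {
        Matcher m = Pattern.compile(regex).matcher(input);
        int n = 0;
        while (m.find()) {
            n++;
        }
        return n;
    }

    // True if regex matches anywhere in input.
    static boolean found(String regex, String input) {
        return Pattern.compile(regex).matcher(input).find();
    }

    public static void main(String[] args) {
        // "concat" contains "cat" but not as a whole word.
        System.out.println(count("\\bcat\\b", "cat concat cat")); // 2
        // \Z tolerates one final line terminator; \z does not.
        System.out.println(found("line\\Z", "line\n"));           // true
        System.out.println(found("line\\z", "line\n"));           // false
        // Without the m flag, ^ matches only at the start of the input.
        System.out.println(found("^b", "a\nb"));                  // false
        System.out.println(found("(?m)^b", "a\nb"));              // true
    }
}
```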
Greedy quantifiers
X? |
X, once or not at all |
X* |
X, zero or more times |
X+ |
X, one or more times |
X{n} |
X, exactly n times |
X{n,} |
X, at least n times |
X{n,m} |
X, at least n but not more than m times |
Reluctant quantifiers
X?? |
X, once or not at all |
X*? |
X, zero or more times |
X+? |
X, one or more times |
X{n}? |
X, exactly n times |
X{n,}? |
X, at least n times |
X{n,m}? |
X, at least n but not more than m times |
Possessive quantifiers
X?+ |
X, once or not at all |
X*+ |
X, zero or more times |
X++ |
X, one or more times |
X{n}+ |
X, exactly n times |
X{n,}+ |
X, at least n times |
X{n,m}+ |
X, at least n but not more than m times |
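The three quantifier families share the same descriptions but differ in how much input they consume and whether they give it back. The sketch below, with an assumed helper class QuantifierDemo, runs .*foo in greedy, reluctant, and possessive form against the same input.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Illustrative helper; class and method names are not part of the JDK.
final class QuantifierDemo {
    // Returns the text of the first match, or null if there is none.
    static String firstMatch(String regex, String input) {
        Matcher m = Pattern.compile(regex).matcher(input);
        return m.find() ? m.group() : null;
    }

    public static void main(String[] args) {
        String input = "xfooxxxxxxfoo";
        // Greedy: takes everything, then backs off until "foo" fits.
        System.out.println(firstMatch(".*foo", input));  // xfooxxxxxxfoo
        // Reluctant: takes as little as possible before "foo".
        System.out.println(firstMatch(".*?foo", input)); // xfoo
        // Possessive: takes everything and never backs off, so "foo" cannot match.
        System.out.println(firstMatch(".*+foo", input)); // null
    }
}
```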
Logical operators
XY |
X followed by Y |
X|Y |
Either X or Y |
(X) |
X, as a capturing group |
Back references
\n |
Whatever the nth capturing group matched |
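One common use of a back reference is spotting a word typed twice in a row. The following is a sketch under that assumption; the class BackReferenceDemo and the method repeatedWord are hypothetical names.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Illustrative helper; class and method names are not part of the JDK.
final class BackReferenceDemo {
    // Group 1 captures a word; \1 requires the same text again after whitespace.
    private static final Pattern REPEATED = Pattern.compile("\\b(\\w+)\\s+\\1\\b");

    // Returns the first immediately repeated word, or null if there is none.
    static String repeatedWord(String input) {
        Matcher m = REPEATED.matcher(input);
        return m.find() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        System.out.println(repeatedWord("this is is a test")); // is
        System.out.println(repeatedWord("this is a test"));    // null
    }
}
```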
Quotation
\ |
Nothing, but quotes the following character |
\Q |
Nothing, but quotes all characters until \E |
\E |
Nothing, but ends quoting started by \Q |
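Quoting matters when the text to match contains metacharacters such as +. The sketch below compares an unquoted pattern, a backslash-quoted one, a \Q...\E span, and Pattern.quote, which produces a literal pattern for an arbitrary string. The class QuotationDemo is an illustrative name.

```java
import java.util.regex.Pattern;

// Illustrative helper; class and method names are not part of the JDK.
final class QuotationDemo {
    // True if regex matches the entire input.
    static boolean matches(String regex, String input) {
        return Pattern.matches(regex, input);
    }

    // Treats the first argument as literal text rather than as a regex.
    static boolean matchesLiteral(String literal, String input) {
        return Pattern.matches(Pattern.quote(literal), input);
    }

    public static void main(String[] args) {
        // Unquoted: "1+" means "one or more 1s", so the literal text does not match.
        System.out.println(matches("1+1=2", "1+1=2"));          // false
        System.out.println(matches("1+1=2", "11=2"));           // true
        // A backslash quotes the single following character.
        System.out.println(matches("1\\+1=2", "1+1=2"));        // true
        // \Q ... \E quotes everything in between.
        System.out.println(matches("\\Q1+1=2\\E", "1+1=2"));    // true
        System.out.println(matchesLiteral("1+1=2", "1+1=2"));   // true
    }
}
```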
Special constructs (non-capturing)
(?:X) |
X, as a non-capturing group |
(?idmsux-idmsux) |
Nothing, but turns match flags i d m s u x on - off |
(?idmsux-idmsux:X) |
X, as a non-capturing group with the given flags i d m s u x on - off |
(?=X) |
X, via zero-width positive lookahead |
(?!X) |
X, via zero-width negative lookahead |
(?<=X) |
X, via zero-width positive lookbehind |
(?<!X) |
X, via zero-width negative lookbehind |
(?>X) |
X, as an independent, non-capturing group |
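The special constructs above are easiest to see side by side. The sketch below touches an inline flag, both lookarounds, a non-capturing group, and an independent group; SpecialConstructDemo and its methods are names assumed for this example, and the inputs are made up.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Illustrative helper; class and method names are not part of the JDK.
final class SpecialConstructDemo {
    // Returns the text of the first match, or null if there is none.
    static String firstMatch(String regex, String input) {
        Matcher m = Pattern.compile(regex).matcher(input);
        return m.find() ? m.group() : null;
    }

    // True if regex matches the entire input.
    static boolean matches(String regex, String input) {
        return Pattern.matches(regex, input);
    }

    // Number of capturing groups in the pattern.
    static int groupCount(String regex) {
        return Pattern.compile(regex).matcher("").groupCount();
    }

    public static void main(String[] args) {
        System.out.println(matches("(?i)hello", "HeLLo"));              // flag i on: true
        System.out.println(firstMatch("(?<=\\$)\\d+", "cost: $42"));    // lookbehind: 42
        System.out.println(firstMatch("\\d+(?= dollars)", "42 dollars")); // lookahead: 42
        System.out.println(firstMatch("q(?!u)", "quit"));               // negative lookahead: null
        System.out.println(groupCount("(?:ab)+(c)"));                   // only (c) captures: 1
        // The independent group keeps all the a's it matched, so "ab" cannot follow.
        System.out.println(matches("(?>a+)ab", "aaab"));                // false
        System.out.println(matches("(?:a+)ab", "aaab"));                // true
    }
}
```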