[/
  Copyright 2006-2007 John Maddock.
  Distributed under the Boost Software License, Version 1.0.
  (See accompanying file LICENSE_1_0.txt or copy at
  http://www.boost.org/LICENSE_1_0.txt).
]

[section:collating_names Collating Names]

[section:digraphs Digraphs]

The following are treated as valid digraphs when used as a collating name:

"ae", "Ae", "AE", "ch", "Ch", "CH", "ll", "Ll", "LL", "ss", "Ss", "SS", "nj", "Nj", "NJ", "dz", "Dz", "DZ", "lj", "Lj", "LJ".

For example, the expression:

[pre
\[\[.ae.\]-c\]
]

will match any character that collates between the digraph "ae" and the character "c".

[endsect]

[section:posix_symbolic_names POSIX Symbolic Names]

The following symbolic names are recognised as valid collating element names,
in addition to any single character. This allows you to write, for example:

[pre
\[\[.left-square-bracket.\]\[.right-square-bracket.\]\]
]

to match either "\[" or "\]".

[table
[[Name][Character]]
[[NUL] [\\x00]]
[[SOH] [\\x01]]
[[STX] [\\x02]]
[[ETX] [\\x03]]
[[EOT] [\\x04]]
[[ENQ] [\\x05]]
[[ACK] [\\x06]]
[[alert] [\\x07]]
[[backspace] [\\x08]]
[[tab] [\\t]]
[[newline] [\\n]]
[[vertical-tab] [\\v]]
[[form-feed] [\\f]]
[[carriage-return] [\\r]]
[[SO] [\\x0E]]
[[SI] [\\x0F]]
[[DLE] [\\x10]]
[[DC1] [\\x11]]
[[DC2] [\\x12]]
[[DC3] [\\x13]]
[[DC4] [\\x14]]
[[NAK] [\\x15]]
[[SYN] [\\x16]]
[[ETB] [\\x17]]
[[CAN] [\\x18]]
[[EM] [\\x19]]
[[SUB] [\\x1A]]
[[ESC] [\\x1B]]
[[IS4] [\\x1C]]
[[IS3] [\\x1D]]
[[IS2] [\\x1E]]
[[IS1] [\\x1F]]
[[space] [\\x20]]
[[exclamation-mark] [!]]
[[quotation-mark] ["]]
[[number-sign] [#]]
[[dollar-sign] [$]]
[[percent-sign] [%]]
[[ampersand] [&]]
[[apostrophe] [\']]
[[left-parenthesis] [(]]
[[right-parenthesis] [)]]
[[asterisk] [\*]]
[[plus-sign] [+]]
[[comma] [,]]
[[hyphen] [-]]
[[period] [.]]
[[slash] [ / ]]
[[zero] [0]]
[[one] [1]]
[[two] [2]]
[[three] [3]]
[[four] [4]]
[[five] [5]]
[[six] [6]]
[[seven] [7]]
[[eight] [8]]
[[nine] [9]]
[[colon] [\:]]
[[semicolon] [;]]
[[less-than-sign] [<]]
[[equals-sign] [=]]
[[greater-than-sign] [>]]
[[question-mark] [?]]
[[commercial-at] [@]]
[[left-square-bracket] [\[]]
[[backslash][\\]]
[[right-square-bracket][\]]]
[[circumflex][^]]
[[underscore][_]]
[[grave-accent][`]]
[[left-curly-bracket][{]]
[[vertical-line][|]]
[[right-curly-bracket][}]]
[[tilde][~]]
[[DEL][\\x7F]]
]

[endsect]

[section:named_unicode Named Unicode Characters]

When using [link boost_regex.unicode Unicode aware regular expressions] (with
the `u32regex` type), all the normal symbolic names for Unicode characters
(those given in Unidata.txt) are recognised. For example:

[pre
\[\[.CYRILLIC CAPITAL LETTER I.\]\]
]

would match the Unicode character 0x0418.

[endsect]

[endsect]