![]() ![]() But \c* might match a line feed or the letter j. Either way through \c_ would match control characters 0x00 through 0x1F. Or the application may just flip bit 0x40. The application may take the last 5 bits that character index in the code page or its Unicode code point to form an ASCII control character. Some allow any character after \c while other allow ASCII characters. Using characters other than letters after \c is not recommended because the behavior is inconsistent between applications. Only Java requires the A to Z to be uppercase. Most flavors allow the second letter to be lowercase, with no difference in meaning. \cM matches a carriage return, just like \r, \x0D, and \u000D. These are equivalent to \x01 through \x1A (26 decimal). The second letter is an uppercase letter A through Z, to indicate Control+A through Control+Z. The letter after the backslash is always a lowercase c. Many regex flavors also support the tokens \cA through \cZ to insert ASCII control characters. JGsoft V2 matches any vertical whitespace with \v. The JGsoft flavor originally matched only the vertical tab with \v. Earlier versions treated it as a needlessly escaped literal v. Perl 5.10, PCRE 7.2, PHP 5.2.4, R, Delphi XE, and later versions treat it as a shorthand. That includes the vertical tab, form feed, and all line break characters. In other flavors, \v is a shorthand that matches any vertical whitespace character. In some flavors, \v matches the vertical tab (ASCII 0x0B). Remember that Windows text files use \r\n to terminate lines, while UNIX text files use \n. More exotic non-printables are \a (bell, 0x07), \e (escape, 0x1B), and \f (form feed, 0x0C). Use \t to match a tab character (ASCII 0x09), \r for carriage return (0x0D) and \n for line feed (0x0A). You can use special character sequences to put non-printable characters in your regular expression. ![]()
0 Comments
Leave a Reply. |
Details
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |