Regular Expressions Flashcards
What does the regex operator ^ signify?
The ^ operator signifies the start of a line or string. It is used to match a pattern only if it appears at the beginning.
What does the regex operator $ signify?
The $ operator signifies the end of a line or string. It is used to match a pattern only if it appears at the end.
What does the regex operator . signify?
The . operator matches any single character except a newline. It acts as a wildcard in regex.
What does the regex operator * signify?
The * operator matches zero or more occurrences of the preceding character or group. For example, a* matches ‘’, ‘a’, ‘aa’, etc.
What is the function of the regex operator *?
The * operator matches zero or more occurrences of the preceding character or group. It allows for repetition, including none at all.
What does the regex operator + signify?
The + operator matches one or more occurrences of the preceding character or group. For example, a+ matches ‘a’, ‘aa’, ‘aaa’, etc.
What does the regex operator ? signify?
The ? operator matches zero or one occurrence of the preceding character or group. For example, colou?r matches both ‘color’ and ‘colour’.
What does the regex operator [] signify?
Square brackets [] define a character set and match any one character within the set. For example, [abc] matches ‘a’, ‘b’, or ‘c’.
What does the regex operator [^ ] do?
The [^ ] operator defines a negated character set and matches any character not in the set. For example, [^abc] matches any character except ‘a’, ‘b’, or ‘c’.
What does the regex operator | signify?
The | operator acts as an OR condition and matches either pattern on its sides. For example, cat|dog matches ‘cat’ or ‘dog’.
What does the regex operator () signify?
Parentheses () are used for grouping patterns. They allow regex operators to act on the entire group as a unit.
What does the regex operator \d signify?
The \d operator matches any digit (0–9). It is equivalent to the character set [0-9].
What does the regex operator \D signify?
The \D operator matches any non-digit character. It is equivalent to the negated set [^0-9].
What does the regex operator \w signify?
The \w operator matches any word character (alphanumeric and underscore). It is equivalent to [a-zA-Z0-9_].
What does the regex operator \W signify?
The \W operator matches any non-word character. It is equivalent to the negated set [^a-zA-Z0-9_].
What does the regex operator \s signify?
The \s operator matches any whitespace character, such as spaces, tabs, or newlines.
What does the regex operator \S signify?
The \S operator matches any non-whitespace character.
What does the regex operator {n} signify?
The {n} operator matches exactly ‘n’ occurrences of the preceding character or group. For example, a{3} matches ‘aaa’.
What does the regex operator {n,} signify?
The {n,} operator matches ‘n’ or more occurrences of the preceding character or group. For example, a{2,} matches ‘aa’, ‘aaa’, etc.
What does the regex operator {n,m} signify?
The {n,m} operator matches between ‘n’ and ‘m’ occurrences of the preceding character or group. For example, a{2,4} matches ‘aa’, ‘aaa’, or ‘aaaa’.
What does the regex operator \ signify?
The \ operator escapes special characters, allowing them to be treated as literals. For example, \* matches the ‘*’ character instead of treating it as a regex operator.
What does the regex operator \b signify?
The \b operator matches a word boundary, ensuring the match occurs at the start or end of a word.
What does the regex operator \B signify?
The \B operator matches a non-word boundary, ensuring the match occurs within a word.
What does the regex operator (?=…) signify?
The (?=…) operator is a positive lookahead, ensuring that a certain pattern follows the current position without consuming characters.
What does the regex operator (?!…) signify?
The (?!…) operator is a negative lookahead, ensuring that a certain pattern does not follow the current position without consuming characters.
What does the regex operator (?<=…) signify?
The (?<=…) operator is a positive lookbehind, ensuring that a certain pattern precedes the current position without consuming characters.
What does the regex operator (?<!…) signify?
The (?<!…) operator is a negative lookbehind, ensuring that a certain pattern does not precede the current position without consuming characters.
^hello
Matches any string that starts with ‘hello’. For example, it will match ‘hello world’ but not ‘a hello world’.
.*
Matches zero or more of any character except newline. This can match an empty string, ‘abc’, or even ‘hello123’.
\d{3}-\d{2}-\d{4}
Matches a Social Security number format, such as ‘123-45-6789’. It consists of three digits, a dash, two digits, another dash, and four digits.
[A-Za-z]+
Matches one or more uppercase or lowercase letters. For example, it will match ‘Hello’ and ‘abc’ but not ‘123’.
colou?r
Matches both ‘color’ and ‘colour’. The ‘?’ means the preceding character ‘u’ is optional.
\bword\b
Matches the word ‘word’ as a whole word only. It won’t match ‘sword’ or ‘wording’ but will match ‘word’.
\d+.?\d*
Matches integers or decimals. For example, it matches ‘123’, ‘45.67’, or ‘89.’.
(cat|dog)
Matches either ‘cat’ or ‘dog’. The ‘|’ acts as an OR operator between the two options.
^[a-z0-9_-]{3,16}$
Matches usernames 3 to 16 characters long, allowing lowercase letters, numbers, underscores, and hyphens. It enforces both the length and allowed characters.
\w+@\w+.\w+
Matches simple email addresses such as ‘example@mail.com’. It requires a word, an ‘@’, another word, a ‘.’, and a final word.
\s{2,}
Matches two or more whitespace characters in a row. Useful for finding excessive spaces.
(?i)hello
Matches ‘hello’ in a case-insensitive way. It will match ‘HELLO’, ‘hello’, or ‘HeLLo’.
[0-9]{4,}-[A-Z]{2}
Matches a pattern like ‘1234-AB’ where four or more digits are followed by a dash and two uppercase letters.
(https?|ftp)://[^\s/$.?#].[^\s]*
Matches URLs that start with ‘http’, ‘https’, or ‘ftp’, such as ‘https://example.com’.
^.{8,}$
Matches any string that is at least 8 characters long. The ‘.’ matches any character, and ‘{8,}’ enforces the minimum length.
^[^@]+@[^@]+.[a-zA-Z]{2,}$
Matches valid email addresses by ensuring there is no ‘@’ at the start, no ‘@’ immediately before the domain, and the domain has at least two letters.
([A-Z][a-z]*){2,}
Matches strings with at least two capitalized words, such as ‘JohnDoe’ or ‘JaneSmith’.
([01]?\d|2[0-3]):[0-5]\d
Matches valid 24-hour time formats, such as ‘23:59’, ‘5:30’, or ‘00:00’.
(?<=$)\d+(.\d{2})?
Matches monetary values preceded by a ‘$’ symbol, such as ‘$100’ or ‘$99.99’. It uses a positive lookbehind.
^[A-Fa-f0-9]{6}$
Matches valid hexadecimal color codes, such as ‘FFAACC’ or ‘abcdef’.