Products
In-IDE
IDE extension that lets you fix coding issues before they exist!
Discover SonarQube for IDE
SaaS
Setup is effortless and analysis is automatic for most languages
Discover SonarQube Cloud
Self-Hosted
Fast, accurate analysis; enterprise scalability
Discover SonarQube Server

Python static code analysis

Unique rules to find Bugs, Vulnerabilities, Security Hotspots, and Code Smells in your PYTHON code

Filtered: 26 rules found

regex

Impact

Clean code attribute

Regular expression quantifiers and character classes should be used concisely

intentionality - clear

maintainability

Code Smell

Quick FixIDE quick fixes available with SonarQube for IDE

Why is this an issue?

A regular expression is a sequence of characters that specifies a match pattern in text. Among the most important concepts are:

Character classes: defines a set of characters, any one of which can occur in an input string for a match to succeed.
Quantifiers: used to specify how many instances of a character, group, or character class must be present in the input for a match.
Wildcard (.): matches all characters except line terminators (also matches them if the s flag is set).

Many of these features include shortcuts of widely used expressions, so there is more than one way to construct a regular expression to achieve the same results. For example, to match a two-digit number, one could write [0-9]{2,2} or \d{2}. The latter is not only shorter but easier to read and thus to maintain.

This rule recommends replacing some quantifiers and character classes with more concise equivalents:

\d for [0-9] and \D for [^0-9]
\w for [A-Za-z0-9_] and \W for [^A-Za-z0-9_]
. for character classes matching everything (e.g. [\w\W], [\d\D], or [\s\S] with s flag)
x? for x{0,1}, x* for x{0,}, x+ for x{1,}, x{N} for x{N,N}

r"[0-9]"        # Noncompliant - same as r"\d"
r"[^0-9]"       # Noncompliant - same as r"\D"
r"[A-Za-z0-9_]" # Noncompliant - same as r"\w"
r"[\w\W]"       # Noncompliant - same as r"."
r"a{0,}"        # Noncompliant - same as r"a*"

Use the more concise version to make the regex expression more readable.

r"\d"
r"\D"
r"\w"
r"."
r"a*"

Available In:

Catch issues on the fly,
in your IDE

Detect issues in your GitHub, Azure DevOps Services, Bitbucket Cloud, GitLab repositories

Analyze code in your
on-premise CI

Available Since
9.4

Analyze code in your
on-premise CI

Developer Edition
Available Since
9.4

In-IDE

SaaS

Self-Hosted