URL Scheme Extraction Regex for Python
/^([a-zA-Z][a-zA-Z0-9+\-.]{0,20}):///
What this pattern does
This page provides a regular expression for extracting the scheme from a URL, ported and verified for Python. A tested regex reduces debugging time and guards your application against edge-case failures. The snippet below can be dropped into a Python project, whether you are validating in a Django view, a FastAPI endpoint, or a standalone data-processing script.
Python Implementation
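# URL Scheme Extraction
# ReDoS-safe | RegexVault: Web & Network > URL
import re

# Group 1 captures the scheme; the pattern covers only the "scheme://" prefix.
url_scheme_extraction_pattern = re.compile(r'^([a-zA-Z][a-zA-Z0-9+\-.]{0,20})://')

def validate_url_scheme_extraction(value: str) -> bool:
    # match() rather than fullmatch(): the pattern ends at "://", so
    # fullmatch() would reject any URL with text after the scheme.
    return bool(url_scheme_extraction_pattern.match(value))

# Example
print(validate_url_scheme_extraction("https://example.com"))  # True

Test Cases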
| Matches (Valid) | Rejects (Invalid) |
|---|---|
| https://example.com | ://example.com |
| ftp://files.example.com | 1http://example.com |
| ws://socket.example.com | example.com |
| git+ssh://github.com/user/repo | http//example.com |
| coap://iot.example.com | http:/example.com |
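The test cases above can be checked mechanically. The following is a minimal sketch, not the site's own test suite: the names SCHEME_RE, has_url_scheme, VALID, and INVALID are hypothetical, and the helper uses match() on the assumption that only the "scheme://" prefix needs to match.

```python
import re

# Same pattern as in the implementation section.
SCHEME_RE = re.compile(r'^([a-zA-Z][a-zA-Z0-9+\-.]{0,20})://')

def has_url_scheme(value: str) -> bool:
    # Hypothetical helper: match() anchors at the start of the string
    # and allows arbitrary text after "://".
    return SCHEME_RE.match(value) is not None

# Inputs copied from the test-case table.
VALID = [
    "https://example.com",
    "ftp://files.example.com",
    "ws://socket.example.com",
    "git+ssh://github.com/user/repo",
    "coap://iot.example.com",
]
INVALID = [
    "://example.com",       # empty scheme
    "1http://example.com",  # scheme starts with a digit
    "example.com",          # no scheme at all
    "http//example.com",    # missing colon
    "http:/example.com",    # only one slash
]

for url in VALID:
    assert has_url_scheme(url), f"expected a match: {url}"
for url in INVALID:
    assert not has_url_scheme(url), f"expected no match: {url}"
```

Running the script silently confirms every row; a failing row raises an AssertionError naming the input.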
When to use this pattern
This pattern is drawn from the Web & Network > URL category and carries a ReDoS-safe certification. That matters for Python developers because CPU-bound regex operations can stall concurrent request handling in Python web servers. RegexVault audits patterns against known backtracking attack vectors, which gives you the context you need before using this regex in a high-stakes production environment.
Common Pitfalls
Schemes are case-insensitive per RFC 3986 — normalize to lowercase after extraction.
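One way to apply this advice is to lowercase capture group 1 immediately after matching. This is a sketch under assumptions: extract_scheme and SCHEME_RE are hypothetical names, and returning None for inputs without a scheme is an illustrative choice.

```python
import re
from typing import Optional

SCHEME_RE = re.compile(r'^([a-zA-Z][a-zA-Z0-9+\-.]{0,20})://')

def extract_scheme(url: str) -> Optional[str]:
    """Return the lowercased scheme, or None if the input has no scheme."""
    m = SCHEME_RE.match(url)
    if m is None:
        return None
    # Normalize so "HTTPS" and "https" compare equal downstream.
    return m.group(1).lower()

print(extract_scheme("HTTPS://Example.com"))             # https
print(extract_scheme("Git+SSH://github.com/user/repo"))  # git+ssh
print(extract_scheme("example.com"))                     # None
```

Only the scheme is lowercased here; the rest of the URL is left untouched.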
Technical Notes
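Capture group 1 contains the scheme. Scheme syntax per RFC 3986: a letter, followed by any combination of letters, digits, "+", "-", and ".". The {0,20} quantifier applies to the characters after the first letter, so the scheme is capped at 21 characters in total; this cap is specific to this pattern and prevents runaway matching on non-URL strings.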
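The effect of the length cap can be seen at the boundary. The sketch below uses made-up schemes built from repeated letters purely to probe the limit; SCHEME_RE, longest_ok, and too_long are hypothetical names.

```python
import re

SCHEME_RE = re.compile(r'^([a-zA-Z][a-zA-Z0-9+\-.]{0,20})://')

# One leading letter plus 20 further characters: the longest scheme accepted.
longest_ok = "a" * 21 + "://host"
# One character more, and "://" can no longer follow within the cap.
too_long = "a" * 22 + "://host"

m = SCHEME_RE.match(longest_ok)
assert m is not None
assert len(m.group(1)) == 21  # group 1 holds the whole scheme

assert SCHEME_RE.match(too_long) is None
```

If longer schemes need to be accepted, the upper bound in the quantifier would have to be raised.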