URL Host Extraction Regex for Java
/^[a-zA-Z][a-zA-Z0-9+\-.]{0,20}://(?:[^@/\s]+@)?([a-zA-Z0-9\-._\[\]]+)(?::[0-9]+)?(?:/[^\s]*)?$/iWhat this pattern does
This page provides a comprehensive, battle-tested regular expression for matching url host extraction, ported and verified for Java. A rigorously tested regex reduces debugging time and protects your application from edge-case failures. The snippet below is ready to drop into your Java project — whether you're validating in a Spring Boot controller, a Jakarta EE service, or a standalone utility class.
Java Implementation
// URL Host Extraction
// ReDoS-safe | RegexVault — Web & Network > URL
import java.util.regex.Pattern;
public class UrlHostExtractionValidator {
private static final Pattern PATTERN =
Pattern.compile("^[a-zA-Z][a-zA-Z0-9+\\-.]{0,20}://(?:[^@/\\s]+@)?([a-zA-Z0-9\\-._\\[\\]]+)(?::[0-9]+)?(?:/[^\\s]*)?$");
public static boolean validate(String input) {
return PATTERN.matcher(input).matches();
}
// Example
public static void main(String[] args) {
System.out.println(validate("https://example.com")); // true
}
}Test Cases
Matches (Valid) | Rejects (Invalid) |
|---|---|
https://example.com | not-a-url |
http://user:pass@api.example.com/path | ://no-scheme.com |
https://192.168.1.1:8080/resource | example.com |
ws://socket.example.com:3000 | https://example.com |
ftp://files.example.net | — |
When to use this pattern
This pattern is drawn from the Web & Network > URL category and carries a ReDoS-safe certification. That matters for Java developers because critical in Java applications since the JVM regex engine uses backtracking and is susceptible to ReDoS without careful pattern design. RegexVault audits patterns against known backtracking attack vectors, ensuring you have the necessary context before using this regex in a high-stakes production environment.
Common Pitfalls
The credentials (user:pass@) must be fully consumed before capturing the host, otherwise the @ character breaks the extraction.
Technical Notes
Capture group 1 contains the hostname or IP (including IPv6 brackets if present). Handles optional auth prefix before @.
Have a pattern that belongs in the vault?
Submit it for review — community-verified patterns get credited to your GitHub handle. Free submissions join the queue. Priority review available for $15.
Submit a Pattern