Indian GSTIN (GST Identification Number) Regex for Java
/^[0-3][0-9][A-Z]{5}[0-9]{4}[A-Z][1-9A-Z]Z[0-9A-Z]$/What this pattern does
This page provides a well-structured, multi-part regular expression for matching indian gstin (gst identification number), ported and verified for Java. Financial data validation has zero tolerance for false negatives — a missed invalid entry can corrupt downstream calculations. The snippet below is ready to drop into your Java project — whether you're validating in a Spring Boot controller, a Jakarta EE service, or a standalone utility class.
Java Implementation
// Indian GSTIN (GST Identification Number)
// ReDoS-safe | RegexVault — Finance > Tax & Registration
import java.util.regex.Pattern;
public class IndianGstinGstIdentificationNumberValidator {
private static final Pattern PATTERN =
Pattern.compile("^[0-3][0-9][A-Z]{5}[0-9]{4}[A-Z][1-9A-Z]Z[0-9A-Z]$");
public static boolean validate(String input) {
return PATTERN.matcher(input).matches();
}
// Example
public static void main(String[] args) {
System.out.println(validate("27AAPFU0939F1ZV")); // true
}
}Test Cases
Matches (Valid) | Rejects (Invalid) |
|---|---|
27AAPFU0939F1ZV | AAPFU0939F1ZV |
29AABCR1718E1Z1 | 27aapfu0939f1zv |
19AADCB2230M1Z0 | 27AAPFU0939F1Z |
07AAACP0160H1Z9 | 40AAPFU0939F1ZV |
When to use this pattern
This pattern is drawn from the Finance > Tax & Registration category and carries a ReDoS-safe certification. That matters for Java developers because critical in Java applications since the JVM regex engine uses backtracking and is susceptible to ReDoS without careful pattern design. RegexVault audits patterns against known backtracking attack vectors, ensuring you have the necessary context before using this regex in a high-stakes production environment.
Common Pitfalls
GSTIN structure encodes the PAN (Personal Account Number) of the business owner — treat it as sensitive data. New UTs (like Ladakh, code 38) may require pattern updates.
Technical Notes
Structure: 2-digit state code (01-37) + 10-char PAN + 1 entity number + Z (default) + 1 checksum. State codes 01-37 (38 states/UTs). The 13th character is always Z. Checksum uses a modulus-based algorithm.
Have a pattern that belongs in the vault?
Submit it for review — community-verified patterns get credited to your GitHub handle. Free submissions join the queue. Priority review available for $15.
Submit a Pattern