Skip to content

Instantly share code, notes, and snippets.

@eightyknots
Last active July 7, 2024 20:19
Show Gist options
  • Save eightyknots/4372d1166a192d5e9754 to your computer and use it in GitHub Desktop.
Save eightyknots/4372d1166a192d5e9754 to your computer and use it in GitHub Desktop.
AvReg: Aviation regex match toolkit

AvReg: The Aviation RegEx Match Toolkit

General Tips

  • The PCRE flavour of RegEx is used here.
  • Append the i modifier to the end of the regex to make any pattern case-insensitive.

Aircraft

Purpose Description RegEx Example
Registration National registration /^[A-Z]-[A-Z]{4}|[A-Z]{2}-[A-Z]{3}|N[0-9]{1,5}[A-Z]{0,2}$/ N390HA
IATA aircraft type 3-character type code /^[A-Z0-9]{3}$/ 32N
ICAO aircraft type (Typically) 4-character type code /^[A-Z]{1}[A-Z0-9]{1,3}$/ A32N

Airline Codes

Purpose Description RegEx Example
IATA code Commercial service mark /^[A-Z][\d]|[\d][A-Z]|[A-Z]{2}$/ CX
ICAO code Operational service mark /^[A-Z]{3}$/ CPA
Ticketing prefix eTicket operator code /^[0-9]{3}$/ 160

Airport Codes

Purpose Description RegEx Example
IATA code Commercial service mark /^[A-Z]{3}$/ LHR
ICAO code Operational service mark /^[A-Z]{4}$/ EGLL
FAA code US FAA-specific locator /^[A-Z0-9]{3,4}$/ L67

Air Navigation & Communication

Notes:

  • Privately-owned Canadian NDBs may utilize a letter and number combination.
  • VFR squawk codes are generally 1200 in North America and 7000 in Europe.
Purpose Description RegEx Example
NDB Non-directional beacon identifier /^[A-Z]{1,3}$/ TD
VOR VHF omnidirectional range ident /^[A-Z]{3}$/ APU
INT Airway intersection waypoint /^[A-Z]{5}$/ PRAWN
Squawk Code Unique transponder octal code /^[0-7]{4}$/ 0318
Distress If match, aircraft is in distress /^7[567]00$/ 7700
VFR Squawk XPDR code for aircraft under VFR /^(1200)|(7000)$/ 1200
Runways Standard runway identifiers 01-36 /^(0?[1-9]|[1-2]\d|3[0-6])[LCR]?$/ 36L

Ticketing & Business Operations

Note that for PNR record identifiers, some GDS providers and operators use 5-character PNR idents, but most use 6-character ones. Additionally, for readibility purposes, some airlines and systems will skip 0, 1, I, L, and O.

Purpose Description RegEx Example
PNR identifier Passenger record locator /^[A-Z0-9]{5,6}$/ J5XTP2
E-ticket number Ticketing and itinerary identifier /^[0-9]{3}(-)?[0-9]{10}$/ 160-4837291830

Flight Operations

Purpose Description RegEx Example
Flight number IATA (marketing) flight number /^([A-Z][\d]|[\d][A-Z]|[A-Z]{2})(\d{1,})$/ BA026
Callsign ICAO (operational) flight number /^[A-Z]{3}[A-Z0-9]{1,}$/ BAW319K
@mtowers
Copy link

mtowers commented Feb 21, 2023

The FAA airport code regex is incomplete. FAA identifiers can be 3-5 alphanumeric characters (with some exceptions). Although, I can't find any examples of a five-character location.

I think something like this is a little closer: /(\b[A-Z0-9]{3,4}\b)+/

As per Wikipedia on FAA identifiers:

The Federal Aviation Administration location identifier (FAA LID) is a three- to five-character alphanumeric code identifying aviation related facilities inside the United States, though some codes are reserved for, and are managed by other entities.[1]: §1–2-1 

For nearly all major airports, the assigned identifiers are alphabetic three-letter codes, such as ORD for Chicago O’Hare International Airport. Minor airfields are typically assigned a mix of alphanumeric characters, such as 8N2 for Skydive Chicago Airport and 0B5 for Turners Falls Airport. Private airfields are assigned a four-character identifier, such as 1CA9 for Los Angeles County Fire Department Heliport. The location identifiers are coordinated with the Transport Canada Identifiers described below.

In general, the FAA has authority to assign all three-letter identifiers (except those beginning with the letters K, N, W, and Y), all three and four character alphanumeric identifiers, and five-letter identifiers for the United States and its jurisdictions. The Department of the Navy assigns three-letter identifiers beginning with the letter N for the exclusive use of that Department. Transport Canada assigns three character identifiers beginning with Y. The block beginning with letter Q is under international telecommunications jurisdiction, but is used internally by FAA Technical Operations to identify National Airspace equipment not covered by any other identifying code system. The block beginning with Z identifies United States Air Route Traffic Control Centers.[1]: §1–2-2

@eightyknots
Copy link
Author

@mtowers - Thank you, I've updated the FAA identifiers per your suggestion with the exception of using $^ tokens instead of word boundaries. This is just to keep everything else in line.

@Tricky-D & @obotor - Thanks for your inputs as well. I will review & make update shortly on registration numbers. It may come down to splitting US since we can be more specific with FAA registrations.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment