Global Patient Set Tools
Semantic Tag–Based Filtering
SNOMED International makes available a tool to filter the downloaded GPS files based on a concept's fully specified name (FSN), which allows implementers to restrict GPS content to a specific clinical domain:
https://ihtsdo.github.io/snomed-gps-extractor/

Semantic tags are derived from the FSN and can be used to:
Include or exclude concepts based on high-level meaning (e.g. disorders, findings, substances)
Produce domain-specific GPS subsets
Reduce dataset size for constrained or focused implementations
How to Use the Tool
Your file is processed locally in the browser and is never uploaded to any server.
Upload: Drag and drop the downloaded SNOMED CT GPS file (TSV format).
Configure:
Toggle "Active Concepts Only" to exclude inactive records.
Select the desired Semantic Tags from the categorized list.
Add any Custom Tags if needed.
Process: Click "Process & Download" to get your filtered dataset.
There is also a Command-Line Interface (CLI) included in the GitHub repository, intended for automated and repeatable processing, such as CI/CD pipelines or scheduled dataset builds.
GPS Extractor Tool
This is a utility tool for extracting and processing SNOMED CT terminology data from an RF2 release. It produces the SNOMED International GPS (Global Patient Set) format and offers advanced filtering capabilities via both a command-line interface (CLI) and a modern web interface.
Role in GPS Implementations
Implementers may need to recreate the Global Patient Set (GPS) from an existing SNOMED CT Edition RF2 distribution into formats suitable for runtime use in applications, such as value sets, lookup tables, or terminology service artifacts. The snomed-gps-extractor tooling supports this process by providing a repeatable way to extract, normalize, and package GPS content from official SNOMED CT RF2 release files.
In a typical GPS implementation, the term extractor is used to:
Produce a flat list of SNOMED CT concepts and terms without hierarchical or relational semantics
Enable implementers to tailor GPS datasets to specific clinical or interoperability use cases
The extractor acts as a bridge between the full GPS and implementation-friendly artifacts, such as value sets or lookup tables.
Features
The SNOMED CT GPS Term Extractor provides the following capabilities to support GPS content preparation:
Term extraction from RF2
Semantic tag–based filtering
Interactive web-based processing
Active status filtering
Command-line execution
Term extraction from RF2
The extractor provides a controlled mechanism for deriving GPS-ready datasets from SNOMED CT RF2 releases by standardising how concepts and terms are selected and represented. This supports repeatable dataset generation across releases and reduces the need for custom RF2 processing logic within implementations.
The extracted output is designed to be stable and comparable over time, enabling consistent downstream use in exchange, indexing, and validation scenarios.
Active Concept Management
The extractor supports handling of concept lifecycle status, enabling implementations to:
Include both active and inactive concepts where historical traceability is required
Restrict outputs to active concepts only for current-state exchange and display use cases
This allows implementers to align GPS datasets with their data governance and clinical safety requirements.
Output Characteristics
The extractor produces GPS-compatible tab-separated files (TSV) designed to be:
Easy to inspect and validate
Simple to load into databases or terminology services
Straightforward to transform into other representations (e.g. FHIR ValueSet resources)
The output intentionally avoids RF2-specific complexity and is suitable for use in systems that do not natively support RF2.
Relationship to Full SNOMED CT Implementations
The GPS term extractor does not generate:
Concept hierarchies
Attribute relationships
Subsumption or inference logic
Implementations requiring full semantic reasoning or advanced terminology services should use a licensed SNOMED CT edition and a dedicated terminology server instead of GPS-derived artifacts.
Operational Guidance
Detailed operational guidance for installation, command-line usage, web interface operation, and configuration options is maintained in the GitHub repository for the SNOMED CT GPS Term Extractor. Implementers should refer to the repository documentation for the most current and authoritative instructions.
Access the Term Extractor here:
Last updated
