SNOMED CT Entity Linking Benchmark

A benchmark for linking text in medical notes to entities in SNOMED Clinical Taxonomy. #health

Benchmark
Open
66 joined

Model: SNOMED CT Entity Linking Super Dictionary
Reference Score

Abstract

A toolkit for building an extended synonym dictionary for the DrivenData SNOMED CT Entity Linking challenge. It consolidates multiple terminology sources into a single lookup table and trains a section-aware, precision-filtered dictionary following the approach of the 1st-place KIRIs solution from the original challenge.

Acknowledgements

This project builds directly on the winning solution by Team KIRIs (Guy Amit, Yonatan Bilu, Irena Girshovitz & Chen Yanover):

1st Place -- SNOMED CT Entity Linking Challenge https://github.com/drivendataorg/snomed-ct-entity-linking/tree/main/1st%20Place Licensed under the MIT License.

The KIRIs approach uses dictionary matching rather than ML models: it maps (section header, mention) pairs to SNOMED CT concept IDs, builds two dictionaries (case-sensitive and case-insensitive) from training data and SNOMED synonyms, resolves overlaps by preferring longer and section-specific matches, and applies post-processing with SNOMED CT relational data. Their solution achieved ~0.62 macro character-level IoU on the original challenge split.

Scores

Reference Scores
Name Macro char IoUSupport-weighted char IoU
SNOMED CT Entity Linking Super Dictionary
0.4582 0.6234