top of page
SNOMED CT: Bridging language differences between training and querying AI models for healthcare

Sorano.net

SNOMED CT: Bridging language differences between training and querying AI models for healthcare

Country / Region
EMEA
Tags
Artificial intelligence, Clinical Practice, Data analytics, Global/International, Implementation, Tooling

The healthcare landscape is increasingly witnessing the transformative potential of generative artificial intelligence (AI) across various applications, including the automation of clinical documentation and the enhancement of patient communication. Recent advances in large language model (LLM) architectures and transformer based sequence forecasting have opened new perspectives for longitudinal clinical decision support (CDS). However, a significant impediment to achieving equitable and effective healthcare delivery, particularly within the realm of AI applications, lies in the inherent linguistic diversity of global populations. This diversity encompasses not only the multitude of languages spoken worldwide but also the variations within those languages, including dialects and culturally specific expressions (i.e. practice). Generative AI models, often trained predominantly on English language data, frequently encounter difficulties in accurately processing and understanding different languages and their nuances. This limitation can lead to inaccurate translations, misinterpretations of critical medical information, and ultimately, the exacerbation of existing health disparities among different linguistic communities. This project explore the features and capabilities of SNOMED CT in the context of using LLMs and elucidate how they can effectively address the challenges posed by linguistic diversity in the application of generative AI within clinical practice, ultimately fostering more inclusive and accurate healthcare solutions. The presentation will demonstrate how the Foresight Timeline generation tool [i] using synthetic patient data based on various instances of the SNOMED Basic Synthetic Patient Data Generator (BSPG)[ii] performs with various AI models and languages (editions of SNOMED CT).

[i] Foresight: https://foresight.sites.er.kcl.ac.uk/

[ii] BSPG: https://github.com/IHTSDO/health-data-analytics/blob/ui-prototyping/generator/README.md#basic-synthetic-patient-data-generator

Description

This project aims to provide a framework to asses how SNOMED CT can contribute to semantic interoperability with generative AI. Overcoming the challenges posed by linguistic diversity in the application of generative AI within clinical practice (precision).

Scope

SNOMED CT (Systematized Nomenclature of Medicine – Clinical Terms) emerges as a comprehensive, standardized, and inherently multilingual clinical terminology system. Recognized as the most comprehensive clinical vocabulary available in any language, SNOMED CT holds substantial promise in acting as a bridge by providing a common semantic framework that transcends the barriers of different languages models.

How SNOMED CT will be used

To generate synthetic data for training or testing of the NLP pipeline (generative models).

Why SNOMED CT will be used

Contact

More information

Get SNOMED CT

Get SNOMED CT

Information about our license and fee structure

Learn more

Learn more

Explore the wide range of resources available to our community of practice

Subscribe to SNOMED International news

Stay up to date on SNOMED news, features, developments and newsletters by subscribing to our news service.

bottom of page