Sorano.net

SNOMED CT: Bridging language differences between training and querying AI models for healthcare

EMEA

Artificial intelligence, Clinical Practice, Data analytics, Global/International, Implementation, Tooling

The healthcare landscape is increasingly witnessing the transformative potential of generative artificial intelligence (AI) across various applications, including the automation of clinical documentation and the enhancement of patient communication. Recent advances in large language model (LLM) architectures and transformer based sequence forecasting have opened new perspectives for longitudinal clinical decision support (CDS). However, a significant impediment to achieving equitable and effective healthcare delivery, particularly within the realm of AI applications, lies in the inherent linguistic diversity of global populations. This diversity encompasses not only the multitude of languages spoken worldwide but also the variations within those languages, including dialects and culturally specific expressions (i.e. practice). Generative AI models, often trained predominantly on English language data, frequently encounter difficulties in accurately processing and understanding different languages and their nuances. This limitation can lead to inaccurate translations, misinterpretations of critical medical information, and ultimately, the exacerbation of existing health disparities among different linguistic communities. This project explore the features and capabilities of SNOMED CT in the context of using LLMs and elucidate how they can effectively address the challenges posed by linguistic diversity in the application of generative AI within clinical practice, ultimately fostering more inclusive and accurate healthcare solutions. The presentation will demonstrate how the Foresight Timeline generation tool [i] using synthetic patient data based on various instances of the SNOMED Basic Synthetic Patient Data Generator (BSPG)[ii] performs with various AI models and languages (editions of SNOMED CT).

[i] Foresight: https://foresight.sites.er.kcl.ac.uk/

[ii] BSPG: https://github.com/IHTSDO/health-data-analytics/blob/ui-prototyping/generator/README.md#basic-synthetic-patient-data-generator

Description

This project aims to provide a framework to asses how SNOMED CT can contribute to semantic interoperability with generative AI. Overcoming the challenges posed by linguistic diversity in the application of generative AI within clinical practice (precision).

Scope

SNOMED CT (Systematized Nomenclature of Medicine ‚Äì Clinical Terms) emerges as a comprehensive, standardized, and inherently multilingual clinical terminology system. Recognized as the most comprehensive clinical vocabulary available in any language, SNOMED CT holds substantial promise in acting as a bridge by providing a common semantic framework that transcends the barriers of different languages models.