Privacy-preserving mimic models for clinical named entity recognition in French - Information, Langue Ecrite et Signée Accéder directement au contenu
Article Dans Une Revue Journal of Biomedical Informatics Année : 2022

Privacy-preserving mimic models for clinical named entity recognition in French

Résumé

A vast amount of crucial information about patients resides solely in unstructured clinical narrative notes. There has been a growing interest in clinical Named Entity Recognition (NER) task using deep learning models. Such approaches require sufficient annotated data. However, there is little publicly available annotated corpora in the medical field due to the sensitive nature of the clinical text. In this paper, we tackle this problem by building privacy-preserving shareable models for French clinical Named Entity Recognition using the mimic learning approach to enable the knowledge transfer through a teacher model trained on a private corpus to a student model. This student model could be publicly shared without any access to the original sensitive data. We evaluated three privacy-preserving models using three medical corpora and compared the performance of our models to those of baseline models such as dictionary-based models. An overall macro F-measure of 70.6% could be achieved by a student model trained using silver annotations produced by the teacher model, compared to 85.7% for the original private teacher model. Our results revealed that these privacy-preserving mimic learning models offer a good compromise between performance and data privacy preservation.
Fichier principal
Vignette du fichier
Privacy_Preserving_Mimic_Models_for_clinical_Named_EntityRecognition__JBI_2021_.pdf (1.67 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03655039 , version 1 (08-05-2022)

Identifiants

Citer

Nesrine Bannour, Perceval Wajsbürt, Bastien Rance, Xavier Tannier, Aurélie Névéol. Privacy-preserving mimic models for clinical named entity recognition in French. Journal of Biomedical Informatics, 2022, 130, pp.104073. ⟨10.1016/j.jbi.2022.104073⟩. ⟨hal-03655039⟩
172 Consultations
244 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More