Rosetta-LSF: an Aligned Corpus of French Sign Language and French for Text-to-Sign Translation - Information, Langue Ecrite et Signée Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

Rosetta-LSF: an Aligned Corpus of French Sign Language and French for Text-to-Sign Translation

Élise Bertin-Lemée
  • Fonction : Auteur
  • PersonId : 1148932
Claire Danet
Boris Dauriac
  • Fonction : Auteur
  • PersonId : 1148933
Michael Filhol
Jérémie Segouat

Résumé

This article presents a new French Sign Language (LSF) corpus called Rosetta-LSF. It was created to support future studies on the automatic translation of written French into LSF, rendered through the animation of a virtual signer. An overview of the field highlights the importance of a quality representation of LSF. In order to obtain quality animations understandable by signers, it must surpass the simple "gloss transcription" of the LSF lexical units to use in the discourse. To achieve this, we designed a corpus composed of four types of aligned data, and evaluated its usability. These are: news headlines in French, translations of these headlines into LSF in the form of videos showing animations of a virtual signer, gloss annotations of the "traditional" type-although including additional information on the context in which each gestural unit is performed as well as their potential for adaptation to another context-and AZee representations of the videos, i.e. formal expressions capturing the necessary and sufficient linguistic information. This article describes this data, exhibiting an example from the corpus. It is available online for public research.
Fichier principal
Vignette du fichier
B-L_et_al_2022.pdf (1.37 Mo) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-03720096 , version 1 (11-07-2022)

Identifiants

  • HAL Id : hal-03720096 , version 1

Citer

Élise Bertin-Lemée, Annelies Braffort, Camille Challant, Claire Danet, Boris Dauriac, et al.. Rosetta-LSF: an Aligned Corpus of French Sign Language and French for Text-to-Sign Translation. 13th Conference on Language Resources and Evaluation (LREC 2022), Jun 2022, Marseille, France. ⟨hal-03720096⟩
332 Consultations
55 Téléchargements

Partager

Gmail Facebook X LinkedIn More