"A Novel of Character": Towards the Automatic Annotation of Characters in a Large Corpus of French Novels

Abstract: In this paper, we apply named entity recognition techniques to a corpus of literary texts, namely French novels from the 18th, 19th and 20th centuries. We obtain results that are usable but could be improved by using advanced annotation techniques. We discuss the use of active learning in this context, as well as the different applications that could be derived from this kind of annotation. In particular, we show that the automatic annotation of large literary corpora makes it possible to check whether traditional classifications exhibit specific structural patterns that could be identified automatically.

https://hal.archives-ouvertes.fr/hal-02265134
Contributor: Thierry Poibeau
Submitted on: Thursday, 8 August 2019 - 15:28:29
Last modified on: Sunday, 11 August 2019 - 01:08:32

File

char_corpora2019-rabu2019.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-02265134, version 1

Citation

B Rabu, F Mélanie, Thierry Poibeau. "A Novel of Character": Towards the Automatic Annotation of Characters in a Large Corpus of French Novels. International Conference on Corpus Linguistics 2019, Jun 2019, Saint Petersburg, Russia. ⟨hal-02265134⟩

Metrics

Record views

90

File downloads

41