Terminology of the public relations field: corpus — automatic term recognition — terminology database

  • Nataša Logar Berginc Univerza v Ljubljani, Fakulteta za družbene vede
  • Špela Vintar Univerza v Ljubljani, Filozofska fakulteta
  • Špela Arhar Holdt Trojina, zavod za uporabno slovenistiko
Keywords: Slovene terminology, KoRP public relations corpus, TERMIS project, LUIZ term extraction system

Abstract

The article describes an analysis of automatic term recognition results performed for single- and multi-word terms with the LUIZ term extraction system. The target application of the results is a terminology database of Public Relations and the main resource the KoRP Public Relations Corpus. Our analysis is focused on two segments: (a) single-word noun term candidates, which we compare with the frequency list of nouns from KoRP and evaluate termhood on the basis of the judgements of two domain experts, and (b) multi-word term candidates with verb and noun as headword. In order to better assess the performance of the system and the soundness of our approach we also performed an analysis of recall.
Our results show that the terminological relevance of extracted nouns is indeed higher than that of merely frequent nouns, and that verbal phrases only rarely count as proper terms. The most productive patterns of multi-word terms with noun as a headword have the following structure: [adjective + noun], [adjective + and + adjective + noun] and [adjective + adjective + noun]. The analysis of recall shows low inter-annotator agreement, but nevertheless very satisfactory recall levels.

Downloads

Download data is not yet available.

References

Logar Berginc, N., Vintar, Š., Arhar Holdt, Š. (2013): Terminologija odnosov z javnostmi: korpus — luščenje — terminološka podatkovna zbirka. Slovenščina 2.0, 1 (2): 113–138.
Published
2013-12-01
How to Cite
Logar BergincN., Vintar Špela, & Arhar Holdt Špela. (2013). Terminology of the public relations field: corpus — automatic term recognition — terminology database. Slovenščina 2.0: Empirical, Applied and Interdisciplinary Research, 1(2), 113-138. https://doi.org/10.4312/slo2.0.2013.2.113-138