Browsing by Subject "Document classification"
- Publication (Open Access): UMUCorpusClassifier: compilation and evaluation of linguistic corpus for Natural Language Processing tasks (Sociedad Española de Procesamiento del Lenguaje Natural, 2020). Almela, Ángela; García Díaz, José Antonio; Alcaraz Marmol, Gema; Valencia García, Rafael; Filología Inglesa.
  The development of an annotated corpus is a very time-consuming task. Although some researchers have proposed the automatic annotation of a corpus based on ad-hoc heuristics, valid hypotheses cannot always be made. Even when the annotation process is performed by human annotators, the quality of the corpus is heavily influenced by disagreements between annotators, or by annotators being inconsistent with their own earlier decisions. Therefore, a lack of supervision of the annotation process can lead to a poor-quality corpus. In this work, we propose a demonstration of UMUCorpusClassifier, an NLP tool that aids researchers in compiling corpora as well as in coordinating and supervising the annotation process. This tool eases the daily supervision process and makes it possible to detect deviations and inconsistencies during the early stages of the annotation process.