Vol. 8 (2022): Anna Wamprechtshammer – Elena Arestau – Jocelyn Aznar – Hanna Hedeland – Amy Isard – Ilya Khait – Herbert Lange – Nicole Majka – Felix Rau: QUEST: Guidelines and Specifications for the Assessment of Audiovisual, Annotated Language Data
This guide documents the main results of the joint project “QUEST: Quality – Established: Qualitätsstandards und Kurationskriterien für audiovisuelle annotierte Sprachdaten”, which was carried out between 2019 and 2022 and funded by the German Federal Ministry of Education and Research (BMBF). The project consortium consisted of the University of Hamburg, the Leibniz-Centre General Linguistics (ZAS) in Berlin, the Archive for Spoken German (AGD)/Institute for the German Language (IDS) in Mannheim and the University of Cologne. The BBAW in Berlin was also involved through the ‘Endangered Languages Documentation Programme’.
The main aim of the project was to maximise the potential for reuse and secondary use of audiovisual, annotated language data. For this purpose, QUEST developed quality standards and curation criteria for several reuse scenarios such as ‘Language Documentation’, ‘Learner Corpora’, ‘Interpreted Corpora’, ‘Sign Language’, ‘Language Community’, ‘Ethnography’ and ‘Oral History’. Based on these standards and criteria, quality assurance procedures (an online questionnaire and automated quality checks) were implemented and tested on authentic data.
In summary, the guidelines document provides definitions and examples for the quality criteria elaborated in QUEST, which are intended to provide information on the reuse potential of audiovisual, annotated data. It also gives an overview of the objects and workflows of the evaluation system. Quality standards and curation criteria are linked to data maturity levels, and suggestions are made on how to evaluate each criterion.