Workflow and Metadata Challenges in the ParlaMint Project: Insights from Building the ParlaMint-UA Corpus
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3A10475893" target="_blank" >RIV/00216208:11320/23:10475893 - isvavai.cz</a>
Result on the web
<a href="https://office.clarin.eu/v/CE-2023-2328_CLARIN2023_ConferenceProceedings.pdf#page=75" target="_blank" >https://office.clarin.eu/v/CE-2023-2328_CLARIN2023_ConferenceProceedings.pdf#page=75</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Workflow and Metadata Challenges in the ParlaMint Project: Insights from Building the ParlaMint-UA Corpus
Original language description
The speeches in ParlaMint corpora of parliamentary proceedings are marked by their speaker, and the speakers are then paired with various metadata, also with their time-delimited affiliations with political parties or parliamentary groups. These are stored separately, and are also associated with further information. This paper discusses the addition of metadata on political parties and parliamentary groups, encoding their political position on various issues, in particular their categorisation on the left-to-right political spectrum. The paper explains our sources for this information, the process of data collection, and its encoding in the corpora. This additional metadata should be of interest to parliamentary data research, while the methodology developed could be used to add further metadata to the ParlaMint corpora.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/LM2023062" target="_blank" >LM2023062: Digital Research Infrastructure for Language Technologies, Arts and Humanities</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
CLARIN Annual Conference Proceedings 2023
ISBN
—
ISSN
2773-2177
e-ISSN
—
Number of pages
4
Pages from-to
67-70
Publisher name
CLARIN ERIC
Place of publication
Leuven, Belgium
Event location
Leuven, Belgium
Event date
Oct 16, 2023
Type of event by nationality
EUR - Evropská akce
UT code for WoS article
—