Data Mining from Free-Text Health Records : State of the Art, New Polish Corpus
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F20%3A00117842" target="_blank" >RIV/00216224:14330/20:00117842 - isvavai.cz</a>
Result on the web
<a href="https://nlp.fi.muni.cz/raslan/raslan20.pdf#page=21" target="_blank" >https://nlp.fi.muni.cz/raslan/raslan20.pdf#page=21</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Data Mining from Free-Text Health Records : State of the Art, New Polish Corpus
Original language description
This paper deals with data mining from free-form text electronic health records both from global perspective and with specific application to Slavic languages. It introduces the reader to the promises and challenges of this enterprise and provides a short overview of the global state of the art and of the general absence of this kind of research in Central European Slavic languages. It describes pl_ehr_cardio, a new corpus of Polish health records with 18 years’ worth of medical text. This paper marks the beginning of a pioneering research project in medical text data mining in Central European Slavic languages.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/LM2018101" target="_blank" >LM2018101: Digital Research Infrastructure for the Language Technologies, Arts and Humanities</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Others
Publication year
2020
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the Fourteenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2020
ISBN
9788026316008
ISSN
2336-4289
e-ISSN
—
Number of pages
10
Pages from-to
13-22
Publisher name
Tribun EU
Place of publication
Brno
Event location
Brno
Event date
Jan 1, 2020
Type of event by nationality
CST - Celostátní akce
UT code for WoS article
—