Tools for Semi-automatic Preparation of Training Data for OCR
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F19%3A43955290" target="_blank" >RIV/49777513:23520/19:43955290 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1007/978-3-030-19823-7_29" target="_blank" >http://dx.doi.org/10.1007/978-3-030-19823-7_29</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-030-19823-7_29" target="_blank" >10.1007/978-3-030-19823-7_29</a>
Alternative languages
Result language
angličtina
Original language name
Tools for Semi-automatic Preparation of Training Data for OCR
Original language description
This work aims at data preparation for OCR systems based on recurrent neural networks. Precisely annotated data are necessary for training a network as well as for evaluation of OCR methods. Manual annotation is still needed in many cases, especially in the case of historical documents we are focusing on. Although there are several complex systems for historical document processing, to the best of our knowledge, a simple annotation tool for OCR data is missing. Therefore, we propose and implement a set of tools utilizing artificial intelligence that simplify the annotation process. These tools create ground truths for line images that are used for training of nowadays OCR systems.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/EF17_048%2F0007267" target="_blank" >EF17_048/0007267: Research and Development of Intelligent Components of Advanced Technologies for the Pilsen Metropolitan Area (InteCom)</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach<br>I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Others
Publication year
2019
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Artificial Intelligence Applications and Innovations
ISBN
978-3-030-19822-0
ISSN
1868-4238
e-ISSN
—
Number of pages
10
Pages from-to
351-361
Publisher name
Springer
Place of publication
Cham
Event location
Crete
Event date
May 24, 2019
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—