Towards Visual Words to Words Text Detection with a General Bag of Words Representation
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F15%3A00236244" target="_blank" >RIV/68407700:21230/15:00236244 - isvavai.cz</a>
Result on the web
<a href="http://cmp.felk.cvut.cz/~chum/papers/Mehta-ICDAR2015.pdf" target="_blank" >http://cmp.felk.cvut.cz/~chum/papers/Mehta-ICDAR2015.pdf</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/ICDAR.2015.7333840" target="_blank" >10.1109/ICDAR.2015.7333840</a>
Alternative languages
Result language
English
Original language name
Towards Visual Words to Words Text Detection with a General Bag of Words Representation
Original language description
We address the problem of text localization and retrieval in real-world images. We are the first to study the retrieval of text images, i.e. the selection of images containing text from large collections at high speed. We propose a novel representation, textual visual words, which describes text by generic visual words that predict the bottom and top lines of text in a geometrically consistent way. The visual words are discretized SIFT descriptors of Hessian features. The features may correspond to various structures present in the text: character fragments, individual characters, or their arrangements. The textual words representation is invariant to affine transformations of the image and to local linear changes of intensity. Experiments demonstrate that the proposed method outperforms the state of the art on the MS dataset. The proposed method detects blurry, small-font, low-contrast, and noisy text in real-world images.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—
Result continuities
Project
The result was created during the realization of more than one project. More information is available in the Projects tab.
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2015
Confidentiality
S - Complete and true data about the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
ISBN
978-1-4799-1805-8
ISSN
1520-5363
e-ISSN
—
Number of pages
5
Pages from-to
641-645
Publisher name
IEEE
Place of publication
Piscataway
Event location
Nancy
Event date
Aug 23, 2015
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
000381461400127