EFFICIENT UNCONSTRAINED STROKE DETECTOR

Result identifiers

Result code in IS VaVaI
RIV/68407700:21230/23:00366695 (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F23%3A00366695)
Result on the web
https://worldwide.espacenet.com/patent/search/family/054979874/publication/EP3380990B1?q=EP3380990B1
DOI - Digital Object Identifier
—

Alternative languages

Result language
English
Title in the original language
EFFICIENT UNCONSTRAINED STROKE DETECTOR
Result description in the original language
Scene text localization and recognition, a.k.a. the text-in-the-wild problem, is a key component of many applications such as automated translation, image/video database indexing, and assistance to the visually impaired. So far, unlike printed document OCR, no method has reached sufficient accuracy and speed for practical exploitation. There is, therefore, a need in the art for fast and accurate text localization and recognition. Stroke detection has applications in the extraction of structural information from images, such as bar codes. We propose a novel, easy-to-implement stroke detector which is significantly faster and produces significantly fewer false detections than the detectors commonly used by scene text localization methods. Following the observation that text in virtually any script is formed of strokes, stroke keypoints are efficiently detected (see Figure 1) and then exploited to obtain stroke segmentations. We also propose an efficient classification step to eliminate segmentations which do not correspond to text fragments. A text fragment can be a single character, a group of characters, a whole word, or a part of a character. The classifier exploits features already calculated in the detection phase and an effectively approximated "strokeness" feature, which plays an important role in discriminating between text fragments and background clutter. Finally, an efficient text clustering algorithm based on text direction voting is proposed to aggregate detected segmentations into text line structures and to allow processing by subsequent stages, e.g. an OCR module. When the proposed detector is plugged into a scene text localization and recognition pipeline, state-of-the-art text localization results are maintained whilst the processing time is significantly reduced. The proposed detector has high application potential, e.g. in real-time scene text detection in a video stream on mobile phones and embedded systems, since a straightforward, single-threaded, non-optimized version of the algorithm runs in near real time. Since all stages are scale and rotation invariant and are not script-specific, a wide variety of fonts and scripts such as Latin, Hebrew, and Chinese can be detected. The detector can be used to produce repeatable local features, e.g. for image matching and retrieval.

Classification

Type
P - Patent
CEP field
—
OECD FORD field
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)

Result linkages

Project
The result was created during the implementation of multiple projects. More information is available in the Projects tab.
Linkages
P - Research and development project financed from public funds (with a link to CEP)

Other

Year of implementation
2023
Data confidentiality code
S - Complete and true project data are not subject to protection under special legal regulations

Data specific to the result type

Patent or design number
EP3380990
Publisher
EPO_1 -
Publisher name
European Patent Office
Place of issue
Munich, The Hague, Berlin, Vienna, Brussels
Country of issue
—
Date of acceptance
7 June 2023
Owner name
České vysoké učení technické v Praze
Manner of use
A - The result is used only by the provider
Type of possible use
A - Another entity must always acquire a license to use the result

Similar results (10)

Real-Time Lexicon-Free Scene Text Localization and Recognition
A Real-Time Scene Text to Speech System
Real-time scene text localization and recognition
