Dynamic Threshold Selection Method for Multi-label Newspaper Topic Identification

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F13%3A43920730" target="_blank" >RIV/49777513:23520/13:43920730 - isvavai.cz</a>
Výsledek na webu
<a href="http://link.springer.com/chapter/10.1007/978-3-642-40585-3_27" target="_blank" >http://link.springer.com/chapter/10.1007/978-3-642-40585-3_27</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-642-40585-3_27" target="_blank" >10.1007/978-3-642-40585-3_27</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Dynamic Threshold Selection Method for Multi-label Newspaper Topic Identification
Popis výsledku v původním jazyce
Nowadays, the multi-label classification is increasingly required in modern categorization systems. It is especially essential in the task of newspaper article topics identification. This paper presents a method based on general topic model normalisationfor finding a threshold defining the boundary between the "correct" and the "incorrect" topics of a newspaper article. The proposed method is used to improve the topic identification algorithm which is a part of a complex system for acquisition and storing large volumes of text data. The topic identification module uses the Naive Bayes classifier for the multiclass and multi-label classification problem and assigns to each article the topics from a defined quite extensive topic hierarchy - it containsabout 450 topics and topic categories. The results of the experiments with the improved topic identification algorithm are presented in this paper.
Název v anglickém jazyce
Dynamic Threshold Selection Method for Multi-label Newspaper Topic Identification
Popis výsledku anglicky
Nowadays, the multi-label classification is increasingly required in modern categorization systems. It is especially essential in the task of newspaper article topics identification. This paper presents a method based on general topic model normalisationfor finding a threshold defining the boundary between the "correct" and the "incorrect" topics of a newspaper article. The proposed method is used to improve the topic identification algorithm which is a part of a complex system for acquisition and storing large volumes of text data. The topic identification module uses the Naive Bayes classifier for the multiclass and multi-label classification problem and assigns to each article the topics from a defined quite extensive topic hierarchy - it containsabout 450 topics and topic categories. The results of the experiments with the improved topic identification algorithm are presented in this paper.

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
JD - Využití počítačů, robotika a její aplikace
OECD FORD obor
—

Návaznosti výsledku

Projekt
<a href="/cs/project/LM2010013" target="_blank" >LM2010013: LINDAT-CLARIN: Institut pro analýzu, zpracování a distribuci lingvistických dat</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach

Ostatní

Rok uplatnění
2013
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
Text, Speech and Dialogue
ISBN
978-3-642-40584-6
ISSN
0302-9743
e-ISSN
—
Počet stran výsledku
8
Strana od-do
209-216
Název nakladatele
Springer
Místo vydání
Heidelberg
Místo konání akce
Plzeň
Datum konání akce
1. 9. 2013
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—

Podobné výsledky(10)

Comparison of Score Normalization Methods Applied to Multi-label Classification Score Normalization Methods Applied to Topic Identification First Experiments with Relevant Documents Selection for Blind Relevance Feedback in Spoken Document Retrieval

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Dynamic Threshold Selection Method for Multi-label Newspaper Topic Identification

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)