Information classification methods
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F60162694%3AG43__%2F11%3A00449336" target="_blank" >RIV/60162694:G43__/11:00449336 - isvavai.cz</a>
Result on the web
<a href="http://vavtest.unob.cz/registr" target="_blank" >http://vavtest.unob.cz/registr</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Information classification methods
Original language description
The goal of this article is to describe current methods useful for automated classification of electronic documents. Although there is lot of possible approaches for this task, the article is focused on methods used in daily practice like in anti-spam filters. Common procedures used on text are splitting text into words (tokenization), using Bayesian filters, hashes, regular expressions etc. This paper will discuss the principles and efficiency of this process and show some other approaches.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
KA - Militarism
OECD FORD branch
—
Result continuities
Project
—
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2011
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
International Conference on Military Technology ICMT 11
ISBN
978-80-7231-787-5
ISSN
—
e-ISSN
—
Number of pages
6
Pages from-to
1141-1146
Publisher name
University of Defence
Place of publication
Brno
Event location
Brno
Event date
Jan 1, 2011
Type of event by nationality
EUR - Evropská akce
UT code for WoS article
—