A software framework for text mining algorithms

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F62156489%3A43110%2F10%3A00169296" target="_blank" >RIV/62156489:43110/10:00169296 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—

Result language
English
Title in the original language
A software framework for text mining algorithms
Result description in the original language
The framework provides functionality for transforming textual data into a vector representation suitable for various machine learning or classification algorithms. The entire process is highly parametrized, and the produced results match the actual needs of the user. The user can, for instance, choose from three possible representations of the weights of individual words in the vectors (Term Presence, Term Frequency, and the TF-IDF weighting scheme) and three possible output formats (C5, ARFF, and a generic format), force the algorithm to work with only a certain dictionary, and specify the minimal word length and frequency.
Title in English
A software framework for text mining algorithms
Result description in English
The framework provides functionality for transforming textual data into a vector representation suitable for various machine learning or classification algorithms. The entire process is highly parametrized, and the produced results match the actual needs of the user. The user can, for instance, choose from three possible representations of the weights of individual words in the vectors (Term Presence, Term Frequency, and the TF-IDF weighting scheme) and three possible output formats (C5, ARFF, and a generic format), force the algorithm to work with only a certain dictionary, and specify the minimal word length and frequency.
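
The parametrized transformation described above can be illustrated with a minimal Python sketch. It is not the framework's actual code or API: the function names `vectorize` and `to_arff`, the parameter names, the tokenization rule, the unsmoothed IDF formula `log(N / df)`, and the reading of "minimal frequency" as a corpus-wide word count are all assumptions made for illustration. Only the ARFF output is shown; the C5 and generic formats are omitted.

```python
import math
import re
from collections import Counter


def vectorize(docs, weighting="tfidf", min_length=1, min_freq=1, dictionary=None):
    """Hypothetical helper: turn raw texts into (vocabulary, weight vectors)."""
    # Lowercase, split on non-letters, drop words shorter than min_length.
    tokenized = [
        [w for w in re.findall(r"[a-z]+", d.lower()) if len(w) >= min_length]
        for d in docs
    ]
    # Optionally keep only words from a user-supplied dictionary.
    if dictionary is not None:
        allowed = set(dictionary)
        tokenized = [[w for w in doc if w in allowed] for doc in tokenized]
    # Assumption: "minimal frequency" means total occurrences in the corpus.
    totals = Counter(w for doc in tokenized for w in doc)
    vocab = sorted(w for w, c in totals.items() if c >= min_freq)
    # Document frequency: in how many documents each word occurs.
    df = Counter(w for doc in tokenized for w in set(doc))
    n = len(docs)
    vectors = []
    for doc in tokenized:
        counts = Counter(doc)
        if weighting == "tp":        # Term Presence: 1 if the word occurs
            vec = [1 if counts[w] else 0 for w in vocab]
        elif weighting == "tf":      # Term Frequency: raw count
            vec = [counts[w] for w in vocab]
        elif weighting == "tfidf":   # TF-IDF with unsmoothed log(N / df)
            vec = [counts[w] * math.log(n / df[w]) for w in vocab]
        else:
            raise ValueError(f"unknown weighting: {weighting!r}")
        vectors.append(vec)
    return vocab, vectors


def to_arff(relation, vocab, vectors):
    """Hypothetical helper: serialize the vectors as a dense ARFF file."""
    lines = [f"@RELATION {relation}", ""]
    lines += [f"@ATTRIBUTE {w} NUMERIC" for w in vocab]
    lines += ["", "@DATA"]
    lines += [",".join(f"{x:g}" for x in vec) for vec in vectors]
    return "\n".join(lines)


docs = ["the cat sat", "the cat ran", "a dog ran fast"]
vocab, vectors = vectorize(docs, weighting="tf", min_length=3)
print(to_arff("demo", vocab, vectors))
```

Switching `weighting` between `"tp"`, `"tf"`, and `"tfidf"` changes only how each vector component is computed, while the dictionary, minimal-length, and minimal-frequency filters only shrink the vocabulary, so the two groups of parameters can be varied independently.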

Year of implementation
2010
Data confidentiality code
S - Complete and true data on the project are not subject to protection under special legal regulations
