Sentence compression for the LSA-based summarizer

Result identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F06%3A00000632" target="_blank" >RIV/49777513:23520/06:00000632 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—

Alternative languages

Result language
English
Title in original language
Sentence compression for the LSA-based summarizer
Result description in original language
We present a simple sentence compression approach for our summarizer based on latent semantic analysis (LSA). The summarization method assesses each sentence by an LSA score. The compression algorithm removes unimportant clauses from a full sentence. First, a sentence is divided into clauses by the Charniak parser, then compression candidates are generated, and finally the best candidate is selected to represent the sentence. Each candidate receives an importance score that is directly proportional to its LSA score and inversely proportional to its length. We evaluated the approach in two ways. By intrinsic evaluation we found that the compressions produced by our algorithm are better than baseline ones but still worse than what humans can produce. Then we compared the resulting summaries with human abstracts using the standard n-gram-based ROUGE measure.
Title in English
Sentence compression for the LSA-based summarizer
Result description in English
We present a simple sentence compression approach for our summarizer based on latent semantic analysis (LSA). The summarization method assesses each sentence by an LSA score. The compression algorithm removes unimportant clauses from a full sentence. First, a sentence is divided into clauses by the Charniak parser, then compression candidates are generated, and finally the best candidate is selected to represent the sentence. Each candidate receives an importance score that is directly proportional to its LSA score and inversely proportional to its length. We evaluated the approach in two ways. By intrinsic evaluation we found that the compressions produced by our algorithm are better than baseline ones but still worse than what humans can produce. Then we compared the resulting summaries with human abstracts using the standard n-gram-based ROUGE measure.
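The candidate selection step described in the abstract can be illustrated with a minimal sketch. It assumes that each compression candidate already carries an LSA score computed elsewhere, that length is counted in whitespace-separated tokens, and that the importance score is simply the LSA score divided by the length. The abstract only states the direction of the two dependencies, so the exact formula, the names Candidate, importance and select_best, and the example scores are all hypothetical.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Candidate:
    """A compression candidate; lsa_score is assumed to be computed elsewhere."""
    text: str
    lsa_score: float


def importance(candidate: Candidate) -> float:
    """Assumed importance: LSA score divided by length in tokens."""
    length = len(candidate.text.split())
    if length == 0:
        return 0.0  # an empty candidate cannot represent the sentence
    return candidate.lsa_score / length


def select_best(candidates: Sequence[Candidate]) -> Candidate:
    """Return the candidate with the highest assumed importance score."""
    if not candidates:
        raise ValueError("at least one candidate is required")
    return max(candidates, key=importance)


# Illustrative values only; the LSA scores below are made up.
full = Candidate("the committee which met on friday approved the budget", 0.90)
short = Candidate("the committee approved the budget", 0.80)
best = select_best([full, short])
print(best.text)
```

With these made-up scores the shorter candidate wins, because dropping the relative clause costs little LSA score while nearly halving the length.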

Classification

Type
D - Article in proceedings
CEP field
JC - Computer hardware and software
OECD FORD field
—

Result linkages

Project
—
Linkages
S - Specific research at universities

Other

Year of implementation
2006
Data confidentiality code
S - Complete and truthful data on the project are not subject to protection under special legal regulations

Data specific to the result type

Title of the article in the proceedings
Information systems implementation and modelling
ISBN
80-86840-19-0
ISSN
—
e-ISSN
—
Number of pages of the result
8
Pages from-to
141-148
Publisher name
MARQ
Place of publication
Ostrava
Event location
Přerov
Event date
1. 1. 2006
Type of event by nationality
EUR - European event
UT WoS code of the article
—

Similar results (10)

SUTLER: update summarizER based on latent topics
Two uses of anaphora resolution in summarization
Evaluation measures for text summarization
