An Architecture for Scientific Document Retrieval Using Textual and Math Entailment Modules
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F14%3A00077458" target="_blank" >RIV/00216224:14330/14:00077458 - isvavai.cz</a>
Result on the web
<a href="https://dx.doi.org/10.13140/2.1.4036.2561" target="_blank" >https://dx.doi.org/10.13140/2.1.4036.2561</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.13140/2.1.4036.2561" target="_blank" >10.13140/2.1.4036.2561</a>
Alternative languages
Result language
English
Original language name
An Architecture for Scientific Document Retrieval Using Textual and Math Entailment Modules
Original language description
We present an architecture for scientific document retrieval. An existing system for textual and math-aware retrieval, Math Indexer and Searcher (MIaS), is designed to be extended with modules for textual and math-aware entailment. The goal is to increase the quality of retrieval (precision and recall) by handling natural language variations that express the same meaning in texts and/or formulae. The entailment modules are designed to use several ordered layers of processing on the lexical, syntactic, and semantic levels, using natural language processing tools adapted for handling tree structures such as mathematical formulae. If these tools cannot decide on the entailment, generic knowledge databases are used, deploying distributional semantics methods and tools. It is shown that the sole use of distributional semantics for semantic textual entailment decisions at the sentence level is surprisingly good. Finally, further research plans to deploy the results in digital mathematical libraries are outlined.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/LG13010" target="_blank" >LG13010: Czech Republic representation in the European Research Consortium for Informatics and Mathematics (ERCIM)</a><br>
Continuities
P - Research and development project financed from public funds (with a link to CEP)<br>S - Specific research at universities
Others
Publication year
2014
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Eighth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2014
ISBN
—
ISSN
2336-4289
e-ISSN
—
Number of pages
11
Pages from-to
107-117
Publisher name
Tribun EU
Place of publication
Brno
Event location
Karlova Studánka
Event date
Jan 1, 2014
Type of event by nationality
CST - Nationwide event
UT code for WoS article
—