Indexing and Searching Mathematics in Digital Libraries -- Architecture, Design and Scalability Issues
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F11%3A00052712" target="_blank" >RIV/00216224:14330/11:00052712 - isvavai.cz</a>
Alternative codes found
RIV/00216224:14330/11:00067289
Result on the web
<a href="http://dx.doi.org/10.1007/978-3-642-22673-1_16" target="_blank" >http://dx.doi.org/10.1007/978-3-642-22673-1_16</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-642-22673-1_16" target="_blank" >10.1007/978-3-642-22673-1_16</a>
Alternative languages
Result language
angličtina
Original language name
Indexing and Searching Mathematics in Digital Libraries -- Architecture, Design and Scalability Issues
Original language description
This paper surveys approaches and systems for searching mathematical formulae in mathematical corpora and on the web. The design and architecture of our MIaS (Math Indexer and Searcher) system is presented, and our design decisions are discussed in detail. An approach based on Presentation MathML using a similarity of math subformulae is suggested and verified by implementing it as a math-aware search engine based on the state-of-the-art system, Apache Lucene. Scalability issues were checked based on 324,000 real scientific documents from arXiv archive with 112 million mathematical formulae. More than two billions MathML subformulae were indexed using our Solr-compatible Lucene extension.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/LA09016" target="_blank" >LA09016: Czech Republic membership in the European Research Consortium for Informatics and Mathematics (ERCIM)</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Others
Publication year
2011
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Intelligent Computer Mathematics Lecture Notes in Computer Science, 2011, Volume 6824/2011
ISBN
978-3-642-22672-4
ISSN
—
e-ISSN
—
Number of pages
15
Pages from-to
228-243
Publisher name
Springer
Place of publication
Berlin / Heidelberg
Event location
Bertinoro, Italy
Event date
Jul 18, 2011
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—