Math-aware Similarity of Papers in Digital Mathematics Libraries

The result's identifiers

  • Result code in IS VaVaI

    RIV/00216224:14330/14:00077987 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F14%3A00077987)

  • Result on the web

    http://dmv.ptm.org.pl/abstracts/19-r/19Sojka.pdf

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    English

  • Original language name

    Math-aware Similarity of Papers in Digital Mathematics Libraries

  • Original language description

    Exploratory, semantic similarity search is becoming widespread in digital libraries, and mathematical ones are no exception. For working mathematicians who use digital mathematics libraries (DMLs) such as the Czech Digital Mathematics Library (DML-CZ) or the European Digital Mathematics Library (EuDML), we have designed and implemented a math-aware similarity computation framework based on leading-edge topic modelling techniques implemented in the Gensim software package. Results of studies on the classification of math papers done for DML-CZ have been tested and deployed in EuDML, where the ten most semantically similar papers are computed and shown for a given paper. In the latest experiments, we are evaluating several possible representations of mathematical formulae for retrieving semantically similar papers. The quality of the similarity is measured by comparison with the similarity matrix induced from the Mathematics Subject Classification (MSC) codes with which every paper is marked up.

  • Czech name

  • Czech description
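
The workflow in the original language description (rank the ten most similar papers, then compare against a classification-based reference) can be illustrated with a minimal sketch. It is not the authors' implementation: the described framework relies on topic models from Gensim, whereas this sketch swaps in plain TF-IDF with cosine similarity to stay dependency-free. The function names, the toy corpus, the token `x^2` standing in for a formula representation, and the use of Jaccard overlap of MSC codes as the reference similarity are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build sparse TF-IDF vectors (dicts) from tokenized documents."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({t: c * math.log(n / df[t]) for t, c in tf.items()})
    return vectors


def cosine(u, v):
    """Cosine similarity of two sparse vectors; 0.0 if either is all-zero."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def top_k_similar(vectors, query, k=10):
    """Return indices of the k papers most similar to paper `query`."""
    scores = [(cosine(vectors[query], v), i)
              for i, v in enumerate(vectors) if i != query]
    # Highest score first; ties broken by lower paper index.
    scores.sort(key=lambda s: (-s[0], s[1]))
    return [i for _, i in scores[:k]]


def msc_similarity(codes_a, codes_b):
    """Jaccard overlap of two MSC code sets (an assumed reference measure)."""
    a, b = set(codes_a), set(codes_b)
    return len(a & b) / len(a | b) if a | b else 0.0


# Hypothetical toy corpus: tokens stand in for words and formula tokens.
papers = [
    ["group", "ring", "ideal", "x^2"],
    ["group", "ring", "module"],
    ["integral", "measure", "limit"],
    ["measure", "limit", "x^2"],
]
# Hypothetical MSC codes assigned to each paper.
msc = [["13A15"], ["13A15", "16D10"], ["28A25"], ["28A25"]]

vectors = tfidf_vectors(papers)
nearest = top_k_similar(vectors, query=0, k=2)
reference = [msc_similarity(msc[0], msc[i]) for i in nearest]
```

Here `nearest` lists the papers ranked closest to paper 0, and `reference` gives the MSC-based similarity of each of them to paper 0, so the two can be compared. In a real setting the TF-IDF vectors would be replaced by topic-model representations and `k` would default to ten, as in the description.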

Classification

  • Type

    O - Miscellaneous

  • CEP classification

    IN - Informatics

  • OECD FORD branch

Result continuities

  • Project

    LG13010: Czech Republic representation in the European Research Consortium for Informatics and Mathematics (ERCIM)

  • Continuities

    P - Research and development project financed from public funds (with a link to CEP)

Others

  • Publication year

    2014

  • Confidentiality

    S - Complete and truthful data on the project are not subject to protection under special legal regulations