All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Tēzaurs.lv – the experience of building a multifunctional lexical resource

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AZRR9Q9NK" target="_blank" >RIV/00216208:11320/23:ZRR9Q9NK - isvavai.cz</a>

  • Result on the web

    <a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85171388807&partnerID=40&md5=0285490d60065d8f39cdc47996d54330" target="_blank" >https://www.scopus.com/inward/record.uri?eid=2-s2.0-85171388807&partnerID=40&md5=0285490d60065d8f39cdc47996d54330</a>

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    angličtina

  • Original language name

    Tēzaurs.lv – the experience of building a multifunctional lexical resource

  • Original language description

    "In this paper, we describe our findings from developing the lexicographic platform Tēzaurs.lv, extending it from a traditional explanatory dictionary into a multifunctional resource for structured lexical data. Tēzaurs.lv is the largest Latvian dictionary with more than 390,000 entries, which emerged as a compilation from nearly 300 prior dictionaries and other sources. Recently, it has been extended with Latvian WordNet data, effectively making it also a synonym dictionary and a translation dictionary. Each entry can contain multiple lexemes with their grammatical information and inflection tables, enabling search on inflection forms and spelling variants. For the new requirements, we have developed a lexical database system and a collaborative online editor toolkit, which are also used for two other major Latvian dictionaries. While previously the data model and tools were based on what the end user would see in a dictionary entry, the current infrastructure is designed with a highly structured lexical data model. This avoids duplication and helps to ensure consistency if entries or word senses are edited or merged, and it supports the usage of this data in computational linguistics. © 2023 Lexical Computing CZ s.r.o.. All rights reserved."

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

  • OECD FORD branch

    10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Result continuities

  • Project

  • Continuities

Others

  • Publication year

    2023

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    "Proc. Electron. lexicogr. 21st cent. Conf."

  • ISBN

  • ISSN

    2533-5626

  • e-ISSN

  • Number of pages

    19

  • Pages from-to

    410-428

  • Publisher name

    Lexical Computing CZ s.r.o.

  • Place of publication

  • Event location

    Singapore

  • Event date

    Jan 1, 2023

  • Type of event by nationality

    WRD - Celosvětová akce

  • UT code for WoS article