All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

The Unreasonable Effectiveness of Pattern Generation

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F19%3A00112031" target="_blank" >RIV/00216224:14330/19:00112031 - isvavai.cz</a>

  • Result on the web

    <a href="https://doi.org/10.5300/2019-1-4/73" target="_blank" >https://doi.org/10.5300/2019-1-4/73</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.5300/2019-1-4/73" target="_blank" >10.5300/2019-1-4/73</a>

Alternative languages

  • Result language

    angličtina

  • Original language name

    The Unreasonable Effectiveness of Pattern Generation

  • Original language description

    Languages are constantly evolving, and so are their hyphenation rules and needs. The effectiveness and utility of TeX’s hyphenation have been proven by its usage in almost all typesetting systems in use today. The current Czech hyphenation patterns were generated in 1995, and no hyphenated word database was freely available. We have developed a new Czech word database and have used the patgen program to generate new effective Czech hyphenation patterns efficiently and evaluated their generalization qualities. We have achieved full coverage on the training dataset of 3,000,000 words and developed a validation procedure of new patterns for Czech based on the testing database of 105,000 words approved by the Czech Academy of Science linguists. Our pattern generation case study exemplifies a practical solution to the widespread dictionary problem. The study has proved the versatility, effectiveness, and extensibility of Liang’s approach to hyphenation developed for TeX. The unreasonable effectiveness of pattern technology has led to applications that are and will be used, even more widely now, nearly 40 years after its inception.

  • Czech name

  • Czech description

Classification

  • Type

    J<sub>ost</sub> - Miscellaneous article in a specialist periodical

  • CEP classification

  • OECD FORD branch

    20206 - Computer hardware and architecture

Result continuities

  • Project

  • Continuities

    I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace

Others

  • Publication year

    2019

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Name of the periodical

    Zpravodaj CSTUG

  • ISSN

    1211-6661

  • e-ISSN

  • Volume of the periodical

    29

  • Issue of the periodical within the volume

    1-4

  • Country of publishing house

    CZ - CZECH REPUBLIC

  • Number of pages

    14

  • Pages from-to

    73-86

  • UT code for WoS article

  • EID of the result in the Scopus database