Vše

Co hledáte?

Vše
Projekty
Výsledky výzkumu
Subjekty

Rychlé hledání

  • Projekty podpořené TA ČR
  • Významné projekty
  • Projekty s nejvyšší státní podporou
  • Aktuálně běžící projekty

Chytré vyhledávání

  • Takto najdu konkrétní +slovo
  • Takto z výsledků -slovo zcela vynechám
  • “Takto můžu najít celou frázi”

Kara1k: a karaoke dataset for cover song identification and singing voice analysis

Identifikátory výsledku

  • Kód výsledku v IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F17%3A10363153" target="_blank" >RIV/00216208:11320/17:10363153 - isvavai.cz</a>

  • Výsledek na webu

    <a href="http://ieeexplore.ieee.org/document/8241597/" target="_blank" >http://ieeexplore.ieee.org/document/8241597/</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.1109/ISM.2017.32" target="_blank" >10.1109/ISM.2017.32</a>

Alternativní jazyky

  • Jazyk výsledku

    angličtina

  • Název v původním jazyce

    Kara1k: a karaoke dataset for cover song identification and singing voice analysis

  • Popis výsledku v původním jazyce

    We introduce Kara1k, a new musical dataset composed of 2,000 analyzed songs thanks to a partnership with a karaoke company. The dataset is divided into 1,000 cover songs provided by Recisio Karafun application, and the corresponding 1,000 songs by the original artists. Kara1k is mainly dedicated toward cover song identification and singing voice analysis. For both tasks, it offers novel approaches, as each cover song is a studio-recorded song with the same arrangement as the original recording, but with different singers and musicians. Essentia, harmony-analyser, Marsyas, Vamp plugins and YAAFE have been used to extract audio features for each track in Kara1k. We provide metadata such as the title, genre, original artist, year, International Standard Recording Code and the ground truths for the singer&apos;s gender, backing vocals, duets and lyrics&apos; language. Additionally, we provide the instrumental track and the pure singing voice track for each cover song. We showcase two use-case experiments for Kara1k. In the cover song identification task using the Dynamic Time Warping method, we provide a comparison of traditional and new features: chroma and MFCC features, chords and keys, and chroma and chord distances. We obtain 84-89% identification accuracy for three of the features, which justifies our focus on karaoke songs. In the supporting experiment on singer gender classification, we evaluate the difference in the performance in two conditions - a pure singing voice and the singing voice mixed with the background music. The Kara1k dataset is freely available under the KaraMIR project website.

  • Název v anglickém jazyce

    Kara1k: a karaoke dataset for cover song identification and singing voice analysis

  • Popis výsledku anglicky

    We introduce Kara1k, a new musical dataset composed of 2,000 analyzed songs thanks to a partnership with a karaoke company. The dataset is divided into 1,000 cover songs provided by Recisio Karafun application, and the corresponding 1,000 songs by the original artists. Kara1k is mainly dedicated toward cover song identification and singing voice analysis. For both tasks, it offers novel approaches, as each cover song is a studio-recorded song with the same arrangement as the original recording, but with different singers and musicians. Essentia, harmony-analyser, Marsyas, Vamp plugins and YAAFE have been used to extract audio features for each track in Kara1k. We provide metadata such as the title, genre, original artist, year, International Standard Recording Code and the ground truths for the singer&apos;s gender, backing vocals, duets and lyrics&apos; language. Additionally, we provide the instrumental track and the pure singing voice track for each cover song. We showcase two use-case experiments for Kara1k. In the cover song identification task using the Dynamic Time Warping method, we provide a comparison of traditional and new features: chroma and MFCC features, chords and keys, and chroma and chord distances. We obtain 84-89% identification accuracy for three of the features, which justifies our focus on karaoke songs. In the supporting experiment on singer gender classification, we evaluate the difference in the performance in two conditions - a pure singing voice and the singing voice mixed with the background music. The Kara1k dataset is freely available under the KaraMIR project website.

Klasifikace

  • Druh

    D - Stať ve sborníku

  • CEP obor

  • OECD FORD obor

    10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Návaznosti výsledku

  • Projekt

  • Návaznosti

    S - Specificky vyzkum na vysokych skolach<br>I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace

Ostatní

  • Rok uplatnění

    2017

  • Kód důvěrnosti údajů

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

  • Název statě ve sborníku

    2017 IEEE International Symposium on Multimedia (ISM)

  • ISBN

    978-1-5386-2937-6

  • ISSN

  • e-ISSN

    neuvedeno

  • Počet stran výsledku

    8

  • Strana od-do

    177-184

  • Název nakladatele

    IEEE

  • Místo vydání

    Taichung, Taiwan

  • Místo konání akce

    Taichung, Taiwan

  • Datum konání akce

    11. 12. 2017

  • Typ akce podle státní příslušnosti

    WRD - Celosvětová akce

  • Kód UT WoS článku