Evaluation of Digital Watermarking on Subjective Speech Quality
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F21%3A00351683" target="_blank" >RIV/68407700:21230/21:00351683 - isvavai.cz</a>
Result on the web
<a href="https://doi.org/10.1038/s41598-021-99811-x" target="_blank" >https://doi.org/10.1038/s41598-021-99811-x</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1038/s41598-021-99811-x" target="_blank" >10.1038/s41598-021-99811-x</a>
Alternative languages
Result language
English
Original language name
Evaluation of Digital Watermarking on Subjective Speech Quality
Original language description
New methods of securing the distribution of audio content have been widely deployed in the last twenty years. Their impact on perceived quality has, however, seldom been the subject of extensive recent research. We review the state of the art in digital speech watermarking and provide subjective testing of watermarked speech samples. The latest speech watermarking techniques are listed, with their specifics and potential for further development. Their current and possible applications are evaluated. Open-source software designed to embed watermarking patterns in audio files is used to produce a set of samples that satisfies the requirements of modern subjective speech-quality assessments. The patchwork algorithm that is coded in the application is mainly considered in this analysis. Different watermark robustness levels are used, which allows the threshold of detection by human listeners to be determined. The subjective listening tests are conducted following ITU-T Recommendation P.800, which precisely defines the conditions and requirements for subjective testing. Further analysis tries to determine the effects of noise and various disturbances on the perceived quality of watermarked speech. A threshold of intelligibility is estimated to open the way for further work on speech compression techniques combined with watermarking. The impact of language or social background is evaluated through an additional experiment involving two groups of listeners. Results show significant robustness of the watermarking implementation, retaining both a reasonable net subjective audio quality and security attributes, despite mild levels of distortion and noise. Extended experiments with Chinese listeners open the door to formulating a hypothesis on perception variations with geographical and social background.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
20202 - Communication engineering and systems
Result continuities
Project
—
Continuities
S - Specific university research
Others
Publication year
2021
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
Scientific Reports
ISSN
2045-2322
e-ISSN
2045-2322
Volume of the periodical
11
Issue of the periodical within the volume
10
Country of publishing house
GB - UNITED KINGDOM
Number of pages
11
Pages from-to
—
UT code for WoS article
000706830500021
EID of the result in the Scopus database
2-s2.0-85117375690