Towards Universal Image Embeddings: A Large-Scale Dataset and Challenge for Generic Image Representations

Popis výsledku

—

Klíčová slova

Identifikátory výsledku

Kód výsledku v IS VaVaI
RIV/68407700:21230/23:00370772 - isvavai.cz
Výsledek na webu
https://doi.org/10.1109/ICCV51070.2023.01037
DOI - Digital Object Identifier
10.1109/ICCV51070.2023.01037

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Towards Universal Image Embeddings: A Large-Scale Dataset and Challenge for Generic Image Representations
Popis výsledku v původním jazyce
Fine-grained and instance-level recognition methods are commonly trained and evaluated on specific domains, in a model per domain scenario. Such an approach, however, is impractical in real large-scale applications. In this work, we address the problem of universal image embedding, where a single universal model is trained and used in multiple domains. First, we leverage existing domain-specific datasets to carefully construct a new large-scale public benchmark for the evaluation of universal image embeddings, with 241k query images, 1.4M index images and 2.8M training images across 8 different domains and 349k classes. We define suitable metrics, training and evaluation protocols to foster future research in this area. Second, we provide a comprehensive experimental evaluation on the new dataset, demonstrating that existing approaches and simplistic extensions lead to worse performance than an assembly of models trained for each domain separately. Finally, we conducted a public research competition on this topic, leveraging industrial datasets, which attracted the participation of more than 1k teams world-wide. This exercise generated many interesting research ideas and findings which we present in detail. Project webpage: https://cmp.felk.cvut.cz/univ_emb/
Název v anglickém jazyce
Towards Universal Image Embeddings: A Large-Scale Dataset and Challenge for Generic Image Representations
Popis výsledku anglicky
Fine-grained and instance-level recognition methods are commonly trained and evaluated on specific domains, in a model per domain scenario. Such an approach, however, is impractical in real large-scale applications. In this work, we address the problem of universal image embedding, where a single universal model is trained and used in multiple domains. First, we leverage existing domain-specific datasets to carefully construct a new large-scale public benchmark for the evaluation of universal image embeddings, with 241k query images, 1.4M index images and 2.8M training images across 8 different domains and 349k classes. We define suitable metrics, training and evaluation protocols to foster future research in this area. Second, we provide a comprehensive experimental evaluation on the new dataset, demonstrating that existing approaches and simplistic extensions lead to worse performance than an assembly of models trained for each domain separately. Finally, we conducted a public research competition on this topic, leveraging industrial datasets, which attracted the participation of more than 1k teams world-wide. This exercise generated many interesting research ideas and findings which we present in detail. Project webpage: https://cmp.felk.cvut.cz/univ_emb/

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Návaznosti výsledku

Projekt
EF16_019/0000765: Výzkumné centrum informatiky
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
S - Specificky vyzkum na vysokych skolach

Ostatní

Rok uplatnění
2023
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
ICCV2023: Proceedings of the International Conference on Computer Vision
ISBN
979-8-3503-0719-1
ISSN
1550-5499
e-ISSN
2380-7504
Počet stran výsledku
12
Strana od-do
11256-11267
Název nakladatele
IEEE
Místo vydání
Piscataway
Místo konání akce
Paris
Datum konání akce
2. 10. 2023
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
001169499003067

Základní informace

Druh výsledku

D - Stať ve sborníku

OECD FORD

Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Rok uplatnění

2023

Podobné výsledky(10)

IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization Generating Synthetic Depth Image Dataset for Industrial Applications of Hand Localization MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Sdílet výsledky vyhledávání

Towards Universal Image Embeddings: A Large-Scale Dataset and Challenge for Generic Image Representations

Popis výsledku

Klíčová slova

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Základní informace

Podobné výsledky(10)