All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F24%3A00372774" target="_blank" >RIV/68407700:21230/24:00372774 - isvavai.cz</a>

  • Result on the web

    <a href="https://doi.org/10.1007/s11263-023-01982-9" target="_blank" >https://doi.org/10.1007/s11263-023-01982-9</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.1007/s11263-023-01982-9" target="_blank" >10.1007/s11263-023-01982-9</a>

Alternative languages

  • Result language

    angličtina

  • Original language name

    HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

  • Original language description

    Visual localization is critical to many applications in computer vision and robotics. To address single-image RGB localization, state-of-the-art feature-based methods match local descriptors between a query image and a pre-built 3D model. Recently, deep neural networks have been exploited to regress the mapping between raw pixels and 3D coordinates in the scene, and thus the matching is implicitly performed by the forward pass through the network. However, in a large and ambiguous environment, learning such a regression task directly can be difficult for a single network. In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image. The proposed method, which is an extension of HSCNet, allows us to train compact models which scale robustly to large environments. It sets a new state-of-the-art for single-image localization on the 7-Scenes, 12-Scenes, Cambridge Landmarks datasets, and the combined indoor scenes.

  • Czech name

  • Czech description

Classification

  • Type

    J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database

  • CEP classification

  • OECD FORD branch

    10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Result continuities

  • Project

    Result was created during the realization of more than one project. More information in the Projects tab.

  • Continuities

    P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

  • Publication year

    2024

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Name of the periodical

    International Journal of Computer Vision

  • ISSN

    0920-5691

  • e-ISSN

    1573-1405

  • Volume of the periodical

    132

  • Issue of the periodical within the volume

    7

  • Country of publishing house

    NL - THE KINGDOM OF THE NETHERLANDS

  • Number of pages

    21

  • Pages from-to

    2530-2550

  • UT code for WoS article

    001156667100002

  • EID of the result in the Scopus database

    2-s2.0-85187172970