Normalization of Neural Networks using Analytic Variance Propagation
Result identifiers
Result code in IS VaVaI
RIV/68407700:21230/18:00326237 (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F18%3A00326237)
Result on the web
https://arxiv.org/abs/1803.10560
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Title in original language
Normalization of Neural Networks using Analytic Variance Propagation
Description in original language
We address the problem of estimating statistics of hidden units in a neural network using a method of analytic moment propagation. These statistics are useful for approximate whitening of the inputs to saturating non-linearities such as the sigmoid function. This matters for the initialization of training and for reducing accumulated scale and bias dependencies (compensating covariate shift), which presumably eases learning. Batch normalization, currently a very widely applied technique, uses sample estimates of hidden-unit statistics computed over a batch. The proposed estimation instead propagates the mean and variance of the training set analytically through the network. The result depends on the network structure and its current weights, but not on a specific batch of inputs. The estimates are suitable for initialization and normalization, efficient to compute, and independent of the batch size. Experimental verification supports these claims well. However, the method does not share the generalization properties of BN, into which our experiments give some additional insight.
Title in English
Normalization of Neural Networks using Analytic Variance Propagation
Description in English
We address the problem of estimating statistics of hidden units in a neural network using a method of analytic moment propagation. These statistics are useful for approximate whitening of the inputs to saturating non-linearities such as the sigmoid function. This matters for the initialization of training and for reducing accumulated scale and bias dependencies (compensating covariate shift), which presumably eases learning. Batch normalization, currently a very widely applied technique, uses sample estimates of hidden-unit statistics computed over a batch. The proposed estimation instead propagates the mean and variance of the training set analytically through the network. The result depends on the network structure and its current weights, but not on a specific batch of inputs. The estimates are suitable for initialization and normalization, efficient to compute, and independent of the batch size. Experimental verification supports these claims well. However, the method does not share the generalization properties of BN, into which our experiments give some additional insight.
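To illustrate the general idea of analytic moment propagation, here is a minimal NumPy sketch, not the paper's exact formulas. It propagates mean and variance exactly through a fully connected layer (under an independence assumption on the unit activations) and approximately through a sigmoid; the sigmoid moments use a standard probit-style approximation of the logistic-normal integral and a first-order delta-method variance, which may differ from the approximations used in the paper. All function names are illustrative.

import numpy as np

def propagate_affine(mean, var, W, b):
    # Exact moments of y = W @ x + b, assuming independent input coordinates.
    return W @ mean + b, (W ** 2) @ var

def propagate_sigmoid(mean, var):
    # Approximate moments of sigmoid(x) for x ~ N(mean, var):
    # mean via the probit approximation sigma(mu / sqrt(1 + pi*var/8)),
    # variance via a first-order (delta-method) expansion around the mean.
    out_mean = 1.0 / (1.0 + np.exp(-mean / np.sqrt(1.0 + np.pi * var / 8.0)))
    s = 1.0 / (1.0 + np.exp(-mean))
    return out_mean, (s * (1.0 - s)) ** 2 * var

def normalize_layer(W, b, in_mean, in_var):
    # Rescale weights and bias so pre-activations have approximately
    # zero mean and unit variance under the propagated input statistics.
    pre_mean, pre_var = propagate_affine(in_mean, in_var, W, b)
    scale = 1.0 / np.sqrt(pre_var + 1e-8)
    return W * scale[:, None], (b - pre_mean) * scale

rng = np.random.default_rng(0)
mean, var = np.zeros(4), np.ones(4)   # assumed training-set input statistics
for n_out in (8, 8):                  # two hidden layers
    W, b = rng.normal(size=(n_out, mean.size)), np.zeros(n_out)
    W, b = normalize_layer(W, b, mean, var)                 # batch-independent whitening
    pre_mean, pre_var = propagate_affine(mean, var, W, b)   # now ~ (0, 1)
    mean, var = propagate_sigmoid(pre_mean, pre_var)        # stats for the next layer

Because the normalization depends only on the weights and the training-set input statistics, it can be recomputed after weight updates without drawing a batch, which is what makes such estimates independent of the batch size.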
Classification
Type
D - Conference proceedings paper
CEP field
—
OECD FORD field
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result linkages
Project
The result was created during work on multiple projects. More information is available on the Projects tab.
Linkages
P - Research and development project financed from public funds (with a link to CEP)
Others
Year of implementation
2018
Data confidentiality code
S - Complete and accurate data on the project are not subject to protection under special legal regulations
Data specific to the result type
Proceedings paper title
Proceedings of the 23rd Computer Vision Winter Workshop
ISBN
978-80-270-3395-9
ISSN
—
e-ISSN
—
Number of pages
9
Pages from-to
45-53
Publisher name
Czech Society for Cybernetics and Informatics
Place of publication
Praha
Event venue
Český Krumlov
Event date
February 5, 2018
Event type by nationality
WRD - Worldwide event
Article UT WoS code
—