All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Data normalization and scaling: Consequences for the analysis in omics sciences

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989592%3A15310%2F18%3A73589128" target="_blank" >RIV/61989592:15310/18:73589128 - isvavai.cz</a>

  • Result on the web

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    angličtina

  • Original language name

    Data normalization and scaling: Consequences for the analysis in omics sciences

  • Original language description

    The main task in the analysis of omics data is to understand biological information in the data. From a statistical point of view, classification analysis is one of the goals. If the data are consisting of groups of, e.g., controls and patients, the accurate prediction of new samples is desirable. To understand the processes in the human body or in other organisms, the interpretation of the model is necessary. In the two-group setting, the information about important features is one of the main tasks in omics disciplines. In metabolomics, the problem is called biomarker identification, while in genetics this is called fold changes problem, where it is examined for a feature, how many times the average concentration in one group is higher/lower than for the other group. In statistics, this is often referred to as the feature selection problem. Section 4 analyzes the impact of pretreatment methods on publicly available real-world data sets in terms of classification and feature selection analysis. As an example of omics disciplines, the data sets are originating from the metabolomics field. Section 5 discusses and summarizes the main findings and provides some overall recommendations.

  • Czech name

  • Czech description

Classification

  • Type

    C - Chapter in a specialist book

  • CEP classification

  • OECD FORD branch

    10103 - Statistics and probability

Result continuities

  • Project

    <a href="/en/project/GF15-34613L" target="_blank" >GF15-34613L: Statistics in metabolomics for biomarker research in medicine</a><br>

  • Continuities

    P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

  • Publication year

    2018

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Book/collection name

    Data analysis for omics sciences: Methods and applications

  • ISBN

    978-0-444-64044-4

  • Number of pages of the result

    32

  • Pages from-to

    165-196

  • Number of pages of the book

    706

  • Publisher name

    Elsevier

  • Place of publication

    Amsterdam

  • UT code for WoS chapter