Statistical Machine Learning for Complex Data Sets

Author : Xiaowu Dai
Total Pages : 183
Release : 2019
OCLC : 1101176634

Book Synopsis: Statistical Machine Learning for Complex Data Sets, by Xiaowu Dai

Statistical Machine Learning for Complex Data Sets was written by Xiaowu Dai and released in 2019, with 183 pages in total. Available in PDF, EPUB and Kindle.

Book excerpt: This thesis develops theory and computational methods for a set of problems involving complex data. Chapter 2 studies multivariate nonparametric prediction with gradient information. Gradients can be easily estimated in stochastic simulations and computer experiments. We propose a unified framework to incorporate noisy and correlated gradients into predictions. We show theoretically, through minimax optimal rates of convergence, that incorporating gradients tends to significantly improve predictions with deterministic or random designs. Chapter 3 proposes high-dimensional smoothing splines with applications to Alzheimer's disease (AD) prediction. While traditional prediction based on structural MRI uses imaging acquired at a single time point, a longitudinal study is more sensitive in detecting early pathological changes of AD. Our novel method can be applied to extract features from heterogeneous and longitudinal MRI for AD prediction, outperforming existing methods. Chapter 4 introduces a novel class of variable selection penalties called TWIN, which provides sensible data-adaptive penalization. Under a linear sparsity regime, we show that TWIN penalties have a high probability of selecting correct models and result in minimax optimal estimators. We demonstrate, in challenging and realistic simulation settings with high correlations between active and inactive variables, that TWIN has high power in variable selection while controlling the number of false discoveries, outperforming standard penalties. Chapter 5 investigates generalizations of mini-batch SGD in deep neural networks. We theoretically justify a hypothesis that large-batch SGD tends to converge to sharp minimizers by providing new properties of SGD. In particular, we give an explicit escaping time of SGD from a local minimum in the finite-time regime and prove that SGD tends to converge to flatter minima in the asymptotic regime (although it may take exponential time to converge) regardless of the batch size. Chapter 6 provides another look at statistical calibration problems in computer models. This viewpoint is inspired by two overarching practical considerations: (i) many computer models are inadequate for perfectly modeling physical systems; (ii) only a finite amount of data is available from physical experiments to calibrate related computer models. We provide a non-asymptotic theory and derive a novel prediction-oriented calibration method.
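As a rough illustration of the calibration setting summarized for Chapter 6 (an inadequate computer model tuned against a finite set of noisy physical observations), the following minimal sketch fits a single parameter by ordinary least squares. It is not the thesis's prediction-oriented method; the functions computer_model and physical_system, the noise level, and the sample size are all hypothetical choices made purely for illustration.

```python
import numpy as np


def computer_model(x, theta):
    # Hypothetical, deliberately inadequate computer model: a line through the origin.
    return theta * x


def physical_system(x):
    # Hypothetical "true" physical response; the model above cannot match it exactly.
    return 2.0 * x + 0.3 * np.sin(4.0 * x)


def mse(theta, x, y):
    # Mean squared prediction error of the computer model on the physical data.
    return float(np.mean((y - computer_model(x, theta)) ** 2))


# A finite number of noisy physical observations (assumed design and noise level).
rng = np.random.default_rng(0)
x = rng.uniform(0.0, 1.0, size=30)
y = physical_system(x) + rng.normal(0.0, 0.05, size=30)

# The model is linear in theta, so the least-squares estimate has a closed form.
theta_hat = float(np.dot(x, y) / np.dot(x, x))

print("calibrated theta:", theta_hat)
print("prediction MSE at theta_hat:", mse(theta_hat, x, y))
```

Because the model is misspecified, no value of theta drives the error to zero; the estimate is simply the value that predicts the observed data best in the squared-error sense.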


Statistical Machine Learning for Complex Data Sets Related Books

Statistical Learning of Complex Data
Language: en
Pages: 201
Authors: Francesca Greselin
Categories: Mathematics
Type: BOOK - Published: 2019-09-06 - Publisher: Springer Nature

This book of peer-reviewed contributions presents the latest findings in classification, statistical learning, data analysis and related areas, including superv

An Introduction to Statistical Learning
Language: en
Pages: 617
Authors: Gareth James
Categories: Mathematics
Type: BOOK - Published: 2023-08-01 - Publisher: Springer Nature

An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast

Statistical Machine Learning for Complex Data Sets
Language: en
Pages: 183
Authors: Xiaowu Dai
Categories:
Type: BOOK - Published: 2019 - Publisher:

This thesis is focused on developing theory and computational methods for a set of problems involving complex data. Chapter 2 studies multivariate nonparametric

Mastering Machine Learning with R
Language: en
Pages: 400
Authors: Cory Lesmeister
Categories: Computers
Type: BOOK - Published: 2015-10-28 - Publisher: Packt Publishing Ltd

Master machine learning techniques with R to deliver insights for complex projects About This Book Get to grips with the application of Machine Learning methods

An Introduction to Statistical Learning
Language: en
Pages: 607
Authors: Gareth James
Categories: Mathematics
Type: BOOK - Published: 2021-07-29 - Publisher: Springer Nature

An Introduction to Statistical Learning provides an accessible overview of the field of statistical learning, an essential toolset for making sense of the vast