Latest mod news

mod

Statistical inference for model parameters in stochastic gradient descent

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Xi Chen, Jason D. Lee, Xin T. Tong, Yichen Zhang.

Source: The Annals of Statistics, Volume 48, Number 1, 251--273.

Abstract:
The stochastic gradient descent (SGD) algorithm has been widely used in statistical estimation for large-scale data due to its computational and memory efficiency. While most existing works focus on the convergence of the objective function or the error of the obtained solution, we investigate the problem of statistical inference of true model parameters based on SGD when the population loss function is strongly convex and satisfies certain smoothness conditions. Our main contributions are twofold. First, in the fixed dimension setup, we propose two consistent estimators of the asymptotic covariance of the average iterate from SGD: (1) a plug-in estimator, and (2) a batch-means estimator, which is computationally more efficient and only uses the iterates from SGD. Both proposed estimators allow us to construct asymptotically exact confidence intervals and hypothesis tests. Second, for high-dimensional linear regression, using a variant of the SGD algorithm, we construct a debiased estimator of each regression coefficient that is asymptotically normal. This gives a one-pass algorithm for computing both the sparse regression coefficients and confidence intervals, which is computationally attractive and applicable to online data.

Statistical inference for model parameters in stochastic gradient descent

Optimal rates for community estimation in the weighted stochastic block model

Model assisted variable clustering: Minimax-optimal recovery and algorithms

Minimax posterior convergence rates and model selection consistency in high-dimensional DAG models based on sparse Cholesky factors

On optimal designs for nonregular models

Statistical inference for autoregressive models under heteroscedasticity of unknown form

Adaptive estimation of the rank of the coefficient matrix in high-dimensional multivariate response regression models

Additive models with trend filtering

Inference for the mode of a log-concave density

Projected spline estimation of the nonparametric function in high-dimensional partially linear models for massive data

Eigenvalue distributions of variance components estimators in high-dimensional random effects models

Exact lower bounds for the agnostic probably-approximately-correct (PAC) machine learning model

An operator theoretic approach to nonparametric mixture models

Linear hypothesis testing for high dimensional generalized linear models

Property testing in high-dimensional Ising models

Dynamic network models and graphon estimation

Correction: Sensitivity analysis for an unobserved moderator in RCT-to-target-population generalization of treatment effects

Bayesian mixed effects models for zero-inflated compositions in microbiome data analysis

A hierarchical dependent Dirichlet process prior for modelling bird migration patterns in the UK

Estimating causal effects in studies of human brain function: New models, methods and estimands

Regression for copula-linked compound distributions with applications in modeling aggregate insurance claims

Modeling wildfire ignition origins in southern California using linear network point processes

Optimal asset allocation with multivariate Bayesian dynamic linear models

Feature selection for generalized varying coefficient mixed-effect models with application to obesity GWAS

Bayesian factor models for probabilistic cause of death assessment with verbal autopsies

A hierarchical Bayesian model for predicting ecological interactions using scaled evolutionary relationships

Modifying the Chi-square and the CMH test for population genetic inference: Adapting to overdispersion

Modeling microbial abundances and dysbiosis with beta-binomial regression

SHOPPER: A probabilistic model of consumer choice with substitutes and complements

Hierarchical infinite factor models for improving the prediction of surgical complications for geriatric patients

Objective Bayes model selection of Gaussian interventional essential graphs for the identification of signaling pathways

Fitting a deeply nested hierarchical model to a large book review dataset using a moment-based estimator

Spatial modeling of trends in crime over time in Philadelphia

Microsimulation model calibration using incremental mixture approximate Bayesian computation

Prediction of small area quantiles for the conservation effects assessment project using a mixed effects quantile regression model

Joint model of accelerated failure time and mechanistic nonlinear model for censored covariates, with application in HIV/AIDS

Fire seasonality identification with multimodality tests

Estimating abundance from multiple sampling capture-recapture data via a multi-state multi-period stopover model

A semiparametric modeling approach using Bayesian Additive Regression Trees with an application to evaluate heterogeneous treatment effects

Bayesian modeling of the structural connectome for studying Alzheimer’s disease

Incorporating conditional dependence in latent class models for probabilistic record linkage: Does it matter?

A hierarchical Bayesian model for single-cell clustering using RNA-sequencing data

A Bayesian mark interaction model for analysis of tumor pathology images

Sequential decision model for inference and prediction on nonuniform hypergraphs with application to knot matching from computational forestry

RCRnorm: An integrated system of random-coefficient hierarchical regression models for normalizing NanoString nCounter data

Modeling seasonality and serial dependence of electricity price curves with warping functional autoregressive dynamics

Network modelling of topological domains using Hi-C data

A hidden Markov model approach to characterizing the photo-switching behavior of fluorophores

Imputation and post-selection inference in models with missing data: An application to colorectal cancer surveillance guidelines

Introduction to papers on the modeling and analysis of network data—II

Propex to Showcase Isis Modular Tile Backing at FloorTek Expo

Image modeling for biomedical organs

The challenges of debate moderating have grown along with partisan differences

Minecraft's business model is 'leave users alone' — will it be Microsoft's?

Indonesia's Indosat, GoTo launch local language AI model

Additional Accommodation Information

Leveraging Focused Ultrasound to Drive Tissue Regeneration via On-Demand Modulation of Microenvironmental Cues (November 14, 2024 12:00pm)

CJS Noon Lecture Series | Tyrannical Tigers and Endangered Cats: Why Are the Korean Scholar-Bureaucrats Always So Important in Modern Japan? (November 14, 2024 12:00pm)

OEMs See Growth in Single-Zone Ductless Market as Homeowners Expand, Remodel

New Deep Sensing Models Join General's Meter Line

Fed's Logan: Models show that Fed funds could be 'very close' to neutral

Charting Your Sales & Use Tax Business Model

OSCE Project Co-ordinator calls for joint efforts in supporting voices of moderation, promoting dialogue and restoring trust

On World Anti-Trafficking Day, OSCE calls for comprehensive, co-ordinated and victim-centred approach to combatting modern-day slavery

Reimagining the E2T2 Program for the Modern Era - Federation of American Scientists

Subscribe To Our Newsletter