en The two-to-infinity norm and singular subspace geometry with applications to high-dimensional statistics By projecteuclid.org Published On :: Fri, 02 Aug 2019 22:04 EDT Joshua Cape, Minh Tang, Carey E. Priebe. Source: The Annals of Statistics, Volume 47, Number 5, 2405--2439.Abstract: The singular value matrix decomposition plays a ubiquitous role throughout statistics and related fields. Myriad applications including clustering, classification, and dimensionality reduction involve studying and exploiting the geometric structure of singular values and singular vectors. This paper provides a novel collection of technical and theoretical tools for studying the geometry of singular subspaces using the two-to-infinity norm. Motivated by preliminary deterministic Procrustes analysis, we consider a general matrix perturbation setting in which we derive a new Procrustean matrix decomposition. Together with flexible machinery developed for the two-to-infinity norm, this allows us to conduct a refined analysis of the induced perturbation geometry with respect to the underlying singular vectors even in the presence of singular value multiplicity. Our analysis yields singular vector entrywise perturbation bounds for a range of popular matrix noise models, each of which has a meaningful associated statistical inference task. In addition, we demonstrate how the two-to-infinity norm is the preferred norm in certain statistical settings. Specific applications discussed in this paper include covariance estimation, singular subspace recovery, and multiple graph inference. Both our Procrustean matrix decomposition and the technical machinery developed for the two-to-infinity norm may be of independent interest. Full Article
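For readers unfamiliar with the norm in the title, here is a minimal numpy sketch (not the authors' code) of the two quantities the abstract revolves around: the two-to-infinity norm, i.e. the largest Euclidean row norm of a matrix, and an orthogonal Procrustes alignment between an estimated and a reference singular-subspace basis. The toy matrices and the noise level are assumptions made purely for illustration.

```python
import numpy as np

def two_to_infinity_norm(A):
    """||A||_{2 -> infinity}: the maximum Euclidean norm over the rows of A."""
    return np.max(np.linalg.norm(A, axis=1))

def procrustes_align(U_hat, U):
    """Orthogonal matrix W minimizing ||U_hat @ W - U||_F (orthogonal Procrustes)."""
    A, _, Bt = np.linalg.svd(U_hat.T @ U)
    return A @ Bt

# Toy comparison of two singular-subspace bases (hypothetical data).
rng = np.random.default_rng(0)
M = rng.normal(size=(200, 5))
U, _, _ = np.linalg.svd(M, full_matrices=False)
U_hat, _, _ = np.linalg.svd(M + 0.01 * rng.normal(size=M.shape), full_matrices=False)
W = procrustes_align(U_hat, U)
print(two_to_infinity_norm(U_hat @ W - U))  # row-wise (entrywise-flavored) error
print(np.linalg.norm(U_hat @ W - U, 2))     # spectral-norm error, for contrast
```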
en On testing conditional qualitative treatment effects By projecteuclid.org Published On :: Tue, 21 May 2019 04:00 EDT Chengchun Shi, Rui Song, Wenbin Lu. Source: The Annals of Statistics, Volume 47, Number 4, 2348--2377.Abstract: Precision medicine is an emerging medical paradigm that focuses on finding the most effective treatment strategy tailored for individual patients. In the literature, most of the existing works focused on estimating the optimal treatment regime. However, there has been less attention devoted to hypothesis testing regarding the optimal treatment regime. In this paper, we first introduce the notion of conditional qualitative treatment effects (CQTE) of a set of variables given another set of variables and provide a class of equivalent representations for the null hypothesis of no CQTE. The proposed definition of CQTE does not assume any parametric form for the optimal treatment rule and plays an important role in assessing the incremental value of a set of new variables in optimal treatment decision making conditional on an existing set of prescriptive variables. We then propose novel testing procedures for no CQTE based on kernel estimation of the conditional contrast functions. We show that our test statistics have asymptotically correct size and nonnegligible power against some nonstandard local alternatives. The empirical performance of the proposed tests is evaluated by simulations and an application to an AIDS data set. Full Article
en Convergence complexity analysis of Albert and Chib’s algorithm for Bayesian probit regression By projecteuclid.org Published On :: Tue, 21 May 2019 04:00 EDT Qian Qin, James P. Hobert. Source: The Annals of Statistics, Volume 47, Number 4, 2320--2347.Abstract: The use of MCMC algorithms in high dimensional Bayesian problems has become routine. This has spurred so-called convergence complexity analysis, the goal of which is to ascertain how the convergence rate of a Monte Carlo Markov chain scales with sample size, $n$, and/or number of covariates, $p$. This article provides a thorough convergence complexity analysis of Albert and Chib’s [ J. Amer. Statist. Assoc. 88 (1993) 669–679] data augmentation algorithm for the Bayesian probit regression model. The main tools used in this analysis are drift and minorization conditions. The usual pitfalls associated with this type of analysis are avoided by utilizing centered drift functions, which are minimized in high posterior probability regions, and by using a new technique to suppress high-dimensionality in the construction of minorization conditions. The main result is that the geometric convergence rate of the underlying Markov chain is bounded below 1 both as $n\rightarrow\infty$ (with $p$ fixed), and as $p\rightarrow\infty$ (with $n$ fixed). Furthermore, the first computable bounds on the total variation distance to stationarity are byproducts of the asymptotic analysis. Full Article
en Convergence rates of least squares regression estimators with heavy-tailed errors By projecteuclid.org Published On :: Tue, 21 May 2019 04:00 EDT Qiyang Han, Jon A. Wellner. Source: The Annals of Statistics, Volume 47, Number 4, 2286--2319.Abstract: We study the performance of the least squares estimator (LSE) in a general nonparametric regression model, when the errors are independent of the covariates but may only have a $p$th moment ($p\geq 1$). In such a heavy-tailed regression setting, we show that if the model satisfies a standard “entropy condition” with exponent $\alpha\in(0,2)$, then the $L_{2}$ loss of the LSE converges at a rate $\mathcal{O}_{\mathbf{P}}\bigl(n^{-\frac{1}{2+\alpha}}\vee n^{-\frac{1}{2}+\frac{1}{2p}}\bigr)$. Such a rate cannot be improved under the entropy condition alone. This rate quantifies both some positive and negative aspects of the LSE in a heavy-tailed regression setting. On the positive side, as long as the errors have $p\geq 1+2/\alpha$ moments, the $L_{2}$ loss of the LSE converges at the same rate as if the errors are Gaussian. On the negative side, if $p<1+2/\alpha$, there are (many) hard models at any entropy level $\alpha$ for which the $L_{2}$ loss of the LSE converges at a strictly slower rate than other robust estimators. The validity of the above rate relies crucially on the independence of the covariates and the errors. In fact, the $L_{2}$ loss of the LSE can converge arbitrarily slowly when the independence fails. The key technical ingredient is a new multiplier inequality that gives sharp bounds for the “multiplier empirical process” associated with the LSE. We further give an application to the sparse linear regression model with heavy-tailed covariates and errors to demonstrate the scope of this new inequality. Full Article
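As a quick numerical illustration of the stated rate, the sketch below evaluates its two terms, $n^{-1/(2+\alpha)}$ and $n^{-1/2+1/(2p)}$, and reports the slower (larger) one; the chosen $n$, $\alpha$ and $p$ are arbitrary assumptions, not values from the paper.

```python
import numpy as np

def lse_rate(n, alpha, p):
    """Larger (slower) of the two terms in the rate n^{-1/(2+alpha)} v n^{-1/2+1/(2p)}."""
    entropy_term = n ** (-1.0 / (2.0 + alpha))
    moment_term = n ** (-0.5 + 1.0 / (2.0 * p))
    return max(entropy_term, moment_term)

# With alpha = 1, the Gaussian-like rate n^{-1/3} holds once p >= 1 + 2/alpha = 3.
for p in (2, 3, 5):
    print(p, lse_rate(n=10_000, alpha=1.0, p=p))
```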
en On deep learning as a remedy for the curse of dimensionality in nonparametric regression By projecteuclid.org Published On :: Tue, 21 May 2019 04:00 EDT Benedikt Bauer, Michael Kohler. Source: The Annals of Statistics, Volume 47, Number 4, 2261--2285.Abstract: Assuming that a smoothness condition and a suitable restriction on the structure of the regression function hold, it is shown that least squares estimates based on multilayer feedforward neural networks are able to circumvent the curse of dimensionality in nonparametric regression. The proof is based on new approximation results concerning multilayer feedforward neural networks with bounded weights and a bounded number of hidden neurons. The estimates are compared with various other approaches by using simulated data. Full Article
en Negative association, ordering and convergence of resampling methods By projecteuclid.org Published On :: Tue, 21 May 2019 04:00 EDT Mathieu Gerber, Nicolas Chopin, Nick Whiteley. Source: The Annals of Statistics, Volume 47, Number 4, 2236--2260.Abstract: We study convergence and convergence rates for resampling schemes. Our first main result is a general consistency theorem based on the notion of negative association, which is applied to establish the almost sure weak convergence of measures output from Kitagawa’s [ J. Comput. Graph. Statist. 5 (1996) 1–25] stratified resampling method. Carpenter, Clifford and Fearnhead’s [ IEE Proc. Radar Sonar Navig. 146 (1999) 2–7] systematic resampling method is similar in structure but can fail to converge depending on the order of the input samples. We introduce a new resampling algorithm based on a stochastic rounding technique of [In 42nd IEEE Symposium on Foundations of Computer Science (Las Vegas, NV, 2001) (2001) 588–597 IEEE Computer Soc.], which shares some attractive properties of systematic resampling, but which exhibits negative association and, therefore, converges irrespective of the order of the input samples. We confirm a conjecture made by [ J. Comput. Graph. Statist. 5 (1996) 1–25] that ordering input samples by their states in $\mathbb{R}$ yields a faster rate of convergence; we establish that when particles are ordered using the Hilbert curve in $\mathbb{R}^{d}$, the variance of the resampling error is ${\scriptstyle\mathcal{O}}(N^{-(1+1/d)})$ under mild conditions, where $N$ is the number of particles. We use these results to establish asymptotic properties of particle algorithms based on resampling schemes that differ from multinomial resampling. Full Article
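For context, this is the textbook form of the systematic resampling scheme the abstract discusses, written as a short numpy sketch; it is not the authors' new stochastic-rounding algorithm, and the Hilbert-curve ordering of particles is omitted.

```python
import numpy as np

def systematic_resampling(weights, rng):
    """Classic systematic resampling: a single uniform draw spawns n evenly
    spaced thresholds that are matched against the cumulative weights."""
    n = len(weights)
    positions = (rng.uniform() + np.arange(n)) / n
    cumulative = np.cumsum(weights / np.sum(weights))
    return np.searchsorted(cumulative, positions)  # indices of resampled particles

rng = np.random.default_rng(1)
w = rng.exponential(size=1000)            # hypothetical unnormalized particle weights
idx = systematic_resampling(w, rng)
```

The abstract's point is that the convergence of this scheme depends on how the input particles are ordered, which is what motivates the negatively associated alternative proposed in the paper.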
en Generalized cluster trees and singular measures By projecteuclid.org Published On :: Tue, 21 May 2019 04:00 EDT Yen-Chi Chen. Source: The Annals of Statistics, Volume 47, Number 4, 2174--2203.Abstract: In this paper we study the $\alpha$-cluster tree ($\alpha$-tree) under both singular and nonsingular measures. The $\alpha$-tree uses probability contents within a set created by the ordering of points to construct a cluster tree so that it is well defined even for singular measures. We first derive the convergence rate for a density level set around critical points, which leads to the convergence rate for estimating an $\alpha$-tree under nonsingular measures. For singular measures, we study how the kernel density estimator (KDE) behaves and prove that the KDE is not uniformly consistent but pointwise consistent after rescaling. We further prove that the estimated $\alpha$-tree fails to converge in the $L_{\infty}$ metric but is still consistent under the integrated distance. We also observe a new type of critical points, the dimensional critical points (DCPs), of a singular measure. DCPs are points that contribute to cluster tree topology but cannot be defined using density gradient. Building on the analysis of the KDE and DCPs, we prove the topological consistency of an estimated $\alpha$-tree. Full Article
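A rough sketch of the objects underlying a density cluster tree, assuming a plain Gaussian KDE: the estimated density and one of its upper level sets. The $\alpha$-tree itself orders sets by probability content and handles singular measures, neither of which this toy example attempts.

```python
import numpy as np
from scipy.stats import gaussian_kde

rng = np.random.default_rng(2)
# Hypothetical bimodal sample; each mode corresponds to a branch of the cluster tree.
x = np.concatenate([rng.normal(-2, 0.5, 300), rng.normal(2, 0.5, 300)])
kde = gaussian_kde(x)
dens = kde(x)
lam = np.quantile(dens, 0.5)             # an arbitrary density level
upper_level_set = x[dens >= lam]         # sample points in {x : p_hat(x) >= lam}
print(upper_level_set.size)
```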
en Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem By projecteuclid.org Published On :: Thu, 05 Aug 2010 15:41 EDT James G. Scott, James O. Berger. Source: Ann. Statist., Volume 38, Number 5, 2587--2619.Abstract: This paper studies the multiplicity-correction effect of standard Bayesian variable-selection priors in linear regression. Our first goal is to clarify when, and how, multiplicity correction happens automatically in Bayesian analysis, and to distinguish this correction from the Bayesian Ockham’s-razor effect. Our second goal is to contrast empirical-Bayes and fully Bayesian approaches to variable selection through examples, theoretical results and simulations. Considerable differences between the two approaches are found. In particular, we prove a theorem that characterizes a surprising asymptotic discrepancy between fully Bayes and empirical Bayes. This discrepancy arises from a different source than the failure to account for hyperparameter uncertainty in the empirical-Bayes estimate. Indeed, even at the extreme, when the empirical-Bayes estimate converges asymptotically to the true variable-inclusion probability, the potential for a serious difference remains. Full Article
en componentization By looselycoupled.com Published On :: 2004-09-28T15:00:00-00:00 Breaking down into interchangeable pieces. For many years, software innovators have been trying to make software more like computer hardware, which is assembled from cheap, mass-produced components that connect together using standard interfaces. Component-based development (CBD) uses this approach to assemble software from reusable components within frameworks such as CORBA, Sun's Enterprise Java Beans (EJBs) and Microsoft COM. Today's service oriented architectures, based on web services, go a step further by encapsulating components in a standards-based service interface, which allows components to be reused outside their native framework. Componentization is not limited to software; through the use of subcontracting and outsourcing, it can also apply to business organizations and processes. Full Article
en endpoint By looselycoupled.com Published On :: 2004-11-01T19:00:00-00:00 Where a service connects to the network. In a service oriented architecture, any single network interaction involves two endpoints: one to provide a service, and the other to consume it. In web services, an endpoint is specified by a URI. Full Article
en object-oriented By looselycoupled.com Published On :: 2005-05-17T14:00:00-00:00 (OO) Structured around functional units. Object-oriented programming languages such as C++, SmallTalk and Java are designed to build software made up of objects: discrete bundles of functionality that can act on data only in certain pre-defined ways. This modular building-block approach makes complex software development tasks more flexible and easier to manage within a given programming environment. The emergence of object-oriented programming was a stepping stone to the development of componentization and subsequently of service-oriented architectures. Full Article
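As a concrete illustration of "discrete bundles of functionality that can act on data only in certain pre-defined ways", here is a small class sketch; Python is used for consistency with the other code examples in this digest, even though the entry itself names C++, Smalltalk and Java.

```python
class BankAccount:
    """An object: data (the balance) bundled with the only operations allowed on it."""

    def __init__(self, balance=0):
        self._balance = balance          # internal state, accessed only via methods

    def deposit(self, amount):
        if amount <= 0:
            raise ValueError("deposit must be positive")
        self._balance += amount

    def withdraw(self, amount):
        if amount > self._balance:
            raise ValueError("insufficient funds")
        self._balance -= amount

    @property
    def balance(self):
        return self._balance

account = BankAccount(100)
account.deposit(50)
account.withdraw(30)
print(account.balance)   # 120
```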
en Correction: Sensitivity analysis for an unobserved moderator in RCT-to-target-population generalization of treatment effects By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Trang Quynh Nguyen, Elizabeth A. Stuart. Source: The Annals of Applied Statistics, Volume 14, Number 1, 518--520. Full Article
en A hierarchical dependent Dirichlet process prior for modelling bird migration patterns in the UK By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Alex Diana, Eleni Matechou, Jim Griffin, Alison Johnston. Source: The Annals of Applied Statistics, Volume 14, Number 1, 473--493.Abstract: Environmental changes in recent years have been linked to phenological shifts which in turn are linked to the survival of species. The work in this paper is motivated by capture-recapture data on blackcaps collected by the British Trust for Ornithology as part of the Constant Effort Sites monitoring scheme. Blackcaps overwinter abroad and migrate to the UK annually for breeding purposes. We propose a novel Bayesian nonparametric approach for expressing the bivariate density of individual arrival and departure times at different sites across a number of years as a mixture model. The new model combines the ideas of the hierarchical and the dependent Dirichlet process, allowing the estimation of site-specific weights and year-specific mixture locations, which are modelled as functions of environmental covariates using a multivariate extension of the Gaussian process. The proposed modelling framework is extremely general and can be used in any context where multivariate density estimation is performed jointly across different groups and in the presence of a continuous covariate. Full Article
en A comparison of principal component methods between multiple phenotype regression and multiple SNP regression in genetic association studies By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Zhonghua Liu, Ian Barnett, Xihong Lin. Source: The Annals of Applied Statistics, Volume 14, Number 1, 433--451.Abstract: Principal component analysis (PCA) is a popular method for dimension reduction in unsupervised multivariate analysis. However, existing ad hoc uses of PCA in both multivariate regression (multiple outcomes) and multiple regression (multiple predictors) lack theoretical justification. The differences in the statistical properties of PCAs in these two regression settings are not well understood. In this paper we provide theoretical results on the power of PCA in genetic association testings in both multiple phenotype and SNP-set settings. The multiple phenotype setting refers to the case when one is interested in studying the association between a single SNP and multiple phenotypes as outcomes. The SNP-set setting refers to the case when one is interested in studying the association between multiple SNPs in a SNP set and a single phenotype as the outcome. We demonstrate analytically that the properties of the PC-based analysis in these two regression settings are substantially different. We show that the lower order PCs, that is, PCs with large eigenvalues, are generally preferred and lead to a higher power in the SNP-set setting, while the higher-order PCs, that is, PCs with small eigenvalues, are generally preferred in the multiple phenotype setting. We also investigate the power of three other popular statistical methods, the Wald test, the variance component test and the minimum $p$-value test, in both multiple phenotype and SNP-set settings. We use theoretical power, simulation studies, and two real data analyses to validate our findings. Full Article
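The sketch below only illustrates the distinction the abstract draws, under assumed toy data: extracting the leading (large-eigenvalue) PCs of a SNP set versus the trailing (small-eigenvalue) PCs of a phenotype matrix. It is not the paper's power analysis or its association tests.

```python
import numpy as np

def principal_components(X, k, largest=True):
    """Scores on the top-k (largest=True) or bottom-k principal components of X."""
    Xc = X - X.mean(axis=0)
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    order = np.argsort(s)[::-1] if largest else np.argsort(s)
    return Xc @ Vt[order[:k]].T

rng = np.random.default_rng(3)
genotypes = rng.integers(0, 3, size=(500, 50)).astype(float)  # hypothetical SNP set (0/1/2 allele counts)
phenotypes = rng.normal(size=(500, 10))                       # hypothetical multiple phenotypes
snp_pcs = principal_components(genotypes, k=2, largest=True)      # SNP-set setting: lower-order PCs preferred
pheno_pcs = principal_components(phenotypes, k=2, largest=False)  # multiple-phenotype setting: higher-order PCs preferred
```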
en Measuring human activity spaces from GPS data with density ranking and summary curves By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Yen-Chi Chen, Adrian Dobra. Source: The Annals of Applied Statistics, Volume 14, Number 1, 409--432.Abstract: Activity spaces are fundamental to the assessment of individuals’ dynamic exposure to social and environmental risk factors associated with multiple spatial contexts that are visited during activities of daily living. In this paper we survey existing approaches for measuring the geometry, size and structure of activity spaces, based on GPS data, and explain their limitations. We propose addressing these shortcomings through a nonparametric approach called density ranking and also through three summary curves: the mass-volume curve, the Betti number curve and the persistence curve. We introduce a novel mixture model for human activity spaces and study its asymptotic properties. We prove that the kernel density estimator, which, at the present time, is one of the most widespread methods for measuring activity spaces, is not a stable estimator of their structure. We illustrate the practical value of our methods with a simulation study and with a recently collected GPS dataset that comprises the locations visited by 10 individuals over a six-month period. Full Article
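One common way to turn a KDE into density-ranking values (the rank of a location is the fraction of observations with no higher estimated density) can be sketched as follows; the GPS coordinates are simulated placeholders, and the exact construction in the paper may differ.

```python
import numpy as np
from scipy.stats import gaussian_kde

def density_ranking(points, grid):
    """Fraction of observations whose estimated density does not exceed the
    density at each grid location; values lie in [0, 1]."""
    kde = gaussian_kde(points)
    dens_obs = kde(points)
    dens_grid = kde(grid)
    return np.mean(dens_obs[None, :] <= dens_grid[:, None], axis=1)

rng = np.random.default_rng(4)
gps = rng.normal(size=(2, 500))    # hypothetical standardized 2-D GPS locations (columns are points)
grid = rng.normal(size=(2, 100))   # evaluation locations
alpha_rank = density_ranking(gps, grid)
```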
en Estimating and forecasting the smoking-attributable mortality fraction for both genders jointly in over 60 countries By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Yicheng Li, Adrian E. Raftery. Source: The Annals of Applied Statistics, Volume 14, Number 1, 381--408.Abstract: Smoking is one of the leading preventable threats to human health and a major risk factor for lung cancer, upper aerodigestive cancer and chronic obstructive pulmonary disease. Estimating and forecasting the smoking attributable fraction (SAF) of mortality can yield insights into smoking epidemics and also provide a basis for more accurate mortality and life expectancy projection. Peto et al. ( Lancet 339 (1992) 1268–1278) proposed a method to estimate the SAF using the lung cancer mortality rate as an indicator of exposure to smoking in the population of interest. Here, we use the same method to estimate the all-age SAF (ASAF) for both genders for over 60 countries. We document a strong and cross-nationally consistent pattern of the evolution of the SAF over time. We use this as the basis for a new Bayesian hierarchical model to project future male and female ASAF from over 60 countries simultaneously. This gives forecasts as well as predictive distributions that can be used to find uncertainty intervals for any quantity of interest. We assess the model using out-of-sample predictive validation and find that it provides good forecasts and well-calibrated forecast intervals, comparing favorably with other methods. Full Article
en Feature selection for generalized varying coefficient mixed-effect models with application to obesity GWAS By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Wanghuan Chu, Runze Li, Jingyuan Liu, Matthew Reimherr. Source: The Annals of Applied Statistics, Volume 14, Number 1, 276--298.Abstract: Motivated by an empirical analysis of data from a genome-wide association study on obesity, measured by the body mass index (BMI), we propose a two-step gene-detection procedure for generalized varying coefficient mixed-effects models with ultrahigh dimensional covariates. The proposed procedure selects significant single nucleotide polymorphisms (SNPs) impacting the mean BMI trend, some of which have already been biologically proven to be “fat genes.” The method also discovers SNPs that significantly influence the age-dependent variability of BMI. The proposed procedure takes into account individual variations of genetic effects and can also be directly applied to longitudinal data with continuous, binary or count responses. We employ Monte Carlo simulation studies to assess the performance of the proposed method and further carry out causal inference for the selected SNPs. Full Article
en Estimating the health effects of environmental mixtures using Bayesian semiparametric regression and sparsity inducing priors By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Joseph Antonelli, Maitreyi Mazumdar, David Bellinger, David Christiani, Robert Wright, Brent Coull. Source: The Annals of Applied Statistics, Volume 14, Number 1, 257--275.Abstract: Humans are routinely exposed to mixtures of chemical and other environmental factors, making the quantification of health effects associated with environmental mixtures a critical goal for establishing environmental policy sufficiently protective of human health. The quantification of the effects of exposure to an environmental mixture poses several statistical challenges. It is often the case that exposure to multiple pollutants interact with each other to affect an outcome. Further, the exposure-response relationship between an outcome and some exposures, such as some metals, can exhibit complex, nonlinear forms, since some exposures can be beneficial and detrimental at different ranges of exposure. To estimate the health effects of complex mixtures, we propose a flexible Bayesian approach that allows exposures to interact with each other and have nonlinear relationships with the outcome. We induce sparsity using multivariate spike and slab priors to determine which exposures are associated with the outcome and which exposures interact with each other. The proposed approach is interpretable, as we can use the posterior probabilities of inclusion into the model to identify pollutants that interact with each other. We utilize our approach to study the impact of exposure to metals on child neurodevelopment in Bangladesh and find a nonlinear, interactive relationship between arsenic and manganese. Full Article
en Bayesian factor models for probabilistic cause of death assessment with verbal autopsies By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Tsuyoshi Kunihama, Zehang Richard Li, Samuel J. Clark, Tyler H. McCormick. Source: The Annals of Applied Statistics, Volume 14, Number 1, 241--256.Abstract: The distribution of deaths by cause provides crucial information for public health planning, response and evaluation. About 60% of deaths globally are not registered or given a cause, limiting our ability to understand disease epidemiology. Verbal autopsy (VA) surveys are increasingly used in such settings to collect information on the signs, symptoms and medical history of people who have recently died. This article develops a novel Bayesian method for estimation of population distributions of deaths by cause using verbal autopsy data. The proposed approach is based on a multivariate probit model where associations among items in questionnaires are flexibly induced by latent factors. Using the Population Health Metrics Research Consortium labeled data that include both VA and medically certified causes of death, we assess performance of the proposed method. Further, we estimate important questionnaire items that are highly associated with causes of death. This framework provides insights that will simplify future data collection. Full Article
en Modifying the Chi-square and the CMH test for population genetic inference: Adapting to overdispersion By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Kerstin Spitzer, Marta Pelizzola, Andreas Futschik. Source: The Annals of Applied Statistics, Volume 14, Number 1, 202--220.Abstract: Evolve and resequence studies provide a popular approach to simulate evolution in the lab and explore its genetic basis. In this context, Pearson’s chi-square test, Fisher’s exact test as well as the Cochran–Mantel–Haenszel test are commonly used to infer genomic positions affected by selection from temporal changes in allele frequency. However, the null model associated with these tests does not match the null hypothesis of actual interest. Indeed, due to genetic drift and possibly other additional noise components such as pool sequencing, the null variance in the data can be substantially larger than accounted for by these common test statistics. This leads to $p$-values that are systematically too small and, therefore, a huge number of false positive results. Even, if the ranking rather than the actual $p$-values is of interest, a naive application of the mentioned tests will give misleading results, as the amount of overdispersion varies from locus to locus. We therefore propose adjusted statistics that take the overdispersion into account while keeping the formulas simple. This is particularly useful in genome-wide applications, where millions of SNPs can be handled with little computational effort. We then apply the adapted test statistics to real data from Drosophila and investigate how information from intermediate generations can be included when available. We also discuss further applications such as genome-wide association studies based on pool sequencing data and tests for local adaptation. Full Article
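The spirit of the adjustment, as a minimal sketch under assumptions: compute the usual Pearson chi-square statistic for an allele-count table and deflate it by a variance-inflation (overdispersion) factor before looking up the p-value. The paper derives that factor from drift and pool-sequencing noise; here it is just a user-supplied number, and the counts are hypothetical.

```python
import numpy as np
from scipy.stats import chi2

def adjusted_chisq_pvalue(table, inflation):
    """Pearson chi-square p-value with the statistic deflated by an
    overdispersion (variance-inflation) factor."""
    table = np.asarray(table, dtype=float)
    expected = np.outer(table.sum(axis=1), table.sum(axis=0)) / table.sum()
    stat = np.sum((table - expected) ** 2 / expected)
    df = (table.shape[0] - 1) * (table.shape[1] - 1)
    return chi2.sf(stat / inflation, df)

# Hypothetical 2x2 allele counts at one SNP across two time points; the inflation
# factor would in practice be estimated, e.g. from drift simulations or replicates.
print(adjusted_chisq_pvalue([[120, 80], [95, 105]], inflation=3.0))
```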
en Surface temperature monitoring in liver procurement via functional variance change-point analysis By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Zhenguo Gao, Pang Du, Ran Jin, John L. Robertson. Source: The Annals of Applied Statistics, Volume 14, Number 1, 143--159.Abstract: Liver procurement experiments with surface-temperature monitoring motivated Gao et al. ( J. Amer. Statist. Assoc. 114 (2019) 773–781) to develop a variance change-point detection method under a smoothly-changing mean trend. However, the spotwise change points yielded from their method do not offer immediate information to surgeons since an organ is often transplanted as a whole or in part. We develop a new practical method that can analyze a defined portion of the organ surface at a time. It also provides a novel addition to the developing field of functional data monitoring. Furthermore, numerical challenge emerges for simultaneously modeling the variance functions of 2D locations and the mean function of location and time. The respective sample sizes in the scales of 10,000 and 1,000,000 for modeling these functions make standard spline estimation too costly to be useful. We introduce a multistage subsampling strategy with steps educated by quickly-computable preliminary statistical measures. Extensive simulations show that the new method can efficiently reduce the computational cost and provide reasonable parameter estimates. Application of the new method to our liver surface temperature monitoring data shows its effectiveness in providing accurate status change information for a selected portion of the organ in the experiment. Full Article
en Efficient real-time monitoring of an emerging influenza pandemic: How feasible? By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Paul J. Birrell, Lorenz Wernisch, Brian D. M. Tom, Leonhard Held, Gareth O. Roberts, Richard G. Pebody, Daniela De Angelis. Source: The Annals of Applied Statistics, Volume 14, Number 1, 74--93.Abstract: A prompt public health response to a new epidemic relies on the ability to monitor and predict its evolution in real time as data accumulate. The 2009 A/H1N1 outbreak in the UK revealed pandemic data as noisy, contaminated, potentially biased and originating from multiple sources. This seriously challenges the capacity for real-time monitoring. Here, we assess the feasibility of real-time inference based on such data by constructing an analytic tool combining an age-stratified SEIR transmission model with various observation models describing the data generation mechanisms. As batches of data become available, a sequential Monte Carlo (SMC) algorithm is developed to synthesise multiple imperfect data streams, iterate epidemic inferences and assess model adequacy amidst a rapidly evolving epidemic environment, substantially reducing computation time in comparison to standard MCMC, to ensure timely delivery of real-time epidemic assessments. In application to simulated data designed to mimic the 2009 A/H1N1 epidemic, SMC is shown to have additional benefits in terms of assessing predictive performance and coping with parameter nonidentifiability. Full Article
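For orientation, a bare-bones bootstrap particle filter (the basic SMC building block) on a toy random-walk state-space model; the paper's algorithm works with an age-stratified SEIR model, multiple imperfect data streams and model-adequacy checks, none of which appear in this sketch.

```python
import numpy as np

def bootstrap_particle_filter(obs, n_particles, rng):
    """Minimal bootstrap particle filter: propagate, weight by the likelihood,
    estimate, then resample."""
    particles = rng.normal(size=n_particles)
    filtered_means = []
    for y in obs:
        particles = particles + rng.normal(scale=0.1, size=n_particles)  # propagate
        logw = -0.5 * (y - particles) ** 2                                # Gaussian obs. log-likelihood (unit sd)
        w = np.exp(logw - logw.max())
        w /= w.sum()
        filtered_means.append(np.sum(w * particles))
        particles = particles[rng.choice(n_particles, size=n_particles, p=w)]  # resample
    return np.array(filtered_means)

rng = np.random.default_rng(5)
y = np.cumsum(rng.normal(scale=0.1, size=50)) + rng.normal(scale=1.0, size=50)  # simulated observations
print(bootstrap_particle_filter(y, n_particles=2000, rng=rng)[-5:])
```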
en Integrative survival analysis with uncertain event times in application to a suicide risk study By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Wenjie Wang, Robert Aseltine, Kun Chen, Jun Yan. Source: The Annals of Applied Statistics, Volume 14, Number 1, 51--73.Abstract: The concept of integrating data from disparate sources to accelerate scientific discovery has generated tremendous excitement in many fields. The potential benefits from data integration, however, may be compromised by the uncertainty due to incomplete/imperfect record linkage. Motivated by a suicide risk study, we propose an approach for analyzing survival data with uncertain event times arising from data integration. Specifically, in our problem deaths identified from the hospital discharge records together with reported suicidal deaths determined by the Office of Medical Examiner may still not include all the death events of patients, and the missing deaths can be recovered from a complete database of death records. Since the hospital discharge data can only be linked to the death record data by matching basic patient characteristics, a patient with a censored death time from the first dataset could be linked to multiple potential event records in the second dataset. We develop an integrative Cox proportional hazards regression in which the uncertainty in the matched event times is modeled probabilistically. The estimation procedure combines the ideas of profile likelihood and the expectation conditional maximization algorithm (ECM). Simulation studies demonstrate that under realistic settings of imperfect data linkage the proposed method outperforms several competing approaches including multiple imputation. A marginal screening analysis using the proposed integrative Cox model is performed to identify risk factors associated with death following suicide-related hospitalization in Connecticut. The identified diagnostics codes are consistent with existing literature and provide several new insights on suicide risk, prediction and prevention. Full Article
en BART with targeted smoothing: An analysis of patient-specific stillbirth risk By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Jennifer E. Starling, Jared S. Murray, Carlos M. Carvalho, Radek K. Bukowski, James G. Scott. Source: The Annals of Applied Statistics, Volume 14, Number 1, 28--50.Abstract: This article introduces BART with Targeted Smoothing, or tsBART, a new Bayesian tree-based model for nonparametric regression. The goal of tsBART is to introduce smoothness over a single target covariate $t$ while not necessarily requiring smoothness over other covariates $x$. tsBART is based on the Bayesian Additive Regression Trees (BART) model, an ensemble of regression trees. tsBART extends BART by parameterizing each tree’s terminal nodes with smooth functions of $t$ rather than independent scalars. Like BART, tsBART captures complex nonlinear relationships and interactions among the predictors. But unlike BART, tsBART guarantees that the response surface will be smooth in the target covariate. This improves interpretability and helps to regularize the estimate. After introducing and benchmarking the tsBART model, we apply it to our motivating example—pregnancy outcomes data from the National Center for Health Statistics. Our aim is to provide patient-specific estimates of stillbirth risk across gestational age $(t)$ and based on maternal and fetal risk factors $(x)$. Obstetricians expect stillbirth risk to vary smoothly over gestational age but not necessarily over other covariates, and tsBART has been designed precisely to reflect this structural knowledge. The results of our analysis show the clear superiority of the tsBART model for quantifying stillbirth risk, thereby providing patients and doctors with better information for managing the risk of fetal mortality. All methods described here are implemented in the R package tsbart . Full Article
en SHOPPER: A probabilistic model of consumer choice with substitutes and complements By projecteuclid.org Published On :: Wed, 15 Apr 2020 22:05 EDT Francisco J. R. Ruiz, Susan Athey, David M. Blei. Source: The Annals of Applied Statistics, Volume 14, Number 1, 1--27.Abstract: We develop SHOPPER, a sequential probabilistic model of shopping data. SHOPPER uses interpretable components to model the forces that drive how a customer chooses products; in particular, we designed SHOPPER to capture how items interact with other items. We develop an efficient posterior inference algorithm to estimate these forces from large-scale data, and we analyze a large dataset from a major chain grocery store. We are interested in answering counterfactual queries about changes in prices. We found that SHOPPER provides accurate predictions even under price interventions, and that it helps identify complementary and substitutable pairs of products. Full Article
en A general theory for preferential sampling in environmental networks By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Joe Watson, James V. Zidek, Gavin Shaddick. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2662--2700.Abstract: This paper presents a general model framework for detecting the preferential sampling of environmental monitors recording an environmental process across space and/or time. This is achieved by considering the joint distribution of an environmental process with a site-selection process that considers where and when sites are placed to measure the process. The environmental process may be spatial, temporal or spatio-temporal in nature. By sharing random effects between the two processes, the joint model is able to establish whether site placement was stochastically dependent on the environmental process under study. Furthermore, if stochastic dependence is identified between the two processes, then inferences about the probability distribution of the spatio-temporal process will change, as will predictions made of the process across space and time. The embedding into a spatio-temporal framework also allows for the modelling of the dynamic site-selection process itself. Real-world factors affecting both the size and location of the network can be easily modelled and quantified. Depending upon the choice of the population of locations considered for selection across space and time under the site-selection process, different insights about the precise nature of preferential sampling can be obtained. The general framework developed in the paper is designed to be easily and quickly fit using the R-INLA package. We apply this framework to a case study involving particulate air pollution over the UK where a major reduction in the size of a monitoring network through time occurred. It is demonstrated that a significant response-biased reduction in the air quality monitoring network occurred, namely the relocation of monitoring sites to locations with the highest pollution levels, and the routine removal of sites at locations with the lowest. We also show that the network was consistently unrepresentative of the levels of particulate matter seen across much of GB throughout the operating life of the network. Finally we show that this may have led to a severe overreporting of the population-average exposure levels experienced across GB. This could have great impacts on estimates of the health effects of black smoke levels. Full Article
en Hierarchical infinite factor models for improving the prediction of surgical complications for geriatric patients By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Elizabeth Lorenzi, Ricardo Henao, Katherine Heller. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2637--2661.Abstract: Nearly a third of all surgeries performed in the United States occur for patients over the age of 65; these older adults experience a higher rate of postoperative morbidity and mortality. To improve the care for these patients, we aim to identify and characterize high risk geriatric patients to send to a specialized perioperative clinic while leveraging the overall surgical population to improve learning. To this end, we develop a hierarchical infinite latent factor model (HIFM) to appropriately account for the covariance structure across subpopulations in data. We propose a novel Hierarchical Dirichlet Process shrinkage prior on the loadings matrix that flexibly captures the underlying structure of our data while sharing information across subpopulations to improve inference and prediction. The stick-breaking construction of the prior assumes an infinite number of factors and allows for each subpopulation to utilize different subsets of the factor space and select the number of factors needed to best explain the variation. We develop the model into a latent factor regression method that excels at prediction and inference of regression coefficients. Simulations validate this strong performance compared to baseline methods. We apply this work to the problem of predicting surgical complications using electronic health record data for geriatric patients and all surgical patients at Duke University Health System (DUHS). The motivating application demonstrates the improved predictive performance when using HIFM in both area under the ROC curve and area under the PR Curve while providing interpretable coefficients that may lead to actionable interventions. Full Article
en Scalable high-resolution forecasting of sparse spatiotemporal events with kernel methods: A winning solution to the NIJ “Real-Time Crime Forecasting Challenge” By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Seth Flaxman, Michael Chirico, Pau Pereira, Charles Loeffler. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2564--2585.Abstract: We propose a generic spatiotemporal event forecasting method which we developed for the National Institute of Justice’s (NIJ) Real-Time Crime Forecasting Challenge (National Institute of Justice (2017)). Our method is a spatiotemporal forecasting model combining scalable randomized Reproducing Kernel Hilbert Space (RKHS) methods for approximating Gaussian processes with autoregressive smoothing kernels in a regularized supervised learning framework. While the smoothing kernels capture the two main approaches in current use in the field of crime forecasting, kernel density estimation (KDE) and self-exciting point process (SEPP) models, the RKHS component of the model can be understood as an approximation to the popular log-Gaussian Cox Process model. For inference, we discretize the spatiotemporal point pattern and learn a log-intensity function using the Poisson likelihood and highly efficient gradient-based optimization methods. Model hyperparameters including quality of RKHS approximation, spatial and temporal kernel lengthscales, number of autoregressive lags and bandwidths for smoothing kernels as well as cell shape, size and rotation, were learned using cross validation. Resulting predictions significantly exceeded baseline KDE estimates and SEPP models for sparse events. Full Article
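The general recipe, random features standing in for the RKHS expansion plus a regularized Poisson fit of the log-intensity on a discretized grid, can be sketched as below. This is not the winning entry's code: the autoregressive smoothing kernels, the spatial grid construction and the cross-validated hyperparameters are all omitted, and the data are simulated placeholders.

```python
import numpy as np
from sklearn.linear_model import PoissonRegressor

def random_fourier_features(X, n_features, lengthscale, rng):
    """Random Fourier features approximating a Gaussian-kernel (RKHS) feature map."""
    W = rng.normal(scale=1.0 / lengthscale, size=(X.shape[1], n_features))
    b = rng.uniform(0, 2 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(X @ W + b)

rng = np.random.default_rng(6)
X = rng.uniform(size=(5000, 3))              # hypothetical (x, y, t) coordinates of grid cells
counts = rng.poisson(lam=1 + 2 * X[:, 0])    # hypothetical event counts per cell
Phi = random_fourier_features(X, n_features=200, lengthscale=0.2, rng=rng)
model = PoissonRegressor(alpha=1.0).fit(Phi, counts)   # regularized Poisson log-intensity fit
print(model.predict(Phi[:3]))
```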
en A simple, consistent estimator of SNP heritability from genome-wide association studies By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Armin Schwartzman, Andrew J. Schork, Rong Zablocki, Wesley K. Thompson. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2509--2538.Abstract: Analysis of genome-wide association studies (GWAS) is characterized by a large number of univariate regressions where a quantitative trait is regressed on hundreds of thousands to millions of single-nucleotide polymorphism (SNP) allele counts, one at a time. This article proposes an estimator of the SNP heritability of the trait, defined here as the fraction of the variance of the trait explained by the SNPs in the study. The proposed GWAS heritability (GWASH) estimator is easy to compute, highly interpretable and is consistent as the number of SNPs and the sample size increase. More importantly, it can be computed from summary statistics typically reported in GWAS, not requiring access to the original data. The estimator takes full account of the linkage disequilibrium (LD) or correlation between the SNPs in the study through moments of the LD matrix, estimable from auxiliary datasets. Unlike other proposed estimators in the literature, we establish the theoretical properties of the GWASH estimator and obtain analytical estimates of the precision, allowing for power and sample size calculations for SNP heritability estimates and forming a firm foundation for future methodological development. Full Article
en Empirical Bayes analysis of RNA sequencing experiments with auxiliary information By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Kun Liang. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2452--2482.Abstract: Finding differentially expressed genes is a common task in high-throughput transcriptome studies. While traditional statistical methods rank the genes by their test statistics alone, we analyze an RNA sequencing dataset using the auxiliary information of gene length and the test statistics from a related microarray study. Given the auxiliary information, we propose a novel nonparametric empirical Bayes procedure to estimate the posterior probability of differential expression for each gene. We demonstrate the advantage of our procedure in extensive simulation studies and a psoriasis RNA sequencing study. The companion R package calm is available at Bioconductor. Full Article
en Propensity score weighting for causal inference with multiple treatments By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Fan Li, Fan Li. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2389--2415.Abstract: Causal or unconfounded descriptive comparisons between multiple groups are common in observational studies. Motivated from a racial disparity study in health services research, we propose a unified propensity score weighting framework, the balancing weights, for estimating causal effects with multiple treatments. These weights incorporate the generalized propensity scores to balance the weighted covariate distribution of each treatment group, all weighted toward a common prespecified target population. The class of balancing weights include several existing approaches such as the inverse probability weights and trimming weights as special cases. Within this framework, we propose a set of target estimands based on linear contrasts. We further develop the generalized overlap weights, constructed as the product of the inverse probability weights and the harmonic mean of the generalized propensity scores. The generalized overlap weighting scheme corresponds to the target population with the most overlap in covariates across the multiple treatments. These weights are bounded and thus bypass the problem of extreme propensities. We show that the generalized overlap weights minimize the total asymptotic variance of the moment weighting estimators for the pairwise contrasts within the class of balancing weights. We consider two balance check criteria and propose a new sandwich variance estimator for estimating the causal effects with generalized overlap weights. We apply these methods to study the racial disparities in medical expenditure between several racial groups using the 2009 Medical Expenditure Panel Survey (MEPS) data. Simulations were carried out to compare with existing methods. Full Article
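A small sketch of the harmonic-mean tilting described in the abstract, under the assumption that the weight for a unit in group $j$ is proportional to $h(x)/e_{j}(x)$ with $h(x)=[\sum_{k}1/e_{k}(x)]^{-1}$; consult the paper for the exact definition and normalization.

```python
import numpy as np

def generalized_overlap_weights(propensities, treatment):
    """Weights proportional to (1/e_j(x)) * h(x), where h(x) = 1 / sum_k 1/e_k(x).
    `propensities` is an n x J matrix of generalized propensity scores (rows sum
    to 1) and `treatment` holds each unit's group label in {0, ..., J-1}."""
    inv = 1.0 / propensities
    h = 1.0 / inv.sum(axis=1)                   # harmonic-mean-type tilting function
    return inv[np.arange(len(treatment)), treatment] * h

# Hypothetical generalized propensity scores for three treatment groups.
e = np.array([[0.6, 0.3, 0.1],
              [0.2, 0.5, 0.3],
              [0.1, 0.1, 0.8]])
z = np.array([0, 1, 2])
print(generalized_overlap_weights(e, z))   # bounded even when some e_j(x) are tiny
```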
en A nonparametric spatial test to identify factors that shape a microbiome By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Susheela P. Singh, Ana-Maria Staicu, Robert R. Dunn, Noah Fierer, Brian J. Reich. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2341--2362.Abstract: The advent of high-throughput sequencing technologies has made data from DNA material readily available, leading to a surge of microbiome-related research establishing links between markers of microbiome health and specific outcomes. However, to harness the power of microbial communities we must understand not only how they affect us, but also how they can be influenced to improve outcomes. This area has been dominated by methods that reduce community composition to summary metrics, which can fail to fully exploit the complexity of community data. Recently, methods have been developed to model the abundance of taxa in a community, but they can be computationally intensive and do not account for spatial effects underlying microbial settlement. These spatial effects are particularly relevant in the microbiome setting because we expect communities that are close together to be more similar than those that are far apart. In this paper, we propose a flexible Bayesian spike-and-slab variable selection model for presence-absence indicators that accounts for spatial dependence and cross-dependence between taxa while reducing dimensionality in both directions. We show by simulation that in the presence of spatial dependence, popular distance-based hypothesis testing methods fail to preserve their advertised size, and the proposed method improves variable selection. Finally, we present an application of our method to an indoor fungal community found within homes across the contiguous United States. Full Article
en A latent discrete Markov random field approach to identifying and classifying historical forest communities based on spatial multivariate tree species counts By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Stephen Berg, Jun Zhu, Murray K. Clayton, Monika E. Shea, David J. Mladenoff. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2312--2340.Abstract: The Wisconsin Public Land Survey database describes historical forest composition at high spatial resolution and is of interest in ecological studies of forest composition in Wisconsin just prior to significant Euro-American settlement. For such studies it is useful to identify recurring subpopulations of tree species known as communities, but standard clustering approaches for subpopulation identification do not account for dependence between spatially nearby observations. Here, we develop and fit a latent discrete Markov random field model for the purpose of identifying and classifying historical forest communities based on spatially referenced multivariate tree species counts across Wisconsin. We show empirically for the actual dataset and through simulation that our latent Markov random field modeling approach improves prediction and parameter estimation performance. For model fitting we introduce a new stochastic approximation algorithm which enables computationally efficient estimation and classification of large amounts of spatial multivariate count data. Full Article
en Objective Bayes model selection of Gaussian interventional essential graphs for the identification of signaling pathways By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Federico Castelletti, Guido Consonni. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2289--2311.Abstract: A signalling pathway is a sequence of chemical reactions initiated by a stimulus which in turn affects a receptor, and then through some intermediate steps cascades down to the final cell response. Based on the technique of flow cytometry, samples of cell-by-cell measurements are collected under each experimental condition, resulting in a collection of interventional data (assuming no latent variables are involved). Usually several external interventions are applied at different points of the pathway, the ultimate aim being the structural recovery of the underlying signalling network which we model as a causal Directed Acyclic Graph (DAG) using intervention calculus. The advantage of using interventional data, rather than purely observational one, is that identifiability of the true data generating DAG is enhanced. More technically a Markov equivalence class of DAGs, whose members are statistically indistinguishable based on observational data alone, can be further decomposed, using additional interventional data, into smaller distinct Interventional Markov equivalence classes. We present a Bayesian methodology for structural learning of Interventional Markov equivalence classes based on observational and interventional samples of multivariate Gaussian observations. Our approach is objective, meaning that it is based on default parameter priors requiring no personal elicitation; some flexibility is however allowed through a tuning parameter which regulates sparsity in the prior on model space. Based on an analytical expression for the marginal likelihood of a given Interventional Essential Graph, and a suitable MCMC scheme, our analysis produces an approximate posterior distribution on the space of Interventional Markov equivalence classes, which can be used to provide uncertainty quantification for features of substantive scientific interest, such as the posterior probability of inclusion of selected edges, or paths. Full Article
en Fitting a deeply nested hierarchical model to a large book review dataset using a moment-based estimator By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Ningshan Zhang, Kyle Schmaus, Patrick O. Perry. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2260--2288.Abstract: We consider a particular instance of a common problem in recommender systems, using a database of book reviews to inform user-targeted recommendations. In our dataset, books are categorized into genres and subgenres. To exploit this nested taxonomy, we use a hierarchical model that enables information pooling across similar items at many levels within the genre hierarchy. The main challenge in deploying this model is computational. The data sizes are large and fitting the model at scale using off-the-shelf maximum likelihood procedures is prohibitive. To get around this computational bottleneck, we extend a moment-based fitting procedure proposed for fitting single-level hierarchical models to the general case of arbitrarily deep hierarchies. This extension is an order of magnitude faster than standard maximum likelihood procedures. The fitting method can be deployed beyond recommender systems to general contexts with deeply nested hierarchical generalized linear mixed models. Full Article
en Spatial modeling of trends in crime over time in Philadelphia By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Cecilia Balocchi, Shane T. Jensen. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2235--2259.Abstract: Understanding the relationship between change in crime over time and the geography of urban areas is an important problem for urban planning. Accurate estimation of changing crime rates throughout a city would aid law enforcement as well as enable studies of the association between crime and the built environment. Bayesian modeling is a promising direction since areal data require principled sharing of information to address spatial autocorrelation between proximal neighborhoods. We develop several Bayesian approaches to spatial sharing of information between neighborhoods while modeling trends in crime counts over time. We apply our methodology to estimate changes in crime throughout Philadelphia over the 2006-15 period while also incorporating spatially-varying economic and demographic predictors. We find that the local shrinkage imposed by a conditional autoregressive model has substantial benefits in terms of out-of-sample predictive accuracy of crime. We also explore the possibility of spatial discontinuities between neighborhoods that could represent natural barriers or aspects of the built environment. Full Article
en Microsimulation model calibration using incremental mixture approximate Bayesian computation By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Carolyn M. Rutter, Jonathan Ozik, Maria DeYoreo, Nicholson Collier. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2189--2212.Abstract: Microsimulation models (MSMs) are used to inform policy by predicting population-level outcomes under different scenarios. MSMs simulate individual-level event histories that mark the disease process (such as the development of cancer) and the effect of policy actions (such as screening) on these events. MSMs often have many unknown parameters; calibration is the process of searching the parameter space to select parameters that result in accurate MSM prediction of a wide range of targets. We develop Incremental Mixture Approximate Bayesian Computation (IMABC) for MSM calibration which results in a simulated sample from the posterior distribution of model parameters given calibration targets. IMABC begins with a rejection-based ABC step, drawing a sample of points from the prior distribution of model parameters and accepting points that result in simulated targets that are near observed targets. Next, the sample is iteratively updated by drawing additional points from a mixture of multivariate normal distributions and accepting points that result in accurate predictions. Posterior estimates are obtained by weighting the final set of accepted points to account for the adaptive sampling scheme. We demonstrate IMABC by calibrating CRC-SPIN 2.0, an updated version of a MSM for colorectal cancer (CRC) that has been used to inform national CRC screening guidelines. Full Article
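The first stage of the procedure, plain rejection ABC against calibration targets, can be sketched as follows; the incremental mixture updates and the final reweighting that give IMABC its name are not shown, and the "microsimulation" here is just a toy Poisson model with assumed targets.

```python
import numpy as np

def rejection_abc(simulate, prior_sample, observed, tol, n_draws, rng):
    """Keep prior draws whose simulated targets fall within `tol` of the observed targets."""
    accepted = []
    for _ in range(n_draws):
        theta = prior_sample(rng)
        if np.all(np.abs(simulate(theta, rng) - observed) <= tol):
            accepted.append(theta)
    return np.array(accepted)

def simulate(theta, rng):
    """Toy stand-in for a microsimulation model: Poisson draws summarized by mean and sd."""
    x = rng.poisson(theta, size=200)
    return np.array([x.mean(), x.std()])

rng = np.random.default_rng(7)
observed = np.array([5.0, 2.2])                       # hypothetical calibration targets
draws = rejection_abc(simulate, lambda rng: rng.uniform(0.1, 20.0),
                      observed, tol=np.array([0.3, 0.3]), n_draws=5000, rng=rng)
print(draws.size, draws.mean() if draws.size else None)
```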
en Prediction of small area quantiles for the conservation effects assessment project using a mixed effects quantile regression model By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Emily Berg, Danhyang Lee. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2158--2188.Abstract: Quantiles of the distributions of several measures of erosion are important parameters in the Conservation Effects Assessment Project, a survey intended to quantify soil and nutrient loss on crop fields. Because sample sizes for domains of interest are too small to support reliable direct estimators, model based methods are needed. Quantile regression is appealing for CEAP because finding a single family of parametric models that adequately describes the distributions of all variables is difficult and small area quantiles are parameters of interest. We construct empirical Bayes predictors and bootstrap mean squared error estimators based on the linearly interpolated generalized Pareto distribution (LIGPD). We apply the procedures to predict county-level quantiles for four types of erosion in Wisconsin and validate the procedures through simulation. Full Article
en Joint model of accelerated failure time and mechanistic nonlinear model for censored covariates, with application in HIV/AIDS By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Hongbin Zhang, Lang Wu. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2140--2157.Abstract: For a time-to-event outcome with censored time-varying covariates, a joint Cox model with a linear mixed effects model is the standard modeling approach. In some applications such as AIDS studies, mechanistic nonlinear models are available for some covariate process such as viral load during anti-HIV treatments, derived from the underlying data-generation mechanisms and disease progression. Such a mechanistic nonlinear covariate model may provide better-predicted values when the covariates are left censored or mismeasured. When the focus is on the impact of the time-varying covariate process on the survival outcome, an accelerated failure time (AFT) model provides an excellent alternative to the Cox proportional hazard model since an AFT model is formulated to allow the influence of the outcome by the entire covariate process. In this article, we consider a nonlinear mixed effects model for the censored covariates in an AFT model, implemented using a Monte Carlo EM algorithm, under the framework of a joint model for simultaneous inference. We apply the joint model to an HIV/AIDS data to gain insights for assessing the association between viral load and immunological restoration during antiretroviral therapy. Simulation is conducted to compare model performance when the covariate model and the survival model are misspecified. Full Article
en Fire seasonality identification with multimodality tests By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Jose Ameijeiras-Alonso, Akli Benali, Rosa M. Crujeiras, Alberto Rodríguez-Casal, José M. C. Pereira. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2120--2139.Abstract: Understanding the role of vegetation fires in the Earth system is an important environmental problem. Although fire occurrence is influenced by natural factors, human activity related to land use and management has altered the temporal patterns of fire in several regions of the world. Hence, for better insight into fire regimes, it is of special interest to analyze where human activity has altered fire seasonality. To this end, multimodality tests are a useful tool for determining the number of annual fire peaks. The periodicity of fires and their complex distributional features motivate the use of nonparametric circular statistics. The unsatisfactory performance of previous circular nonparametric proposals for testing multimodality justifies the introduction of a new approach, considering an adapted version of the excess mass statistic, jointly with a bootstrap calibration algorithm. A systematic application of the test to the Russia–Kazakhstan area is presented in order to determine how many fire peaks can be identified in this region. A False Discovery Rate correction, accounting for the spatial dependence of the data, is also required. Full Article
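For context, the classical (linear-data) excess mass statistic that the circular test adapts can be written as follows; this is the standard formulation shown only to fix ideas, not the paper's circular version:

$$E_{n,k}(\lambda) = \sup_{C_1,\dots,C_k} \sum_{j=1}^{k} \bigl\{\mathbb{P}_n(C_j) - \lambda\,|C_j|\bigr\}, \qquad \Delta_{n,k+1} = \max_{\lambda > 0} \bigl\{E_{n,k+1}(\lambda) - E_{n,k}(\lambda)\bigr\},$$

where the supremum runs over $k$ disjoint closed intervals (arcs in the circular case), $\mathbb{P}_n$ is the empirical measure and $|C_j|$ denotes length. Large values of $\Delta_{n,k+1}$ are evidence against the null hypothesis of at most $k$ modes, and the bootstrap calibration supplies the critical values.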
en Statistical inference for partially observed branching processes with application to cell lineage tracking of in vivo hematopoiesis By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Jason Xu, Samson Koelle, Peter Guttorp, Chuanfeng Wu, Cynthia Dunbar, Janis L. Abkowitz, Vladimir N. Minin. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2091--2119.Abstract: Single-cell lineage tracking strategies enabled by recent experimental technologies have produced significant insights into cell fate decisions, but lack the quantitative framework necessary for rigorous statistical analysis of mechanistic models describing cell division and differentiation. In this paper, we develop such a framework with corresponding moment-based parameter estimation techniques for continuous-time, multi-type branching processes. Such processes provide a probabilistic model of how cells divide and differentiate, and we apply our method to study hematopoiesis, the mechanism of blood cell production. We derive closed-form expressions for higher moments in a general class of such models. These analytical results allow us to efficiently estimate parameters of much richer statistical models of hematopoiesis than those used in previous statistical studies. To our knowledge, the method provides the first rate inference procedure for fitting such models to time series data generated from cellular barcoding experiments. After validating the methodology in simulation studies, we apply our estimator to hematopoietic lineage tracking data from rhesus macaques. Our analysis provides a more complete understanding of cell fate decisions during hematopoiesis in nonhuman primates, which may be more relevant to human biology and clinical strategies than previous findings from murine studies. For example, in addition to the previously estimated hematopoietic stem cell self-renewal rate, we are able to estimate fate decision probabilities and to compare structurally distinct models of hematopoiesis using cross validation. These estimates of fate decision probabilities and our model selection results should help biologists compare competing hypotheses about how progenitor cells differentiate. The methodology is transferable to a large class of stochastic compartmental and multi-type branching models, commonly used in studies of cancer progression, epidemiology and many other fields. Full Article
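A hedged sketch of the first-moment identity that moment-based estimators for such processes typically build on (standard multi-type branching theory, not a result specific to this paper):

$$M(t) := \bigl(\mathbb{E}[X_j(t) \mid X(0) = e_i]\bigr)_{ij} = e^{t\Omega}, \qquad \Omega_{ij} = a_i\,(m_{ij} - \delta_{ij}),$$

where $a_i$ is the event rate of a type-$i$ cell, $m_{ij}$ is the expected number of type-$j$ offspring produced at such an event, and $\delta_{ij}$ is the Kronecker delta. Matching expressions of this form (and analogous higher-moment expressions) to empirical moments of barcode counts yields estimating equations for the division and differentiation rates.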
en Robust elastic net estimators for variable selection and identification of proteomic biomarkers By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Gabriela V. Cohen Freue, David Kepplinger, Matías Salibián-Barrera, Ezequiel Smucler. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2065--2090.Abstract: In large-scale quantitative proteomic studies, scientists measure the abundance of thousands of proteins from the human proteome in search of novel biomarkers for a given disease. Penalized regression estimators can be used to identify potential biomarkers among a large set of molecular features measured. Yet, the performance and statistical properties of these estimators depend on the loss and penalty functions used to define them. Motivated by a real plasma proteomic biomarkers study, we propose a new class of penalized robust estimators based on the elastic net penalty, which can be tuned to keep groups of correlated variables together in the selected model and maintain robustness against possible outliers. We also propose an efficient algorithm to compute our robust penalized estimators and derive a data-driven method to select the penalty term. Our robust penalized estimators have very good robustness properties and are also consistent under certain regularity conditions. Numerical results show that our robust estimators compare favorably to other robust penalized estimators. Using our proposed methodology for the analysis of the proteomics data, we identify new potentially relevant biomarkers of cardiac allograft vasculopathy that are not found with nonrobust alternatives. The selected model is validated in a new set of 52 test samples and achieves an area under the receiver operating characteristic (AUC) of 0.85. Full Article
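Schematically, a robust elastic-net estimator of the kind this abstract describes minimizes a bounded loss applied to scaled residuals plus the elastic-net penalty; the display below is a generic form for orientation, not the exact estimator or tuning procedure proposed in the paper:

$$\hat{\boldsymbol\beta} = \arg\min_{\beta_0,\,\boldsymbol\beta} \; \sum_{i=1}^{n} \rho\!\left(\frac{y_i - \beta_0 - \mathbf{x}_i^{\top}\boldsymbol\beta}{\hat\sigma}\right) + \lambda\left(\alpha\,\lVert\boldsymbol\beta\rVert_1 + \frac{1-\alpha}{2}\,\lVert\boldsymbol\beta\rVert_2^2\right),$$

where $\rho$ is a bounded robust loss (for example, Tukey's bisquare), $\hat\sigma$ is a robust residual scale, $\lambda$ controls the overall amount of penalization, and $\alpha \in [0,1]$ trades off sparsity ($\ell_1$) against the grouping effect ($\ell_2$) that keeps correlated proteins together in the selected model.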
en Estimating the rate constant from biosensor data via an adaptive variational Bayesian approach By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Ye Zhang, Zhigang Yao, Patrik Forssén, Torgny Fornstedt. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2011--2042.Abstract: How to obtain the rate constants of a chemical reaction is a fundamental open problem in both science and industry. Traditional techniques for finding rate constants require either chemical modifications of the reactants or indirect measurements. The rate constant map method is a modern technique to study binding equilibrium and kinetics in chemical reactions. Finding a rate constant map from biosensor data is an ill-posed inverse problem that is usually solved by regularization. In this work, rather than finding a deterministic regularized rate constant map that does not provide uncertainty quantification of the solution, we develop an adaptive variational Bayesian approach to estimate the distribution of the rate constant map, from which some intrinsic properties of a chemical reaction can be explored, including information about rate constants. Our new approach is more realistic than the existing approaches used for biosensors and allows us to estimate the dynamics of the interactions, which are usually hidden in a deterministic approximate solution. We verify the performance of the proposed method by numerical simulations and compare it with the Markov chain Monte Carlo algorithm. The results illustrate that the variational method can reliably capture the posterior distribution in a computationally efficient way. Finally, the developed method is also tested on real biosensor data (parathyroid hormone), where we provide two novel analysis tools, the thresholding contour map and the high-order moment map, to estimate the number of interactions as well as their rate constants. Full Article
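For orientation, the simplest biosensor interaction model (one-to-one Langmuir binding) relates the sensor response to a single pair of rate constants; the rate constant map generalizes this to a distribution over such pairs. The display is the standard textbook kinetic model, not the paper's integral-equation formulation:

$$\frac{dR(t)}{dt} = k_a\,C(t)\,\bigl(R_{\max} - R(t)\bigr) - k_d\,R(t),$$

where $R(t)$ is the sensor response, $C(t)$ the injected analyte concentration, $R_{\max}$ the saturation response, and $(k_a, k_d)$ the association and dissociation rate constants. Heterogeneous surfaces are then described by a map (distribution) over $(k_a, k_d)$, and recovering that map from noisy response curves is the ill-posed inverse problem addressed above.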
en A semiparametric modeling approach using Bayesian Additive Regression Trees with an application to evaluate heterogeneous treatment effects By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Bret Zeldow, Vincent Lo Re III, Jason Roy. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1989--2010.Abstract: Bayesian Additive Regression Trees (BART) is a flexible machine learning algorithm capable of capturing nonlinearities between an outcome and covariates and interactions among covariates. We extend BART to a semiparametric regression framework in which the conditional expectation of an outcome is a function of treatment, its effect modifiers, and confounders. The confounders are allowed to have unspecified functional form, while treatment and effect modifiers that are directly related to the research question are given a linear form. The result is a Bayesian semiparametric linear regression model where the posterior distribution of the parameters of the linear part can be interpreted as in parametric Bayesian regression. This is useful in situations where a subset of the variables is of substantive interest and the others are nuisance variables that we would like to control for. An example of this occurs in causal modeling with the structural mean model (SMM). Under certain causal assumptions, our method can be used as a Bayesian SMM. Our methods are demonstrated with simulation studies and an application to a dataset of adults with HIV/Hepatitis C coinfection who newly initiate antiretroviral therapy. The methods are available in an R package called semibart. Full Article
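One schematic way to write the semiparametric decomposition described above (a sketch of the model class, with illustrative symbols, rather than the authors' exact parameterization):

$$\mathbb{E}(Y_i \mid A_i, X_i, W_i) = \psi_1 A_i + \psi_2 A_i X_i + f(W_i),$$

where $A_i$ is the treatment, $X_i$ are effect modifiers, $W_i$ are confounders, $f$ receives a BART prior, and the posterior for $(\psi_1, \psi_2)$ is read off exactly as in a parametric Bayesian regression, which is what makes the treatment and effect-modification parameters directly interpretable.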
en Radio-iBAG: Radiomics-based integrative Bayesian analysis of multiplatform genomic data By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Youyi Zhang, Jeffrey S. Morris, Shivali Narang Aerry, Arvind U. K. Rao, Veerabhadran Baladandayuthapani. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1957--1988.Abstract: Technological innovations have produced large multi-modal datasets that include imaging and multi-platform genomics data. Integrative analyses of such data have the potential to reveal important biological and clinical insights into complex diseases like cancer. In this paper, we present Bayesian approaches for integrative analysis of radiological imaging and multi-platform genomic data, wherein our goals are to simultaneously identify genomic and radiomic (that is, radiology-based imaging) markers, along with the latent associations between these two modalities, and to detect the overall prognostic relevance of the combined markers. For this task, we propose Radio-iBAG: Radiomics-based Integrative Bayesian Analysis of Multiplatform Genomic Data, a multi-scale Bayesian hierarchical model that involves several innovative strategies: it incorporates integrative analysis of multi-platform genomic data sets to capture fundamental biological relationships; explores the associations of radiomic markers, accompanied by genomic information, with clinical outcomes; and detects genomic and radiomic markers associated with clinical prognosis. We also introduce the use of sparse Principal Component Analysis (sPCA) to extract a sparse set of approximately orthogonal meta-features, each containing information from a set of related individual radiomic features, reducing dimensionality and combining like features. Our methods are motivated by and applied to The Cancer Genome Atlas glioblastoma multiforme data set, wherein we integrate magnetic resonance imaging-based biomarkers along with genomic, epigenomic and transcriptomic data. Our model identifies important magnetic resonance imaging features and the associated genomic platforms that are related to patient survival times. Full Article
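The sparse-PCA step can be illustrated with off-the-shelf tools; the snippet below uses scikit-learn's SparsePCA on a simulated block of correlated radiomic features purely as a stand-in for the paper's pipeline (the data, dimensions, and tuning values are hypothetical).

import numpy as np
from sklearn.decomposition import SparsePCA

# Illustrative only: X_radiomic stands in for an (n_patients, n_features) block of
# correlated radiomic features extracted from imaging.
rng = np.random.default_rng(0)
X_radiomic = rng.normal(size=(100, 40))

spca = SparsePCA(n_components=5, alpha=1.0, random_state=0)
meta_features = spca.fit_transform(X_radiomic)   # sparse, approximately orthogonal scores
loadings = spca.components_                      # each row shows which raw features drive a meta-feature

Because each row of loadings is sparse, a meta-feature can be read off as a small set of related raw radiomic features, which is what keeps the downstream hierarchical regression interpretable.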
en Approximate inference for constructing astronomical catalogs from images By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Jeffrey Regier, Andrew C. Miller, David Schlegel, Ryan P. Adams, Jon D. McAuliffe, Prabhat. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1884--1926.Abstract: We present a new, fully generative model for constructing astronomical catalogs from optical telescope image sets. Each pixel intensity is treated as a random variable with parameters that depend on the latent properties of stars and galaxies. These latent properties are themselves modeled as random. We compare two procedures for posterior inference. One procedure is based on Markov chain Monte Carlo (MCMC) while the other is based on variational inference (VI). The MCMC procedure excels at quantifying uncertainty, while the VI procedure is 1000 times faster. On a supercomputer, the VI procedure efficiently uses 665,000 CPU cores to construct an astronomical catalog from 50 terabytes of images in 14.6 minutes, demonstrating the scaling characteristics necessary to construct catalogs for upcoming astronomical surveys. Full Article
en Incorporating conditional dependence in latent class models for probabilistic record linkage: Does it matter? By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Huiping Xu, Xiaochun Li, Changyu Shen, Siu L. Hui, Shaun Grannis. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1753--1790.Abstract: The conditional independence assumption of the Fellegi and Sunter (FS) model in probabilistic record linkage is often violated when matching real-world data. Ignoring conditional dependence has been shown to seriously bias parameter estimates. However, in record linkage, the ultimate goal is to inform the match status of record pairs, and therefore record linkage algorithms should be evaluated in terms of matching accuracy. In the literature, more flexible models have been proposed to relax the conditional independence assumption, but few studies have assessed whether such accommodations improve matching accuracy. In this paper, we show that incorporating conditional dependence appropriately yields matching accuracy comparable to or better than that of the FS model, using three real-world data linkage examples. Through a simulation study, we further investigate when conditional dependence models provide improved matching accuracy. Our study shows that the FS model is generally robust to the conditional independence assumption and provides matching accuracy comparable to the more complex conditional dependence models. However, when the match prevalence approaches 0% or 100% and conditional dependence exists in the dominating class, it is necessary to address conditional dependence, as the FS model produces suboptimal matching accuracy. The need to address conditional dependence becomes less important when highly discriminating fields are used. Our simulation study also shows that conditional dependence models with a misspecified dependence structure can produce less accurate record matching than the FS model, and therefore we caution against the blind use of conditional dependence models. Full Article
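To fix notation, the FS model under conditional independence scores a record pair by the likelihood ratio of its agreement pattern $\gamma = (\gamma_1,\dots,\gamma_K)$ across $K$ comparison fields; this is the classical formulation that the conditional dependence models relax:

$$R(\gamma) = \frac{P(\gamma \mid M)}{P(\gamma \mid U)} = \prod_{k=1}^{K} \frac{m_k^{\gamma_k}(1 - m_k)^{1-\gamma_k}}{u_k^{\gamma_k}(1 - u_k)^{1-\gamma_k}},$$

where $m_k = P(\gamma_k = 1 \mid M)$ and $u_k = P(\gamma_k = 1 \mid U)$ are the field-level agreement probabilities among true matches ($M$) and nonmatches ($U$), and pairs with $\log R(\gamma)$ above an upper threshold are declared matches. Conditional dependence models replace the product with a joint model for $\gamma$ within each latent class.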
en A hierarchical Bayesian model for single-cell clustering using RNA-sequencing data By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Yiyi Liu, Joshua L. Warren, Hongyu Zhao. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1733--1752.Abstract: Understanding the heterogeneity of cells is an important biological question. The development of single-cell RNA-sequencing (scRNA-seq) technology provides high resolution data for such inquiry. A key challenge in scRNA-seq analysis is the high variability of measured RNA expression levels and frequent dropouts (missing values) due to limited input RNA compared to bulk RNA-seq measurement. Existing clustering methods do not perform well for these noisy and zero-inflated scRNA-seq data. In this manuscript we propose a Bayesian hierarchical model, called BasClu, to appropriately characterize important features of scRNA-seq data in order to more accurately cluster cells. We demonstrate the effectiveness of our method with extensive simulation studies and applications to three real scRNA-seq datasets. Full Article
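A generic way to write down the dropout and overdispersion features this abstract mentions is a zero-inflated count model of the following form; it is offered only as a sketch of the modeling issues, not as BasClu's actual likelihood:

$$P(Y_{gc} = y) = \pi_{gc}\,\mathbf{1}\{y = 0\} + (1 - \pi_{gc})\,\mathrm{NB}(y;\, \mu_{g,z_c},\, \phi_g),$$

where $Y_{gc}$ is the observed count for gene $g$ in cell $c$, $\pi_{gc}$ is a dropout probability (often modeled as decreasing in the underlying expression level), $z_c$ is the latent cluster label of cell $c$, and the negative binomial mean $\mu_{g,z_c}$ carries the cluster structure that drives the clustering.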
en Sequential decision model for inference and prediction on nonuniform hypergraphs with application to knot matching from computational forestry By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Seong-Hwan Jun, Samuel W. K. Wong, James V. Zidek, Alexandre Bouchard-Côté. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1678--1707.Abstract: In this paper, we consider the knot-matching problem arising in computational forestry. The knot-matching problem is an important problem that needs to be solved to advance the state of the art in automatic strength prediction of lumber. We show that this problem can be formulated as a quadripartite matching problem and develop a sequential decision model that admits efficient parameter estimation, along with a sequential Monte Carlo sampler that can be used for rapid sampling of graph matchings. We demonstrate the effectiveness of our methods on 30 manually annotated boards and present findings from various simulation studies to provide further evidence supporting the efficacy of our methods. Full Article
en RCRnorm: An integrated system of random-coefficient hierarchical regression models for normalizing NanoString nCounter data By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Gaoxiang Jia, Xinlei Wang, Qiwei Li, Wei Lu, Ximing Tang, Ignacio Wistuba, Yang Xie. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1617--1647.Abstract: Formalin-fixed paraffin-embedded (FFPE) samples have great potential for biomarker discovery, retrospective studies and diagnosis or prognosis of diseases. Their application, however, is hindered by the unsatisfactory performance of traditional gene expression profiling techniques on damaged RNAs. The NanoString nCounter platform is well suited for profiling of FFPE samples and measures gene expression with high sensitivity, which may greatly facilitate realization of the scientific and clinical value of FFPE samples. However, methodological development for normalization, a critical step when analyzing this type of data, lags far behind. Existing methods designed for the platform use information from different types of internal controls separately and rely on an overly simplified assumption that expression of housekeeping genes is constant across samples for global scaling. Thus, these methods are not optimized for the nCounter system, not to mention that they were not developed for FFPE samples. We construct an integrated system of random-coefficient hierarchical regression models to capture the main patterns and characteristics observed from NanoString data of FFPE samples and develop a Bayesian approach to estimate parameters and normalize gene expression across samples. Our method, labeled RCRnorm, incorporates information from all aspects of the experimental design and simultaneously removes biases from various sources. It eliminates the unrealistic assumption on housekeeping genes and offers great interpretability. Furthermore, it is applicable to fresh-frozen or similar samples, which can generally be viewed as a reduced case of FFPE samples. Simulations and applications showed the superior performance of RCRnorm. Full Article
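To illustrate the flavor of a random-coefficient calibration (a hedged, generic sketch; the full RCRnorm system has additional layers for negative controls, housekeeping genes, and regular genes), the positive-control probes with known input concentrations $x_p$ can anchor sample-specific scales:

$$\log Y_{ip} = a_i + b_i\,\log x_p + \varepsilon_{ip}, \qquad (a_i, b_i) \sim \mathcal{N}\bigl((\mu_a, \mu_b), \Sigma\bigr),$$

where $Y_{ip}$ is the observed count of positive-control probe $p$ in sample $i$. The sample-specific intercepts and slopes $(a_i, b_i)$, shrunk toward common means, can then be used to place the remaining probes in each sample on a comparable scale, which is the general idea behind borrowing strength across samples rather than scaling each one independently.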