Latest d news

Isotonic regression in general dimensions

By projecteuclid.org
Published On :: Fri, 02 Aug 2019 22:04 EDT

Qiyang Han, Tengyao Wang, Sabyasachi Chatterjee, Richard J. Samworth.

Source: The Annals of Statistics, Volume 47, Number 5, 2440--2471.

Abstract:
We study the least squares regression function estimator over the class of real-valued functions on $[0,1]^{d}$ that are increasing in each coordinate. For uniformly bounded signals and with a fixed, cubic lattice design, we establish that the estimator achieves the minimax rate of order $n^{-min{2/(d+2),1/d}}$ in the empirical $L_{2}$ loss, up to polylogarithmic factors. Further, we prove a sharp oracle inequality, which reveals in particular that when the true regression function is piecewise constant on $k$ hyperrectangles, the least squares estimator enjoys a faster, adaptive rate of convergence of $(k/n)^{min(1,2/d)}$, again up to polylogarithmic factors. Previous results are confined to the case $dleq2$. Finally, we establish corresponding bounds (which are new even in the case $d=2$) in the more challenging random design setting. There are two surprising features of these results: first, they demonstrate that it is possible for a global empirical risk minimisation procedure to be rate optimal up to polylogarithmic factors even when the corresponding entropy integral for the function class diverges rapidly; second, they indicate that the adaptation rate for shape-constrained estimators can be strictly worse than the parametric rate.

Isotonic regression in general dimensions

The two-to-infinity norm and singular subspace geometry with applications to high-dimensional statistics

Cross validation for locally stationary processes

Dynamic network models and graphon estimation

On testing conditional qualitative treatment effects

Convergence complexity analysis of Albert and Chib’s algorithm for Bayesian probit regression

Convergence rates of least squares regression estimators with heavy-tailed errors

On deep learning as a remedy for the curse of dimensionality in nonparametric regression

Negative association, ordering and convergence of resampling methods

Spectral method and regularized MLE are both optimal for top-&#36;K&#36; ranking

Generalized cluster trees and singular measures

Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem

grid computing

endpoint

middleware

metadata

object-oriented

data warehouse

Correction: Sensitivity analysis for an unobserved moderator in RCT-to-target-population generalization of treatment effects

Bayesian mixed effects models for zero-inflated compositions in microbiome data analysis

A hierarchical dependent Dirichlet process prior for modelling bird migration patterns in the UK

Estimating causal effects in studies of human brain function: New models, methods and estimands

A comparison of principal component methods between multiple phenotype regression and multiple SNP regression in genetic association studies

Measuring human activity spaces from GPS data with density ranking and summary curves

Estimating and forecasting the smoking-attributable mortality fraction for both genders jointly in over 60 countries

Regression for copula-linked compound distributions with applications in modeling aggregate insurance claims

Modeling wildfire ignition origins in southern California using linear network point processes

Optimal asset allocation with multivariate Bayesian dynamic linear models

Feature selection for generalized varying coefficient mixed-effect models with application to obesity GWAS

Estimating the health effects of environmental mixtures using Bayesian semiparametric regression and sparsity inducing priors

Bayesian factor models for probabilistic cause of death assessment with verbal autopsies

A hierarchical Bayesian model for predicting ecological interactions using scaled evolutionary relationships

Modifying the Chi-square and the CMH test for population genetic inference: Adapting to overdispersion

TFisher: A powerful truncation and weighting procedure for combining &#36;p&#36;-values

Assessing wage status transition and stagnation using quantile transition regression

A statistical analysis of noisy crowdsourced weather data

Modeling microbial abundances and dysbiosis with beta-binomial regression

Efficient real-time monitoring of an emerging influenza pandemic: How feasible?

Integrative survival analysis with uncertain event times in application to a suicide risk study

BART with targeted smoothing: An analysis of patient-specific stillbirth risk

SHOPPER: A probabilistic model of consumer choice with substitutes and complements

Hierarchical infinite factor models for improving the prediction of surgical complications for geriatric patients

Bayesian indicator variable selection to incorporate hierarchical overlapping group structure in multi-omics applications

On Bayesian new edge prediction and anomaly detection in computer networks

Scalable high-resolution forecasting of sparse spatiotemporal events with kernel methods: A winning solution to the NIJ “Real-Time Crime Forecasting Challenge”

A hierarchical curve-based approach to the analysis of manifold data

A simple, consistent estimator of SNP heritability from genome-wide association studies

New formulation of the logistic-Gaussian process to analyze trajectory tracking data

Outline analyses of the called strike zone in Major League Baseball

Predicting paleoclimate from compositional data using multivariate Gaussian process inverse prediction

The Finish Line: Cast Stone and EIFS

The Finish Line: A Case Study: What is Causing This?

The Finish Line: Backwrapping vs. Edgewrapping

The Finish Line: Drainage Efficiency

The Finish Line: Earthquakes and EIFS

The Finish Line: Eco-Friendliness of EIFS

The Finish Line: Foam Shapes Revisited

The Finish Line: Adhesives vs. Mechanical Fasteners

The Finish Line: Keep it Dry

The Finish Line: Keep it Dry Part 2

The Finish Line: Design Features

The Finish Line: Building Walls in the Land Down Under

Green Globes vs. LEED

Will Synthetic Biology Save the World?

Cloaked in Green?

Subscribe To Our Newsletter

Spectral method and regularized MLE are both optimal for top-$K$ ranking

TFisher: A powerful truncation and weighting procedure for combining $p$-values