Latest sa news

Neyman-Pearson classification: parametrics and sample size requirement

By
Published On :: 2020

The Neyman-Pearson (NP) paradigm in binary classification seeks classifiers that achieve a minimal type II error while enforcing the prioritized type I error controlled under some user-specified level $alpha$. This paradigm serves naturally in applications such as severe disease diagnosis and spam detection, where people have clear priorities among the two error types. Recently, Tong, Feng, and Li (2018) proposed a nonparametric umbrella algorithm that adapts all scoring-type classification methods (e.g., logistic regression, support vector machines, random forest) to respect the given type I error (i.e., conditional probability of classifying a class $0$ observation as class $1$ under the 0-1 coding) upper bound $alpha$ with high probability, without specific distributional assumptions on the features and the responses. Universal the umbrella algorithm is, it demands an explicit minimum sample size requirement on class $0$, which is often the more scarce class, such as in rare disease diagnosis applications. In this work, we employ the parametric linear discriminant analysis (LDA) model and propose a new parametric thresholding algorithm, which does not need the minimum sample size requirements on class $0$ observations and thus is suitable for small sample applications such as rare disease diagnosis. Leveraging both the existing nonparametric and the newly proposed parametric thresholding rules, we propose four LDA-based NP classifiers, for both low- and high-dimensional settings. On the theoretical front, we prove NP oracle inequalities for one proposed classifier, where the rate for excess type II error benefits from the explicit parametric model assumption. Furthermore, as NP classifiers involve a sample splitting step of class $0$ observations, we construct a new adaptive sample splitting scheme that can be applied universally to NP classifiers, and this adaptive strategy reduces the type II error of these classifiers. The proposed NP classifiers are implemented in the R package nproc.

Neyman-Pearson classification: parametrics and sample size requirement

On the consistency of graph-based Bayesian semi-supervised learning and the scalability of sampling algorithms

Provably robust estimation of modulo 1 samples of a smooth function with applications to phase unwrapping

Graph-Dependent Implicit Regularisation for Distributed Stochastic Subgradient Descent

Causal Discovery Toolbox: Uncovering causal relationships in Python

Learning Linear Non-Gaussian Causal Models in the Presence of Latent Variables

Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables

Greedy Attack and Gumbel Attack: Generating Adversarial Examples for Discrete Data

A Convex Parametrization of a New Class of Universal Kernel Functions

Ancestral Gumbel-Top-k Sampling for Sampling Without Replacement

Learning Causal Networks via Additive Faithfulness

Generalized Optimal Matching Methods for Causal Inference

Multi-Player Bandits: The Adversarial Case

Access thousands of newspapers and magazines with PressReader

Have your say on the Highway 404 Employment Corridor Secondary Plan

Oriented first passage percolation in the mean field limit

A message from the editorial board

A message from the editorial board

The limiting distribution of the Gibbs sampler for the intrinsic conditional autoregressive model

Keeping the balance—Bridge sampling for marginal likelihood estimation in finite mixture, mixture of experts and Markov mixture models

A rank-based Cramér–von-Mises-type test for two samples

A temporal perspective on the rate of convergence in first-passage percolation under a moment condition

Necessary and sufficient conditions for the convergence of the consistent maximal displacement of the branching random walk

A new log-linear bimodal Birnbaum–Saunders regression model with application to survival data

Failure rate of Birnbaum–Saunders distributions: Shape, change-point, estimation and robustness

Novel bodies : disability and sexuality in eighteenth-century British literature

Heavy metalloid music : the story of Simply Saucer

Can &#36;p&#36;-values be meaningfully interpreted without random sampling?

Estimating the size of a hidden finite set: Large-sample behavior of estimators

A survey of bootstrap methods in finite population sampling

Generating Thermal Image Data Samples using 3D Facial Modelling Techniques and Deep Learning Methodologies. (arXiv:2005.01923v2 [cs.CV] UPDATED)

Deep transfer learning for improving single-EEG arousal detection. (arXiv:2004.05111v2 [cs.CV] UPDATED)

Sampling random graph homomorphisms and applications to network data analysis. (arXiv:1910.09483v2 [math.PR] UPDATED)

Convergence rates for optimised adaptive importance samplers. (arXiv:1903.12044v4 [stat.CO] UPDATED)

Visualisation and knowledge discovery from interpretable models. (arXiv:2005.03632v1 [cs.LG])

Know Your Clients' behaviours: a cluster analysis of financial transactions. (arXiv:2005.03625v1 [econ.EM])

A simulation study of disaggregation regression for spatial disease mapping. (arXiv:2005.03604v1 [stat.AP])

Predictive Modeling of ICU Healthcare-Associated Infections from Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling Approach. (arXiv:2005.03582v1 [cs.LG])

Multi-Label Sampling based on Local Label Imbalance. (arXiv:2005.03240v1 [cs.LG])

Deep Learning Framework for Detecting Ground Deformation in the Built Environment using Satellite InSAR data. (arXiv:2005.03221v1 [cs.CV])

Staying safe in the Library

Sowing legume seeds, reaping cash : a renaissance within communities in Sub-Saharan Africa

Salt, fat and sugar reduction : sensory approaches for nutritional reformulation of foods and beverages

Saffron : science, technology and health

Requirements engineering : 26th International Working Conference, REFSQ 2020, Pisa, Italy, March 24-27, 2020, Proceedings

Prevention of chronic diseases and age-related disability

Passive and active measurement : 21st International Conference, PAM 2020, Eugene, Oregon, USA, March 30-31, 2020, Proceedings

Oral mucosa in health and disease : a concise handbook

Ketamine : from abused drug to rapid-acting antidepressant

Fresh-cut fruits and vegetables : technologies and mechanisms for safety control

Will Synthetic Biology Save the World?

Will LEED v4 Ever Be Usable?

Cost-Effective, Energy Efficient Concrete Sandwich Panels

NCS Trust ‘sad and disappointed’ at government plans to shut it down

Incident involving highwall collapse spurs MSHA safety alert

Urban Roots Fruit+ and Cantina-Style Salsa

CBC's Salto brand unveils Unica

WHATSAPP: +1(443) 720-5561 Buy fake usd/aud/cad/JPY/CNY/GBP/euros/pounds

Lakeview Farms to Acquire noosa from Campbell Soup Company

Do Cash Transfers Save Lives?, Nov. 19

Good Morning, News: City Council to Vote on Clean & Safe Contract, Vision Zero Gets an Audit, and Trump Taps Elon Musk to Lead DOGE (Do You Even Want to Know?)

Even a heroic detective like 'Cross' can't save this Prime Video adaptation

Basic Black: An urban agenda for Massachusetts

Basic Black: Urban Renaissance

Pastor Greg Laurie says God placed Trump in power 'for such a time as this'

Subscribe To Our Newsletter

Can $p$-values be meaningfully interpreted without random sampling?