Latest cl news

Path-Based Spectral Clustering: Guarantees, Robustness to Outliers, and Fast Algorithms

By
Published On :: 2020

We consider the problem of clustering with the longest-leg path distance (LLPD) metric, which is informative for elongated and irregularly shaped clusters. We prove finite-sample guarantees on the performance of clustering with respect to this metric when random samples are drawn from multiple intrinsically low-dimensional clusters in high-dimensional space, in the presence of a large number of high-dimensional outliers. By combining these results with spectral clustering with respect to LLPD, we provide conditions under which the Laplacian eigengap statistic correctly determines the number of clusters for a large class of data sets, and prove guarantees on the labeling accuracy of the proposed algorithm. Our methods are quite general and provide performance guarantees for spectral clustering with any ultrametric. We also introduce an efficient, easy to implement approximation algorithm for the LLPD based on a multiscale analysis of adjacency graphs, which allows for the runtime of LLPD spectral clustering to be quasilinear in the number of data points.

Path-Based Spectral Clustering: Guarantees, Robustness to Outliers, and Fast Algorithms

Neyman-Pearson classification: parametrics and sample size requirement

Perturbation Bounds for Procrustes, Classical Scaling, and Trilateration, with Applications to Manifold Learning

Connecting Spectral Clustering to Maximum Margins and Level Sets

Targeted Fused Ridge Estimation of Inverse Covariance Matrices from Multiple High-Dimensional Data Classes

A New Class of Time Dependent Latent Factor Models with Applications

Noise Accumulation in High Dimensional Classification and Total Signal Index

Latent Simplex Position Model: High Dimensional Multi-view Clustering with Uncertainty Quantification

Optimal Bipartite Network Clustering

A Convex Parametrization of a New Class of Universal Kernel Functions

pyts: A Python Package for Time Series Classification

High-Dimensional Inference for Cluster-Based Graphical Models

Union of Low-Rank Tensor Spaces: Clustering and Completion

(1 + epsilon)-class Classification: an Anomaly Detection Method for Highly Imbalanced or Incomplete Data Sets

A Bayesian sparse finite mixture model for clustering data from a heterogeneous population

Reclaiming indigenous governance : reflections and insights from Australia, Canada, New Zealand, and the United States

Variable selection methods for model-based clustering

Adaptive clinical trial designs for phase I cancer studies

Data confidentiality: A review of methods for statistical disclosure limitation and methods for assessing privacy

Finite mixture models and model-based clustering

Unsupervised Pre-trained Models from Healthy ADLs Improve Parkinson's Disease Classification of Gait Patterns. (arXiv:2005.02589v2 [cs.LG] UPDATED)

Mnemonics Training: Multi-Class Incremental Learning without Forgetting. (arXiv:2002.10211v3 [cs.CV] UPDATED)

Statistical aspects of nuclear mass models. (arXiv:2002.04151v3 [nucl-th] UPDATED)

Cyclic Boosting -- an explainable supervised machine learning algorithm. (arXiv:2002.03425v2 [cs.LG] UPDATED)

On the impact of selected modern deep-learning techniques to the performance and celerity of classification models in an experimental high-energy physics use case. (arXiv:2002.01427v3 [physics.data-an] UPDATED)

Margin-Based Generalization Lower Bounds for Boosted Classifiers. (arXiv:1909.12518v4 [cs.LG] UPDATED)

Deep Learning on Point Clouds for False Positive Reduction at Nodule Detection in Chest CT Scans. (arXiv:2005.03654v1 [eess.IV])

Local Cascade Ensemble for Multivariate Data Classification. (arXiv:2005.03645v1 [cs.LG])

Know Your Clients' behaviours: a cluster analysis of financial transactions. (arXiv:2005.03625v1 [econ.EM])

Predictive Modeling of ICU Healthcare-Associated Infections from Imbalanced Data. Using Ensembles and a Clustering-Based Undersampling Approach. (arXiv:2005.03582v1 [cs.LG])

Reference and Document Aware Semantic Evaluation Methods for Korean Language Summarization. (arXiv:2005.03510v1 [cs.CL])

Transfer Learning for sEMG-based Hand Gesture Classification using Deep Learning in a Master-Slave Architecture. (arXiv:2005.03460v1 [eess.SP])

Distributional Robustness of K-class Estimators and the PULSE. (arXiv:2005.03353v1 [econ.EM])

Training and Classification using a Restricted Boltzmann Machine on the D-Wave 2000Q. (arXiv:2005.03247v1 [cs.LG])

Classification of pediatric pneumonia using chest X-rays by functional regression. (arXiv:2005.03243v1 [stat.AP])

Fair Algorithms for Hierarchical Agglomerative Clustering. (arXiv:2005.03197v1 [cs.LG])

Close encounters: a manuscripts workshop

Wintrobe's atlas of clinical hematology

Temporomandibular disorders : a translational approach from basic science to clinical applicability

Plastic waste and recycling : environmental impact, societal issues, prevention, and solutions

Pathogenesis of periodontal diseases : biological concepts for clinicians

Ocular therapeutics handbook : a clinical manual

Microbial cyclic di-nucleotide signaling

Mayo Clinic strategies to reduce burnout : 12 actions to create the ideal workplace

Maxillofacial cone beam computed tomography : principles, techniques and clinical applications

Management of fractured endodontic instruments : a clinical guide

Machine learning in aquaculture : hunger classification of Lates calcarifer

Low-dose radiation effects on animals and ecosystems : long-term study on the Fukushima Nuclear Accident

Handbook of immunosenescence : basic understanding and clinical implications

Genomic designing of climate-smart vegetable crops

The Finish Line: Cleaning EIFS

Cloaked in Green?

New Gadget Analyzes Everything Including Building Industry

Climate Change

Exoskeleton in the Job Site Closet

Only 12 per cent of leading charities publicly recognise a trade union, analysis suggests

Companies' 'Green' Efforts Include Products’ Material Content

Viridian Introduces Engineered Reclaimed Hardwood Line

Reclaimé Collection by Quick-Step includes new White Washed Oak look

Good Morning, News: City Council to Vote on Clean & Safe Contract, Vision Zero Gets an Audit, and Trump Taps Elon Musk to Lead DOGE (Do You Even Want to Know?)

Portland’s Ranked Choice Voting Was a Success (Despite What the Oregonian Claims)

Who's powering nuclear energy's comeback?

What Trump's win means for electric vehicle manufacturers

Why working-class voters have been shifting toward the Republican Party

Exquisite bird fossil provides clues to the evolution of avian brains

Subscribe To Our Newsletter