Latest en news

Fast multivariate empirical cumulative distribution function with connection to kernel density estimation. (arXiv:2005.03246v1 [cs.DS])

By arxiv.org
Published On ::

This paper revisits the problem of computing empirical cumulative distribution functions (ECDF) efficiently on large, multivariate datasets. Computing an ECDF at one evaluation point requires $mathcal{O}(N)$ operations on a dataset composed of $N$ data points. Therefore, a direct evaluation of ECDFs at $N$ evaluation points requires a quadratic $mathcal{O}(N^2)$ operations, which is prohibitive for large-scale problems. Two fast and exact methods are proposed and compared. The first one is based on fast summation in lexicographical order, with a $mathcal{O}(N{log}N)$ complexity and requires the evaluation points to lie on a regular grid. The second one is based on the divide-and-conquer principle, with a $mathcal{O}(Nlog(N)^{(d-1){vee}1})$ complexity and requires the evaluation points to coincide with the input points. The two fast algorithms are described and detailed in the general $d$-dimensional case, and numerical experiments validate their speed and accuracy. Secondly, the paper establishes a direct connection between cumulative distribution functions and kernel density estimation (KDE) for a large class of kernels. This connection paves the way for fast exact algorithms for multivariate kernel density estimation and kernel regression. Numerical tests with the Laplacian kernel validate the speed and accuracy of the proposed algorithms. A broad range of large-scale multivariate density estimation, cumulative distribution estimation, survival function estimation and regression problems can benefit from the proposed numerical methods.

Fast multivariate empirical cumulative distribution function with connection to kernel density estimation. (arXiv:2005.03246v1 [cs.DS])

Subdomain Adaptation with Manifolds Discrepancy Alignment. (arXiv:2005.03229v1 [cs.LG])

Detecting Latent Communities in Network Formation Models. (arXiv:2005.03226v1 [econ.EM])

Deep Learning Framework for Detecting Ground Deformation in the Built Environment using Satellite InSAR data. (arXiv:2005.03221v1 [cs.CV])

Efficient Characterization of Dynamic Response Variation Using Multi-Fidelity Data Fusion through Composite Neural Network. (arXiv:2005.03213v1 [stat.ML])

Convergence and inference for mixed Poisson random sums. (arXiv:2005.03187v1 [math.PR])

MAZE: Data-Free Model Stealing Attack Using Zeroth-Order Gradient Estimation. (arXiv:2005.03161v1 [stat.ML])

On the Optimality of Randomization in Experimental Design: How to Randomize for Minimax Variance and Design-Based Inference. (arXiv:2005.03151v1 [stat.ME])

Towards Frequency-Based Explanation for Robust CNN. (arXiv:2005.03141v1 [cs.LG])

Joint Multi-Dimensional Model for Global and Time-Series Annotations. (arXiv:2005.03117v1 [cs.LG])

Entries open for State Library’s $20,000 short film competition

Entries now open for the 2020 National Biography Award

Entries open for $40,000 award for female scriptwriters

Turn your ‘iso’ moments into history

Add your entry to the great pandemic diary of 2020

mgm: Estimating Time-Varying Mixed Graphical Models in High-Dimensional Data

lslx: Semi-Confirmatory Structural Equation Modeling via Penalized Likelihood

Object-Oriented Software for Functional Data

Anxiety and compassion: emotions and the surgical encounter in early 19th-century Britain

Plague in Italy and Europe during the 17th century

Broadcasting Health and Disease conference

Smell and medical efficacy in 18th-century England

Close encounters: a manuscripts workshop

Wyllie's treatment of epilepsy : principles and practice

Wood microbiology : decay and its prevention

Wine science : principles and applications

Water hyacinth : a potential lignocellulosic biomass for bioethanol

Urban landscape entomology

Tumor microenvironments in organs : from the brain to the skin.

Tumor microenvironment : hematopoietic cells.

Tumor microenvironment : signaling pathways.

Tumor microenvironment : the main driver of metabolic adaptation

Trusted computing and information security : 13th Chinese conference, CTCIS 2019, Shanghai, China, October 24-27, 2019

Trends in biomedical research

Treatment of skin diseases : a practical guide

Translational neuroscience of speech and language disorders

Transgender and gender nonconforming health and aging

Tissue engineering : principles, protocols, and practical exercises

The unedited : a novel about genome and identity

The tobacco plant genome

The science of grapevines

The root canal anatomy in permanent dentition

The mungbean genome

The lupin genome

The interaction of food industry and environment

The evolution of feathers : from their origin to the present

The duckweed genomes

The citrus genome

The bitter gourd genome

The Scientific basis of oral health education

The finish line: Attachment of Signs

The Finish Line: Drainage Efficiency

The Finish Line: Eco-Friendliness of EIFS

The Finish Line: Adhesives vs. Mechanical Fasteners

The Finish Line: A (Faux) Monument for the Ages

Green Globes vs. LEED

Cloaked in Green?

Building Product Transparency— Be Careful What You Ask For

An Energy Label for Buildings

A Green Screw?

Benefits of the Variable Refrigerant Flow

Green Advocacy vs. Informed Consent

Green Building Mistakes

The Greenest Low Slope Roofing Solution

ANSI Green Globes 2015

Subscribe To Our Newsletter