Latest ici news

ici

Methadone substitution therapy : policies and practices / edited by Hamid Ghodse, Carmel Clancy, Adenekan Oyefeso.

By search.wellcomelibrary.org

London : European Collaborating Centres in Addiction Studies, 1998.

Full Article

ici

Recovery of simultaneous low rank and two-way sparse coefficient matrices, a nonconvex approach

By projecteuclid.org
Published On :: Tue, 05 May 2020 22:00 EDT

Ming Yu, Varun Gupta, Mladen Kolar.

Source: Electronic Journal of Statistics, Volume 14, Number 1, 413--457.

Abstract:
We study the problem of recovery of matrices that are simultaneously low rank and row and/or column sparse. Such matrices appear in recent applications in cognitive neuroscience, imaging, computer vision, macroeconomics, and genetics. We propose a GDT (Gradient Descent with hard Thresholding) algorithm to efficiently recover matrices with such structure, by minimizing a bi-convex function over a nonconvex set of constraints. We show linear convergence of the iterates obtained by GDT to a region within statistical error of an optimal solution. As an application of our method, we consider multi-task learning problems and show that the statistical error rate obtained by GDT is near optimal compared to the minimax rate. Experiments demonstrate competitive performance and much faster running speed compared to existing methods, on both simulations and real data sets.
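To make the "hard thresholding" ingredient concrete, here is a minimal sketch of a row-wise hard-thresholding operator, the kind of projection that a gradient-descent-with-thresholding scheme would apply after each gradient step to enforce row sparsity. The function name `hard_threshold_rows` and the sparsity level `s` are illustrative assumptions; this is not the authors' GDT implementation, which also involves a low-rank factorization and the gradient step itself.

```python
import math


def hard_threshold_rows(matrix, s):
    """Keep the s rows with the largest Euclidean norm; zero out the rest.

    `matrix` is a list of rows (lists of floats). Both the name and the
    interface are hypothetical, chosen only for this illustration.
    """
    norms = [math.sqrt(sum(x * x for x in row)) for row in matrix]
    # Indices of the s largest-norm rows (ties broken by the lower index).
    order = sorted(range(len(matrix)), key=lambda i: (-norms[i], i))
    keep = set(order[:s])
    return [
        list(row) if i in keep else [0.0] * len(row)
        for i, row in enumerate(matrix)
    ]


if __name__ == "__main__":
    m = [[1.0, 0.0], [3.0, 4.0], [0.0, 0.5]]
    print(hard_threshold_rows(m, 1))
```

Column sparsity can be handled the same way by applying the operator to the transpose.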

Full Article

ici

Estimation of linear projections of non-sparse coefficients in high-dimensional regression

By projecteuclid.org
Published On :: Mon, 27 Apr 2020 22:02 EDT

David Azriel, Armin Schwartzman.

Source: Electronic Journal of Statistics, Volume 14, Number 1, 174--206.

Abstract:
In this work we study estimation of signals when the number of parameters is much larger than the number of observations. A large body of literature assumes for this kind of problem a sparse structure where most of the parameters are zero or close to zero. When this assumption does not hold, one can focus on low-dimensional functions of the parameter vector. In this work we study one-dimensional linear projections. Specifically, in the context of high-dimensional linear regression, the parameter of interest is $\boldsymbol{\beta}$ and we study estimation of $\mathbf{a}^{T}\boldsymbol{\beta}$. We show that $\mathbf{a}^{T}\hat{\boldsymbol{\beta}}$, where $\hat{\boldsymbol{\beta}}$ is the least squares estimator, using pseudo-inverse when $p>n$, is minimax and admissible. Thus, for linear projections no regularization or shrinkage is needed. This estimator is easy to analyze and confidence intervals can be constructed. We study a high-dimensional dataset from brain imaging where it is shown that the signal is weak, non-sparse and significantly different from zero.

Full Article

ici

Efficient estimation in expectile regression using envelope models

By projecteuclid.org
Published On :: Thu, 23 Apr 2020 22:01 EDT

Tuo Chen, Zhihua Su, Yi Yang, Shanshan Ding.

Source: Electronic Journal of Statistics, Volume 14, Number 1, 143--173.

Abstract:
As a generalization of the classical linear regression, expectile regression (ER) explores the relationship between the conditional expectile of a response variable and a set of predictor variables. ER with respect to different expectile levels can provide a comprehensive picture of the conditional distribution of the response variable given the predictors. We adopt an efficient estimation method called the envelope model ([8]) in ER, and construct a novel envelope expectile regression (EER) model. Estimation of the EER parameters can be performed using the generalized method of moments (GMM). We establish the consistency and derive the asymptotic distribution of the EER estimators. In addition, we show that the EER estimators are asymptotically more efficient than the ER estimators. Numerical experiments and real data examples are provided to demonstrate the efficiency gains attained by EER compared to ER, and the efficiency gains can further lead to improvements in prediction.
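For readers unfamiliar with expectiles, the following sketch computes the sample expectile of a list of numbers, which corresponds to the intercept-only special case of expectile regression (no predictors). It uses the standard asymmetric least squares characterization, in which observations above the current estimate get weight tau and the others get weight 1 - tau. The function name and the fixed-point iteration are assumptions made for illustration; the sketch does not implement the envelope model or the GMM estimation described in the abstract.

```python
def sample_expectile(values, tau, tol=1e-12, max_iter=1000):
    """Return the tau-expectile of `values`, for 0 < tau < 1.

    The expectile m minimizes sum_i |tau - 1(y_i <= m)| * (y_i - m)^2.
    We iterate the weighted-mean fixed point implied by the first-order
    condition. The name and interface are hypothetical.
    """
    if not 0.0 < tau < 1.0:
        raise ValueError("tau must lie strictly between 0 and 1")
    m = sum(values) / len(values)  # the mean is the 0.5-expectile
    for _ in range(max_iter):
        weights = [tau if y > m else 1.0 - tau for y in values]
        new_m = sum(w * y for w, y in zip(weights, values)) / sum(weights)
        if abs(new_m - m) < tol:
            return new_m
        m = new_m
    return m


if __name__ == "__main__":
    data = [1.0, 2.0, 3.0, 4.0]
    for level in (0.1, 0.5, 0.9):
        print(level, sample_expectile(data, level))
```

Evaluating several levels, as in the loop above, is what gives the "comprehensive picture" of a distribution that the abstract refers to: tau = 0.5 recovers the mean, while levels near 0 or 1 track the tails.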

Full Article

ici

Online Sufficient Dimension Reduction Through Sliced Inverse Regression

Published On :: 2020

Sliced inverse regression is an effective paradigm that achieves the goal of dimension reduction through replacing high dimensional covariates with a small number of linear combinations. It does not impose parametric assumptions on the dependence structure. More importantly, such a reduction of dimension is sufficient in that it does not cause loss of information. In this paper, we adapt the stationary sliced inverse regression to cope with rapidly changing environments. We propose to implement sliced inverse regression in an online fashion. This online learner consists of two steps. In the first step we construct an online estimate for the kernel matrix; in the second step we propose two online algorithms, one motivated by the perturbation method and the other derived from gradient descent optimization, to perform online singular value decomposition. The theoretical properties of this online learner are established. We demonstrate the numerical performance of this online learner through simulations and real world applications. All numerical studies confirm that this online learner performs as well as the batch learner.

Full Article

ici

The Maximum Separation Subspace in Sufficient Dimension Reduction with Categorical Response

Published On :: 2020

Sufficient dimension reduction (SDR) is a very useful concept for exploratory analysis and data visualization in regression, especially when the number of covariates is large. Many SDR methods have been proposed for regression with a continuous response, where the central subspace (CS) is the target of estimation. Various conditions, such as the linearity condition and the constant covariance condition, are imposed so that these methods can estimate at least a portion of the CS. In this paper we study SDR for regression and discriminant analysis with categorical response. Motivated by the exploratory analysis and data visualization aspects of SDR, we propose a new geometric framework to reformulate the SDR problem in terms of manifold optimization and introduce a new concept called Maximum Separation Subspace (MASES). The MASES naturally preserves the “sufficiency” in SDR without imposing additional conditions on the predictor distribution, and directly inspires a semi-parametric estimator. Numerical studies show MASES exhibits superior performance as compared with competing SDR methods in specific settings.

Full Article

ici

Graph-Dependent Implicit Regularisation for Distributed Stochastic Subgradient Descent

Published On :: 2020

We propose graph-dependent implicit regularisation strategies for synchronised distributed stochastic subgradient descent (Distributed SGD) for convex problems in multi-agent learning. Under the standard assumptions of convexity, Lipschitz continuity, and smoothness, we establish statistical learning rates that retain, up to logarithmic terms, single-machine serial statistical guarantees through implicit regularisation (step size tuning and early stopping) with appropriate dependence on the graph topology. Our approach avoids the need for explicit regularisation in decentralised learning problems, such as adding constraints to the empirical risk minimisation rule. Particularly for distributed methods, the use of implicit regularisation allows the algorithm to remain simple, without projections or dual methods. To prove our results, we establish graph-independent generalisation bounds for Distributed SGD that match the single-machine serial SGD setting (using algorithmic stability), and we establish graph-dependent optimisation bounds that are of independent interest. We present numerical experiments to show that the qualitative nature of the upper bounds we derive can be representative of real behaviours.

Full Article

ici

On Stationary-Point Hitting Time and Ergodicity of Stochastic Gradient Langevin Dynamics

Published On :: 2020

Stochastic gradient Langevin dynamics (SGLD) is a fundamental algorithm in stochastic optimization. Recent work by Zhang et al. (2017) presents an analysis for the hitting time of SGLD for the first and second order stationary points. The proof in Zhang et al. (2017) is a two-stage procedure that bounds the Cheeger constant, which is rather complicated and leads to loose bounds. In this paper, using intuitions from stochastic differential equations, we provide a direct analysis for the hitting times of SGLD to the first and second order stationary points. Our analysis is straightforward. It only relies on basic linear algebra and probability theory tools. Our direct analysis also leads to tighter bounds compared to Zhang et al. (2017) and shows the explicit dependence of the hitting time on different factors, including dimensionality, smoothness, noise strength, and step size effects. Under suitable conditions, we show that the hitting time of SGLD to first-order stationary points can be dimension-independent. Moreover, we apply our analysis to study several important online estimation problems in machine learning, including linear regression, matrix factorization, and online PCA.

Full Article

ici

GADMM: Fast and Communication Efficient Framework for Distributed Machine Learning

Published On :: 2020

When the data is distributed across multiple servers, lowering the communication cost between the servers (or workers) while solving the distributed learning problem is an important challenge and is the focus of this paper. In particular, we propose a fast and communication-efficient decentralized framework to solve the distributed machine learning (DML) problem. The proposed algorithm, Group Alternating Direction Method of Multipliers (GADMM), is based on the Alternating Direction Method of Multipliers (ADMM) framework. The key novelty in GADMM is that it solves the problem in a decentralized topology where at most half of the workers are competing for the limited communication resources at any given time. Moreover, each worker exchanges the locally trained model only with two neighboring workers, thereby training a global model with a lower amount of communication overhead in each exchange. We prove that GADMM converges to the optimal solution for convex loss functions, and numerically show that it converges faster and is more communication-efficient than state-of-the-art communication-efficient algorithms such as the Lazily Aggregated Gradient (LAG) and dual averaging, in linear and logistic regression tasks on synthetic and real datasets. Furthermore, we propose Dynamic GADMM (D-GADMM), a variant of GADMM, and prove its convergence under the time-varying network topology of the workers.

Full Article

ici

TIGER: using artificial intelligence to discover our collections

By feedproxy.google.com
Published On :: Tue, 10 Mar 2020 22:01:20 +0000

The State Library of NSW has almost 4 million digital files in its collection.

Full Article

ici

Option pricing with bivariate risk-neutral density via copula and heteroscedastic model: A Bayesian approach

By projecteuclid.org
Published On :: Mon, 26 Aug 2019 04:00 EDT

Lucas Pereira Lopes, Vicente Garibay Cancho, Francisco Louzada.

Source: Brazilian Journal of Probability and Statistics, Volume 33, Number 4, 801--825.

Abstract:
Multivariate options are adequate tools for multi-asset risk management. The pricing models derived from the pioneering Black and Scholes method under the multivariate case consider that the underlying asset prices follow a geometric Brownian motion. However, the construction of such methods imposes some unrealistic constraints on the process of fair option calculation, such as constant volatility over the maturity time and linear correlation between the assets. Therefore, this paper aims to price and analyze the fair price behavior of the call-on-max (bivariate) option considering marginal heteroscedastic models with dependence structure modeled via copulas. Concerning inference, we adopt a Bayesian perspective and computationally intensive methods based on Markov chain Monte Carlo (MCMC) simulation. A simulation study examines the bias and the root mean squared errors of the posterior means for the parameters. Real stock prices of Brazilian banks illustrate the approach. For the proposed method, we verify the effects of strike and dependence structure on the fair price of the option. The results show that the prices obtained by our heteroscedastic model approach and copulas differ substantially from the prices obtained by the model derived from Black and Scholes. Empirical results are presented to argue the advantages of our strategy.

Full Article

ici

A note on monotonicity of spatial epidemic models

By projecteuclid.org
Published On :: Mon, 10 Jun 2019 04:04 EDT

Achillefs Tzioufas.

Source: Brazilian Journal of Probability and Statistics, Volume 33, Number 3, 674--684.

Abstract:
The epidemic process on a graph is considered for which infectious contacts occur at a rate that depends on whether a susceptible is infected for the first time or not. We show that the Vasershtein coupling extends if and only if secondary infections occur at a rate greater than that of initial ones. Nonetheless we show that, with respect to the probability of occurrence of an infinite epidemic, the said proviso may be dropped regarding the totally asymmetric process in one dimension, thus settling in the affirmative this special case of the conjecture for arbitrary graphs due to [Ann. Appl. Probab. 13 (2003) 669–690].

Full Article

ici

Stochastic monotonicity from an Eulerian viewpoint

By projecteuclid.org
Published On :: Mon, 10 Jun 2019 04:04 EDT

Davide Gabrielli, Ida Germana Minelli.

Source: Brazilian Journal of Probability and Statistics, Volume 33, Number 3, 558--585.

Abstract:
Stochastic monotonicity is a well-known partial order relation between probability measures defined on the same partially ordered set. Strassen's theorem establishes equivalence between stochastic monotonicity and the existence of a coupling compatible with the partial order. We consider the case of a countable set and introduce the class of finitely decomposable flows on a directed acyclic graph associated to the partial order. We show that a probability measure stochastically dominates another probability measure if and only if there exists a finitely decomposable flow having divergence given by the difference of the two measures. We illustrate the result with some examples.

Full Article

ici

Fractional backward stochastic variational inequalities with non-Lipschitz coefficient

By projecteuclid.org
Published On :: Mon, 10 Jun 2019 04:04 EDT

Katarzyna Jańczak-Borkowska.

Source: Brazilian Journal of Probability and Statistics, Volume 33, Number 3, 480--497.

Abstract:
We prove the existence and uniqueness of the solution of backward stochastic variational inequalities with respect to fractional Brownian motion and with non-Lipschitz coefficient. We assume that $H>1/2$.

Full Article

ici

Necessary and sufficient conditions for the convergence of the consistent maximal displacement of the branching random walk

By projecteuclid.org
Published On :: Mon, 04 Mar 2019 04:00 EST

Bastien Mallein.

Source: Brazilian Journal of Probability and Statistics, Volume 33, Number 2, 356--373.

Abstract:
Consider a supercritical branching random walk on the real line. The consistent maximal displacement is the smallest of the distances between the trajectories followed by individuals at the $n$th generation and the boundary of the process. Fang and Zeitouni, and Faraud, Hu and Shi proved that under some integrability conditions, the consistent maximal displacement grows almost surely at rate $\lambda^{*}n^{1/3}$ for some explicit constant $\lambda^{*}$. We obtain here a necessary and sufficient condition for this asymptotic behaviour to hold.

Full Article

ici

Alternating Maximization: Unifying Framework for 8 Sparse PCA Formulations and Efficient Parallel Codes. (arXiv:1212.4137v2 [stat.ML] UPDATED)

By arxiv.org

Given a multivariate data set, sparse principal component analysis (SPCA) aims to extract several linear combinations of the variables that together explain the variance in the data as much as possible, while controlling the number of nonzero loadings in these combinations. In this paper we consider 8 different optimization formulations for computing a single sparse loading vector; these are obtained by combining the following factors: we employ two norms for measuring variance (L2, L1) and two sparsity-inducing norms (L0, L1), which are used in two different ways (constraint, penalty). Three of our formulations, notably the one with L0 constraint and L1 variance, have not been considered in the literature. We give a unifying reformulation which we propose to solve via a natural alternating maximization (AM) method. We show that the AM method is nontrivially equivalent to GPower (Journée et al.; JMLR 11:517--553, 2010) for all our formulations. Besides this, we provide 24 efficient parallel SPCA implementations: 3 codes (multi-core, GPU and cluster) for each of the 8 problems. Parallelism in the methods is aimed at i) speeding up computations (our GPU code can be 100 times faster than an efficient serial code written in C++), ii) obtaining solutions explaining more variance and iii) dealing with big data problems (our cluster code is able to solve a 357 GB problem in about a minute).

Full Article

ici

Efficient Characterization of Dynamic Response Variation Using Multi-Fidelity Data Fusion through Composite Neural Network. (arXiv:2005.03213v1 [stat.ML])

By arxiv.org

Uncertainties in a structure are inevitable and generally lead to variation in dynamic response predictions. For a complex structure, brute force Monte Carlo simulation for response variation analysis is infeasible since one single run may already be computationally costly. Data driven meta-modeling approaches have thus been explored to facilitate efficient emulation and statistical inference. The performance of a meta-model hinges upon both the quality and quantity of the training dataset. In actual practice, however, high-fidelity data acquired from high-dimensional finite element simulation or experiment are generally scarce, which poses a significant challenge to meta-model establishment. In this research, we take advantage of the multi-level response prediction opportunity in structural dynamic analysis, i.e., acquiring rapidly a large amount of low-fidelity data from reduced-order modeling, and acquiring accurately a small amount of high-fidelity data from full-scale finite element analysis. Specifically, we formulate a composite neural network fusion approach that can fully utilize the multi-level, heterogeneous datasets obtained. It implicitly identifies the correlation of the low- and high-fidelity datasets, which yields improved accuracy when compared with the state-of-the-art. Comprehensive investigations using frequency response variation characterization as a case example are carried out to demonstrate the performance.

Full Article

ici

History of Pre-Modern Medicine Seminar Series, Spring 2018

By blog.wellcomelibrary.org
Published On :: Fri, 05 Jan 2018 12:26:55 +0000

The History of Pre-Modern Medicine seminar series returns this month. The 2017–18 series – organised by a group of historians of medicine based at London universities and hosted by the Wellcome Library – will conclude with four seminars. The series…

Full Article

Early Medicine
Events and Visits
China
Early Sex and Reproduction
plague
smell

ici

Rehabilitation medicine for elderly patients

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9783319574066

Full Article

ici

Psychoactive medicinal plants and fungal neurotoxins

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Singh Saroya, Amritpal, author

Callnumber: Online

ISBN: 9789811523137 (electronic bk.)

Full Article

ici

Pathogenesis of periodontal diseases : biological concepts for clinicians

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9783319537375

Full Article

ici

Nanobiomaterial engineering : concepts and their applications in biomedicine and diagnostics

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9789813298408 (electronic bk.)

Full Article

ici

NanoBioMedicine

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9789813298989 (electronic bk.)

Full Article

ici

Molecular aspects of plant beneficial microbes in agriculture

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9780128184707 (electronic bk.)

Full Article

ici

Machine learning in medicine : a complete overview

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Cleophas, Ton J. M., author

Callnumber: Online

ISBN: 9783030339708 (electronic bk.)

Full Article

ici

Irwin and Rippe's intensive care medicine

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9781496306081 hardcover

Full Article

ici

Geriatric Medicine : a Problem-Based Approach

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9789811032530

Full Article

ici

General medicine and surgery for dental practitioners

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Greenwood, M. (Mark), author.

Callnumber: Online

ISBN: 9783319977379 (electronic book)

Full Article

ici

Ethnoveterinary medicine : present and future concepts

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9783030322700 (electronic bk.)

Full Article

ici

Ecophysiology of pesticides : interface between pesticide chemistry and plant physiology

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Parween, Talat, author.

Callnumber: Online

ISBN: 9780128176146

Full Article

ici

DNA beyond genes : from data storage and computing to nanobots, nanomedicine, and nanoelectronics

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Demidov, Vadim V., author

Callnumber: Online

ISBN: 9783030364342 (electronic bk.)

Full Article

ici

Cell biology and translational medicine.

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9783030378455 (electronic bk.)

Full Article

ici

Binary code fingerprinting for cybersecurity : application to malicious code fingerprinting

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Alrabaee, Saed, author

Callnumber: Online

ISBN: 9783030342388 (electronic bk.)

Full Article

ici

Efficient estimation of linear functionals of principal components

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Vladimir Koltchinskii, Matthias Löffler, Richard Nickl.

Source: The Annals of Statistics, Volume 48, Number 1, 464--490.

Abstract:
We study principal component analysis (PCA) for mean zero i.i.d. Gaussian observations $X_{1},\dots,X_{n}$ in a separable Hilbert space $\mathbb{H}$ with unknown covariance operator $\Sigma$. The complexity of the problem is characterized by its effective rank $\mathbf{r}(\Sigma):=\frac{\operatorname{tr}(\Sigma)}{\|\Sigma\|}$, where $\operatorname{tr}(\Sigma)$ denotes the trace of $\Sigma$ and $\|\Sigma\|$ denotes its operator norm. We develop a method of bias reduction in the problem of estimation of linear functionals of eigenvectors of $\Sigma$. Under the assumption that $\mathbf{r}(\Sigma)=o(n)$, we establish the asymptotic normality and asymptotic properties of the risk of the resulting estimators and prove matching minimax lower bounds, showing their semiparametric optimality.
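As a small illustration of the effective rank defined in the abstract, the sketch below evaluates it from the eigenvalues of a covariance operator: for a positive semidefinite operator the trace is the sum of the eigenvalues and the operator norm is the largest one. The function name `effective_rank` and the eigenvalue-list interface are assumptions made for this example; nothing here reproduces the paper's bias-reduction method.

```python
def effective_rank(eigenvalues):
    """Return trace / operator norm for a covariance with these eigenvalues.

    For a positive semidefinite operator the trace is the sum of its
    eigenvalues and the operator norm is the largest eigenvalue.
    The name and interface are hypothetical.
    """
    if not eigenvalues:
        raise ValueError("need at least one eigenvalue")
    if min(eigenvalues) < 0 or max(eigenvalues) <= 0:
        raise ValueError("need nonnegative eigenvalues with a positive maximum")
    return sum(eigenvalues) / max(eigenvalues)


if __name__ == "__main__":
    print(effective_rank([1.0, 1.0, 1.0, 1.0]))  # isotropic in 4 dimensions
    print(effective_rank([4.0, 1.0, 1.0]))       # one dominant direction
```

The effective rank equals the dimension for an isotropic covariance and shrinks toward 1 as a single direction dominates, which is why it can stay small even in an infinite-dimensional Hilbert space.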

Full Article

ici

The multi-armed bandit problem: An efficient nonparametric solution

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Hock Peng Chan.

Source: The Annals of Statistics, Volume 48, Number 1, 346--373.

Abstract:
Lai and Robbins (Adv. in Appl. Math. 6 (1985) 4–22) and Lai (Ann. Statist. 15 (1987) 1091–1114) provided efficient parametric solutions to the multi-armed bandit problem, showing that arm allocation via upper confidence bounds (UCB) achieves minimum regret. These bounds are constructed from the Kullback–Leibler information of the reward distributions, estimated from specified parametric families. In recent years, there has been renewed interest in the multi-armed bandit problem due to new applications in machine learning algorithms and data analytics. Nonparametric arm allocation procedures like $\epsilon$-greedy, Boltzmann exploration and BESA were studied, and modified versions of the UCB procedure were also analyzed under nonparametric settings. However, unlike UCB these nonparametric procedures are not efficient under general parametric settings. In this paper, we propose efficient nonparametric procedures.
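To illustrate what arm allocation via upper confidence bounds looks like in practice, here is a minimal simulation using the well-known UCB1 index (empirical mean plus sqrt(2 log t / n_i)) on Bernoulli arms. This is a swapped-in textbook variant chosen for brevity: it is neither the Kullback–Leibler-based bound of Lai and Robbins nor the nonparametric procedure proposed in the paper, and the function name, arm means and seed are all illustrative assumptions.

```python
import math
import random


def ucb1(arm_means, horizon, seed=0):
    """Simulate UCB1 on Bernoulli arms; return how often each arm was pulled.

    `arm_means` are the true success probabilities, unknown to the
    algorithm and used only to draw rewards. Hypothetical interface.
    """
    rng = random.Random(seed)
    k = len(arm_means)
    counts = [0] * k    # number of pulls per arm
    totals = [0.0] * k  # cumulative reward per arm
    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1  # initialization: pull every arm once
        else:
            # Pick the arm with the largest upper confidence bound.
            arm = max(
                range(k),
                key=lambda i: totals[i] / counts[i]
                + math.sqrt(2.0 * math.log(t) / counts[i]),
            )
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        totals[arm] += reward
    return counts


if __name__ == "__main__":
    print(ucb1([0.1, 0.9], horizon=2000))
```

The exploration bonus shrinks as an arm is pulled more often, so pulls concentrate on the better arm while inferior arms are still sampled occasionally.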

Full Article

ici

Statistical inference for autoregressive models under heteroscedasticity of unknown form

By projecteuclid.org
Published On :: Wed, 30 Oct 2019 22:03 EDT

Ke Zhu.

Source: The Annals of Statistics, Volume 47, Number 6, 3185--3215.

Abstract:
This paper provides an entire inference procedure for the autoregressive model under (conditional) heteroscedasticity of unknown form with a finite variance. We first establish the asymptotic normality of the weighted least absolute deviations estimator (LADE) for the model. Second, we develop the random weighting (RW) method to estimate its asymptotic covariance matrix, leading to the implementation of the Wald test. Third, we construct a portmanteau test for model checking, and use the RW method to obtain its critical values. As a special weighted LADE, the feasible adaptive LADE (ALADE) is proposed and proved to have the same efficiency as its infeasible counterpart. The importance of our entire methodology based on the feasible ALADE is illustrated by simulation results and the real data analysis on three U.S. economic data sets.

Full Article

ici

Adaptive estimation of the rank of the coefficient matrix in high-dimensional multivariate response regression models

By projecteuclid.org
Published On :: Wed, 30 Oct 2019 22:03 EDT

Xin Bing, Marten H. Wegkamp.

Source: The Annals of Statistics, Volume 47, Number 6, 3157--3184.

Abstract:
We consider the multivariate response regression problem with a regression coefficient matrix of low, unknown rank. In this setting, we analyze a new criterion for selecting the optimal reduced rank. This criterion differs notably from the one proposed in Bunea, She and Wegkamp (Ann. Statist. 39 (2011) 1282–1309) in that it does not require estimation of the unknown variance of the noise, nor does it depend on a delicate choice of a tuning parameter. We develop an iterative, fully data-driven procedure that adapts to the optimal signal-to-noise ratio. This procedure finds the true rank in a few steps with overwhelming probability. At each step, our estimate increases, while at the same time it does not exceed the true rank. Our finite sample results hold for any sample size and any dimension, even when the number of responses and of covariates grow much faster than the number of observations. We perform an extensive simulation study that confirms our theoretical findings. The new method performs better and is more stable than the procedure of Bunea, She and Wegkamp (Ann. Statist. 39 (2011) 1282–1309) in both low- and high-dimensional settings.

Full Article

ici

Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem

By projecteuclid.org
Published On :: Thu, 05 Aug 2010 15:41 EDT

James G. Scott, James O. Berger

Source: Ann. Statist., Volume 38, Number 5, 2587--2619.

Abstract:
This paper studies the multiplicity-correction effect of standard Bayesian variable-selection priors in linear regression. Our first goal is to clarify when, and how, multiplicity correction happens automatically in Bayesian analysis, and to distinguish this correction from the Bayesian Ockham’s-razor effect. Our second goal is to contrast empirical-Bayes and fully Bayesian approaches to variable selection through examples, theoretical results and simulations. Considerable differences between the two approaches are found. In particular, we prove a theorem that characterizes a surprising asymptotic discrepancy between fully Bayes and empirical Bayes. This discrepancy arises from a different source than the failure to account for hyperparameter uncertainty in the empirical-Bayes estimate. Indeed, even at the extreme, when the empirical-Bayes estimate converges asymptotically to the true variable-inclusion probability, the potential for a serious difference remains.

Full Article

ici

Feature selection for generalized varying coefficient mixed-effect models with application to obesity GWAS

By projecteuclid.org
Published On :: Wed, 15 Apr 2020 22:05 EDT

Wanghuan Chu, Runze Li, Jingyuan Liu, Matthew Reimherr.

Source: The Annals of Applied Statistics, Volume 14, Number 1, 276--298.

Abstract:
Motivated by an empirical analysis of data from a genome-wide association study on obesity, measured by the body mass index (BMI), we propose a two-step gene-detection procedure for generalized varying coefficient mixed-effects models with ultrahigh dimensional covariates. The proposed procedure selects significant single nucleotide polymorphisms (SNPs) impacting the mean BMI trend, some of which have already been biologically proven to be “fat genes.” The method also discovers SNPs that significantly influence the age-dependent variability of BMI. The proposed procedure takes into account individual variations of genetic effects and can also be directly applied to longitudinal data with continuous, binary or count responses. We employ Monte Carlo simulation studies to assess the performance of the proposed method and further carry out causal inference for the selected SNPs.

Full Article

ici

Efficient real-time monitoring of an emerging influenza pandemic: How feasible?

By projecteuclid.org
Published On :: Wed, 15 Apr 2020 22:05 EDT

Paul J. Birrell, Lorenz Wernisch, Brian D. M. Tom, Leonhard Held, Gareth O. Roberts, Richard G. Pebody, Daniela De Angelis.

Source: The Annals of Applied Statistics, Volume 14, Number 1, 74--93.

Abstract:
A prompt public health response to a new epidemic relies on the ability to monitor and predict its evolution in real time as data accumulate. The 2009 A/H1N1 outbreak in the UK revealed pandemic data as noisy, contaminated, potentially biased and originating from multiple sources. This seriously challenges the capacity for real-time monitoring. Here, we assess the feasibility of real-time inference based on such data by constructing an analytic tool combining an age-stratified SEIR transmission model with various observation models describing the data generation mechanisms. As batches of data become available, a sequential Monte Carlo (SMC) algorithm is developed to synthesise multiple imperfect data streams, iterate epidemic inferences and assess model adequacy amidst a rapidly evolving epidemic environment, substantially reducing computation time in comparison to standard MCMC, to ensure timely delivery of real-time epidemic assessments. In application to simulated data designed to mimic the 2009 A/H1N1 epidemic, SMC is shown to have additional benefits in terms of assessing predictive performance and coping with parameter nonidentifiability.

Full Article

ici

Integrative survival analysis with uncertain event times in application to a suicide risk study

By projecteuclid.org
Published On :: Wed, 15 Apr 2020 22:05 EDT

Wenjie Wang, Robert Aseltine, Kun Chen, Jun Yan.

Source: The Annals of Applied Statistics, Volume 14, Number 1, 51--73.

Abstract:
The concept of integrating data from disparate sources to accelerate scientific discovery has generated tremendous excitement in many fields. The potential benefits from data integration, however, may be compromised by the uncertainty due to incomplete/imperfect record linkage. Motivated by a suicide risk study, we propose an approach for analyzing survival data with uncertain event times arising from data integration. Specifically, in our problem deaths identified from the hospital discharge records together with reported suicidal deaths determined by the Office of Medical Examiner may still not include all the death events of patients, and the missing deaths can be recovered from a complete database of death records. Since the hospital discharge data can only be linked to the death record data by matching basic patient characteristics, a patient with a censored death time from the first dataset could be linked to multiple potential event records in the second dataset. We develop an integrative Cox proportional hazards regression in which the uncertainty in the matched event times is modeled probabilistically. The estimation procedure combines the ideas of profile likelihood and the expectation conditional maximization algorithm (ECM). Simulation studies demonstrate that under realistic settings of imperfect data linkage the proposed method outperforms several competing approaches including multiple imputation. A marginal screening analysis using the proposed integrative Cox model is performed to identify risk factors associated with death following suicide-related hospitalization in Connecticut. The identified diagnostics codes are consistent with existing literature and provide several new insights on suicide risk, prediction and prevention.

Full Article

ici

RCRnorm: An integrated system of random-coefficient hierarchical regression models for normalizing NanoString nCounter data

By projecteuclid.org
Published On :: Wed, 16 Oct 2019 22:03 EDT

Gaoxiang Jia, Xinlei Wang, Qiwei Li, Wei Lu, Ximing Tang, Ignacio Wistuba, Yang Xie.

Source: The Annals of Applied Statistics, Volume 13, Number 3, 1617--1647.

Abstract:
Formalin-fixed paraffin-embedded (FFPE) samples have great potential for biomarker discovery, retrospective studies and diagnosis or prognosis of diseases. Their application, however, is hindered by the unsatisfactory performance of traditional gene expression profiling techniques on damaged RNAs. The NanoString nCounter platform is well suited for profiling FFPE samples and measures gene expression with high sensitivity, which may greatly facilitate realization of the scientific and clinical value of FFPE samples. However, methodological development for normalization, a critical step when analyzing this type of data, is far behind. Existing methods designed for the platform use information from different types of internal controls separately and rely on an overly simplified assumption that expression of housekeeping genes is constant across samples for global scaling. Thus, these methods are not optimized for the nCounter system, not to mention that they were not developed for FFPE samples. We construct an integrated system of random-coefficient hierarchical regression models to capture main patterns and characteristics observed from NanoString data of FFPE samples and develop a Bayesian approach to estimate parameters and normalize gene expression across samples. Our method, labeled RCRnorm, incorporates information from all aspects of the experimental design and simultaneously removes biases from various sources. It eliminates the unrealistic assumption on housekeeping genes and offers great interpretability. Furthermore, it is applicable to freshly frozen or similar samples that can be generally viewed as a reduced case of FFPE samples. Simulation and applications showed the superior performance of RCRnorm.

Full Article

ici

Modeling seasonality and serial dependence of electricity price curves with warping functional autoregressive dynamics

By projecteuclid.org
Published On :: Wed, 16 Oct 2019 22:03 EDT

Ying Chen, J. S. Marron, Jiejie Zhang.

Source: The Annals of Applied Statistics, Volume 13, Number 3, 1590--1616.

Abstract:
Electricity prices are high dimensional, serially dependent and have seasonal variations. We propose a Warping Functional AutoRegressive (WFAR) model that simultaneously accounts for the cross time-dependence and seasonal variations of the large dimensional data. In particular, electricity price curves are obtained by smoothing over the $24$ discrete hourly prices on each day. In the functional domain, seasonal phase variations are separated from level amplitude changes in a warping process with the Fisher–Rao distance metric, and the aligned (season-adjusted) electricity price curves are modeled in the functional autoregression framework. In a real application, the WFAR model provides superior out-of-sample forecast accuracy in both a normal functioning market, Nord Pool, and an extreme situation, the California market. The forecast performance as well as the relative accuracy improvement are stable for different markets and different time periods.

Full Article

ici

Efficient estimation in single index models through smoothing splines

By projecteuclid.org
Published On :: Fri, 31 Jan 2020 04:06 EST

Arun K. Kuchibhotla, Rohit K. Patra.

Source: Bernoulli, Volume 26, Number 2, 1587--1618.

Abstract:
We consider estimation and inference in a single index regression model with an unknown but smooth link function. In contrast to the standard approach of using kernels or regression splines, we use smoothing splines to estimate the smooth link function. We develop a method to compute the penalized least squares estimators (PLSEs) of the parametric and the nonparametric components given independent and identically distributed (i.i.d.) data. We prove the consistency and find the rates of convergence of the estimators. We establish asymptotic normality under mild assumptions and prove asymptotic efficiency of the parametric component under homoscedastic errors. A finite sample simulation corroborates our asymptotic theory. We also analyze a car mileage data set and an ozone concentration data set. The identifiability and existence of the PLSEs are also investigated.

Full Article

ici

Stratonovich stochastic differential equation with irregular coefficients: Girsanov’s example revisited

By projecteuclid.org
Published On :: Fri, 31 Jan 2020 04:06 EST

Ilya Pavlyukevich, Georgiy Shevchenko.

Source: Bernoulli, Volume 26, Number 2, 1381--1409.

Abstract:
In this paper, we study the Stratonovich stochastic differential equation $\mathrm{d}X=|X|^{\alpha}\circ\mathrm{d}B$, $\alpha\in(-1,1)$, which has been introduced by Cherstvy et al. (New J. Phys. 15 (2013) 083039) in the context of analysis of anomalous diffusions in heterogeneous media. We determine its weak and strong solutions, which are homogeneous strong Markov processes spending zero time at $0$: for $\alpha\in(0,1)$, these solutions have the form \begin{equation*}X_{t}^{\theta}=((1-\alpha)B_{t}^{\theta})^{1/(1-\alpha)},\end{equation*} where $B^{\theta}$ is the $\theta$-skew Brownian motion driven by $B$ and starting at $\frac{1}{1-\alpha}(X_{0})^{1-\alpha}$, $\theta\in[-1,1]$, and $(x)^{\gamma}=|x|^{\gamma}\operatorname{sign}x$; for $\alpha\in(-1,0]$, only the case $\theta=0$ is possible. The central part of the paper consists in the proof of the existence of a quadratic covariation $[f(B^{\theta}),B]$ for a locally square integrable function $f$ and is based on the time-reversion technique for Markovian diffusions.
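
As a reading aid, the stated solution form can be checked heuristically in the simplest case $\theta=0$, where the skew Brownian motion is an ordinary Brownian motion. The sketch below applies the Stratonovich chain rule formally and away from $0$; the symbols $Y$ and $\gamma$ are shorthand introduced here, not notation from the abstract, and the behaviour at $0$ and the skew case $\theta\neq 0$ are exactly what the paper treats rigorously.

```latex
% Heuristic check for \theta = 0 (formal, valid away from Y = 0).
% Shorthand introduced here: \gamma = 1/(1-\alpha), Y_t = (1-\alpha) B_t.
\begin{align*}
X_t &= (Y_t)^{\gamma} = |Y_t|^{\gamma}\operatorname{sign} Y_t, \\
\mathrm{d}X_t &= \gamma\,|Y_t|^{\gamma-1}\circ\mathrm{d}Y_t
              = \gamma(1-\alpha)\,|Y_t|^{\gamma-1}\circ\mathrm{d}B_t
              = |Y_t|^{\alpha/(1-\alpha)}\circ\mathrm{d}B_t, \\
|X_t|^{\alpha} &= |Y_t|^{\gamma\alpha} = |Y_t|^{\alpha/(1-\alpha)} .
\end{align*}
% Hence dX = |X|^{\alpha} \circ dB. The starting value
% B_0 = \frac{1}{1-\alpha}(X_0)^{1-\alpha} gives Y_0 = (X_0)^{1-\alpha},
% so (Y_0)^{\gamma} recovers X_0.
```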

Full Article

ici

Degeneracy in sparse ERGMs with functions of degrees as sufficient statistics

By projecteuclid.org
Published On :: Fri, 31 Jan 2020 04:06 EST

Sumit Mukherjee.

Source: Bernoulli, Volume 26, Number 2, 1016--1043.

Abstract:
A sufficient criterion for “non-degeneracy” is given for Exponential Random Graph Models on sparse graphs with sufficient statistics which are functions of the degree sequence. This criterion explains why statistics such as alternating $k$-star are non-degenerate, whereas subgraph counts are degenerate. It is further shown that this criterion is “almost” tight. Existence of consistent estimates is then proved for non-degenerate Exponential Random Graph Models.

Full Article

ici

Coronavirus: Chinese official admits health system weaknesses

By news.yahoo.com
Published On :: Sat, 09 May 2020 11:02:40 -0400

China says it will improve public health systems after criticism of its early response to the virus.

Full Article

ici

Implicit Copulas from Bayesian Regularized Regression Smoothers

By projecteuclid.org
Published On :: Thu, 19 Dec 2019 22:10 EST

Nadja Klein, Michael Stanley Smith.

Source: Bayesian Analysis, Volume 14, Number 4, 1143--1171.

Abstract:
We show how to extract the implicit copula of a response vector from a Bayesian regularized regression smoother with Gaussian disturbances. The copula can be used to compare smoothers that employ different shrinkage priors and function bases. We illustrate with three popular choices of shrinkage priors—a pairwise prior, the horseshoe prior and a g prior augmented with a point mass as employed for Bayesian variable selection—and both univariate and multivariate function bases. The implicit copulas are high-dimensional, have flexible dependence structures that are far from that of a Gaussian copula, and are unavailable in closed form. However, we show how they can be evaluated by first constructing a Gaussian copula conditional on the regularization parameters, and then integrating over these. Combined with non-parametric margins the regularized smoothers can be used to model the distribution of non-Gaussian univariate responses conditional on the covariates. Efficient Markov chain Monte Carlo schemes for evaluating the copula are given for this case. Using both simulated and real data, we show how such copula smoothing models can improve the quality of resulting function estimates and predictive distributions.

Full Article

ici

Model Criticism in Latent Space

By projecteuclid.org
Published On :: Tue, 11 Jun 2019 04:00 EDT

Sohan Seth, Iain Murray, Christopher K. I. Williams.

Source: Bayesian Analysis, Volume 14, Number 3, 703--725.

Abstract:
Model criticism is usually carried out by assessing whether replicated data generated under the fitted model look similar to the observed data; see, e.g., Gelman, Carlin, Stern, and Rubin (2004, p. 165). This paper presents a method for latent variable models by pulling back the data into the space of latent variables, and carrying out model criticism in that space. Making use of a model's structure enables a more direct assessment of the assumptions made in the prior and likelihood. We demonstrate the method with examples of model criticism in latent space applied to factor analysis, linear dynamical systems and Gaussian processes.

Full Article

ici

Efficient Acquisition Rules for Model-Based Approximate Bayesian Computation

By projecteuclid.org
Published On :: Wed, 13 Mar 2019 22:00 EDT

Marko Järvenpää, Michael U. Gutmann, Arijus Pleska, Aki Vehtari, Pekka Marttinen.

Source: Bayesian Analysis, Volume 14, Number 2, 595--622.

Abstract:
Approximate Bayesian computation (ABC) is a method for Bayesian inference when the likelihood is unavailable but simulating from the model is possible. However, many ABC algorithms require a large number of simulations, which can be costly. To reduce the computational cost, Bayesian optimisation (BO) and surrogate models such as Gaussian processes have been proposed. Bayesian optimisation enables one to intelligently decide where to evaluate the model next but common BO strategies are not designed for the goal of estimating the posterior distribution. Our paper addresses this gap in the literature. We propose to compute the uncertainty in the ABC posterior density, which is due to a lack of simulations to estimate this quantity accurately, and define a loss function that measures this uncertainty. We then propose to select the next evaluation location to minimise the expected loss. Experiments show that the proposed method often produces the most accurate approximations as compared to common BO strategies.
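
To make the opening idea of the abstract concrete, here is a minimal sketch of plain rejection ABC on a toy problem. It is not the acquisition-rule method proposed in the paper: there is no Gaussian process surrogate and no Bayesian optimisation. The simulator, the uniform prior, the summary statistic (sample mean) and the tolerance `epsilon` are all assumptions chosen for illustration.

```python
import random
import statistics


def simulate(theta, n, rng):
    """Toy simulator (an assumption): n draws from Normal(theta, 1)."""
    return [rng.gauss(theta, 1.0) for _ in range(n)]


def abc_rejection(observed, n_draws, epsilon, seed=0):
    """Plain rejection ABC: keep prior draws whose simulated summary
    statistic lies within epsilon of the observed one."""
    rng = random.Random(seed)
    obs_mean = statistics.fmean(observed)
    accepted = []
    for _ in range(n_draws):
        theta = rng.uniform(-5.0, 5.0)  # assumed prior: Uniform(-5, 5)
        sim = simulate(theta, len(observed), rng)
        # The likelihood is never evaluated; only simulation is needed.
        if abs(statistics.fmean(sim) - obs_mean) <= epsilon:
            accepted.append(theta)
    return accepted


# Synthetic "observed" data generated with a known parameter value.
observed = simulate(2.0, 50, random.Random(42))
posterior_sample = abc_rejection(observed, n_draws=20000, epsilon=0.1)
```

Every accepted draw costs a full simulation and most draws are rejected, which illustrates the computational cost that the abstract says surrogate-based approaches aim to reduce.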

Full Article

