Statistics Theory

New submissions

[ total of 12 entries: 1-12 ]

New submissions for Wed, 14 Aug 19

[1]  arXiv:1908.04328 [pdf, other]
Title: Identifying shifts between two regression curves
Subjects: Statistics Theory (math.ST)

This article studies the problem of whether two convex (concave) regression functions modelling the relation between a response and covariate in two samples differ by a shift in the horizontal and/or vertical axis. We consider a nonparametric situation assuming only smoothness of the regression functions. A graphical tool based on the derivatives of the regression functions and their inverses is proposed to answer this question and studied in several examples. We also formalize this question in a corresponding hypothesis and develop a statistical test. The asymptotic properties of the corresponding test statistic are investigated under the null hypothesis and local alternatives. In contrast to most of the literature on comparing shape invariant models, which requires independent data, the procedure is applicable for dependent and non-stationary data. We also illustrate the finite sample properties of the new test by means of a small simulation study and a real data example.

[2]  arXiv:1908.04331 [pdf, ps, other]
Title: Elements of asymptotic theory with outer probability measures
Subjects: Statistics Theory (math.ST)

Outer measures can be used for statistical inference in place of probability measures to bring flexibility in terms of model specification. The corresponding statistical procedures such as estimation or hypothesis testing need to be analysed in order to understand their behaviour, and motivate their use. In this article, we consider a simple class of outer measures based on the supremum of particular functions that we refer to as possibility functions. We then derive the asymptotic properties of the corresponding maximum likelihood estimators, likelihood ratio tests and Bayesian posterior uncertainties. These results are largely based on versions of both the law of large numbers and the central limit theorem that are adapted to possibility functions. Our motivation with outer measures is through the notion of uncertainty quantification, where verification of these procedures is of crucial importance. These introduced concepts naturally strengthen the link between the frequentist and Bayesian approaches.

[3]  arXiv:1908.04433 [pdf, other]
Title: Sharp Guarantees for Solving Random Equations with One-Bit Information
Comments: 12 pages, 4 figures
Subjects: Statistics Theory (math.ST); Information Theory (cs.IT); Machine Learning (cs.LG); Signal Processing (eess.SP)

We study the performance of a wide class of convex optimization-based estimators for recovering a signal from corrupted one-bit measurements in high dimensions. Our general result predicts sharply the performance of such estimators in the linear asymptotic regime when the measurement vectors have i.i.d. Gaussian entries. This includes, as a special case, the previously studied least-squares estimator and various novel results for other popular estimators such as least-absolute deviations, hinge-loss and logistic-loss. Importantly, we exploit the fact that our analysis holds for generic convex loss functions to prove a bound on the best achievable performance across the entire class of estimators. Numerical simulations corroborate our theoretical findings and suggest they are accurate even for relatively small problem dimensions.

[4]  arXiv:1908.04462 [pdf, other]
Title: The bias of isotonic regression
Subjects: Statistics Theory (math.ST)

We study the bias of the isotonic regression estimator. While there is extensive work characterizing the mean squared error of the isotonic regression estimator, relatively little is known about the bias. In this paper, we provide a sharp characterization, proving that the bias scales as $O(n^{-\beta/3})$ up to log factors, where $1 \leq \beta \leq 2$ is the exponent corresponding to Hölder smoothness of the underlying mean. Importantly, this result only requires a strictly monotone mean and that the noise distribution has subexponential tails, without relying on symmetric noise or other restrictive assumptions.
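
For readers unfamiliar with the estimator named in this abstract, the following is a minimal sketch of least-squares isotonic regression computed with the standard pool-adjacent-violators approach. It is an illustration only and is not taken from the paper; the function name `isotonic_regression` and the equal-weight, equally ordered design are assumptions made here.

```python
def isotonic_regression(y):
    """Least-squares nondecreasing fit to y (hypothetical helper, equal weights).

    Uses pool-adjacent-violators: keep a stack of blocks, each storing the sum
    and count of the observations it pools, and merge the last two blocks
    whenever their means violate monotonicity.
    """
    blocks = []  # each entry is [sum_of_values, number_of_values]
    for value in y:
        blocks.append([float(value), 1])
        # Merge while the previous block mean exceeds the newest block mean.
        while (len(blocks) > 1
               and blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            total, count = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
    # Expand each block back to one fitted value (the block mean) per point.
    fit = []
    for total, count in blocks:
        fit.extend([total / count] * count)
    return fit
```

Each fitted value is the mean of the block that contains it, so the output is nondecreasing by construction. The bias studied in the abstract is the difference between the expectation of this fit and the true mean function.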

[5]  arXiv:1908.04468 [pdf, ps, other]
Title: A Fast Spectral Algorithm for Mean Estimation with Sub-Gaussian Rates
Comments: 27 pages
Subjects: Statistics Theory (math.ST); Data Structures and Algorithms (cs.DS); Machine Learning (cs.LG); Machine Learning (stat.ML)

We study the algorithmic problem of estimating the mean of a heavy-tailed random vector in $\mathbb{R}^d$, given $n$ i.i.d. samples. The goal is to design an efficient estimator that attains the optimal sub-gaussian error bound, only assuming that the random vector has bounded mean and covariance. Polynomial-time solutions to this problem are known but have high runtime due to their use of semi-definite programming (SDP). Conceptually, it remains open whether convex relaxation is truly necessary for this problem.
In this work, we show that it is possible to go beyond SDP and achieve better computational efficiency. In particular, we provide a spectral algorithm that achieves the optimal statistical performance and runs in time $\widetilde O\left(n^2 d \right)$, improving upon the previous fastest runtime $\widetilde O\left(n^{3.5}+ n^2d\right)$ by Cherapanamjeri et al. (COLT '19) and matching the concurrent work by Depersin and Lecué. Our algorithm is spectral in that it only requires (approximate) eigenvector computations, which can be implemented very efficiently by, for example, power iteration or the Lanczos method.
At the core of our algorithm is a novel connection between the furthest hyperplane problem introduced by Karnin et al. (COLT '12) and a structural lemma on heavy-tailed distributions by Lugosi and Mendelson (Ann. Stat. '19). This allows us to iteratively reduce the estimation error at a geometric rate using only the information derived from the top singular vector of the data matrix, leading to a significantly faster running time.
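
The abstract above mentions power iteration as one way to carry out approximate eigenvector computations. The sketch below shows generic power iteration for the top eigenpair of a symmetric positive semi-definite matrix. It is not the paper's mean-estimation algorithm; the function name `power_iteration`, the fixed iteration count, and the seeded random start are assumptions made for illustration.

```python
import math
import random


def power_iteration(matrix, num_iters=200, seed=0):
    """Approximate the top eigenvalue and eigenvector of a symmetric PSD matrix.

    `matrix` is a list of rows (hypothetical input format chosen for this sketch).
    """
    rng = random.Random(seed)
    n = len(matrix)

    def matvec(v):
        # Dense matrix-vector product.
        return [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]

    # Random unit-norm starting vector.
    v = [rng.gauss(0.0, 1.0) for _ in range(n)]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]

    for _ in range(num_iters):
        w = matvec(v)
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:  # v lies in the null space; nothing more to do
            break
        v = [x / norm for x in w]

    # Rayleigh quotient of the unit vector v estimates the top eigenvalue.
    mv = matvec(v)
    eigenvalue = sum(v[i] * mv[i] for i in range(n))
    return eigenvalue, v
```

To obtain the top right singular vector of a data matrix $X$, one could apply the same routine to $X^\top X$; each iteration then costs two matrix-vector products with $X$, which is what makes this kind of spectral step cheap compared with solving an SDP.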

[6]  arXiv:1908.04553 [pdf, other]
Title: Principal symmetric space analysis
Subjects: Statistics Theory (math.ST); Differential Geometry (math.DG)

We develop a novel analogue of Euclidean PCA (principal component analysis) for data taking values on a Riemannian symmetric space, using totally geodesic submanifolds as approximating lower-dimensional submanifolds. We illustrate the technique on n-spheres, Grassmannians, n-tori and polyspheres.

Cross-lists for Wed, 14 Aug 19

[7]  arXiv:1908.04569 (cross-list from q-fin.RM) [pdf, other]
Title: Forecast Encompassing Tests for the Expected Shortfall
Comments: 26 pages, 3 tables, 1 figure
Subjects: Risk Management (q-fin.RM); Econometrics (econ.EM); Statistics Theory (math.ST)

In this paper, we introduce new forecast encompassing tests for the risk measure Expected Shortfall (ES). Forecasting and forecast evaluation techniques for the ES are rapidly gaining attention through the recently introduced Basel III Accords, which stipulate the use of the ES as the primary market risk measure in international banking regulations. Encompassing tests generally rely on the existence of strictly consistent loss functions for the functionals under consideration, which do not exist for the ES. However, our encompassing tests are based on recently introduced loss functions and an associated regression framework which considers the ES jointly with the corresponding Value at Risk (VaR). This setup facilitates several testing specifications which allow for both joint tests for the ES and VaR and stand-alone tests for the ES. We present asymptotic theory for our encompassing tests and verify their finite sample properties through various simulation setups. In an empirical application, we utilize the encompassing tests to demonstrate the superiority of forecast combination methods for the ES for the IBM stock.

Replacements for Wed, 14 Aug 19

[8]  arXiv:1605.02214 (replaced) [pdf, other]
Title: On cross-validated Lasso
Comments: 38 pages, 1 figure, 3 tables
Subjects: Statistics Theory (math.ST)
[9]  arXiv:1806.01431 (replaced) [pdf, ps, other]
Title: A Uniform-in-$P$ Edgeworth Expansion under Weak Cramér Conditions
Authors: Kyungchul Song
Subjects: Statistics Theory (math.ST)
[10]  arXiv:1908.03152 (replaced) [pdf, other]
Title: Analysis of Networks via the Sparse $β$-Model
Comments: 38 pages
Subjects: Statistics Theory (math.ST)
[11]  arXiv:1812.02127 (replaced) [pdf, other]
Title: Information geometry for approximate Bayesian computation
Subjects: Methodology (stat.ME); Probability (math.PR); Statistics Theory (math.ST); Applications (stat.AP); Machine Learning (stat.ML)
[12]  arXiv:1904.04239 (replaced) [pdf, other]
Title: Minimax-Optimal Algorithms for Detecting Changes in Statistically Periodic Random Processes
Comments: arXiv admin note: text overlap with arXiv:1810.12760, arXiv:1807.06945
Subjects: Signal Processing (eess.SP); Information Theory (cs.IT); Statistics Theory (math.ST)