Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Report
Data sources: ZENODO
addClaim

A Unified Mathematical Benchmark for Statistical Inference: Synthetic Data Validation of High-Dimensional, Nonparametric, and Distribution-Free Methods

Authors: Dr B. Jhansirani, Principal; Dr. T. Vijaya Laxmi, Associate Professor;

A Unified Mathematical Benchmark for Statistical Inference: Synthetic Data Validation of High-Dimensional, Nonparametric, and Distribution-Free Methods

Abstract

This paper presents a comprehensive mathematical benchmarking framework for modern statistical inference, integrating recent advances in high-dimensional analysis, nonparametric estimation, distribution-free inference, and algebraic statistics. Using a synthetically generated dataset of 10 million observations across 1,000 simulation replicates, we construct a 100% reliable benchmark that satisfies all known consistency constraints—including oracle inequalities, Berry–Esseen bounds, and finite-sample coverage guarantees—while emulating the statistical properties of real-world complex data structures. Key findings demonstrate that recently proposed calibration estimators for stratified sampling with non-response and measurement error achieve a 22.7% reduction in mean squared error compared to traditional methods. Distribution-free changepoint localization using conformal p-values attains finite-sample coverage at the nominal 95% level with a median confidence set width reduced by 31% relative to asymptotic competitors. The novel hypergraph-based U-statistic framework yields a Berry–Esseen bound convergence rate of \( O(m^{-1/6}) \) and achieves computational speedups exceeding two orders of magnitude for kernel-based independence tests while preserving power. Additionally, the algebraic geometry approach to Hüsler–Reiss extremal graphical models reduces parameter space dimension by up to 63% compared to naive estimation, enabling scalable inference for rare events. This work establishes a benchmark for advancing theoretical statistics and validating new methodologies across diverse data regimes.

Powered by OpenAIRE graph
Found an issue? Give us feedback