2026-06-13 每日 arXiv 资讯¶

高相关论文 6 篇 · 中相关 19 篇 · 其他 10 篇 · 会议/Seminar 事件 0 条

⭐ 高相关论文（按主题分组）¶

因果推断 (causal_inference, 3 篇)¶

1. 2606.13519 — Semiparametric Local Projections¶

作者: Silvia Goncalves, Ana Maria Herrera, Lutz Kilian, Elena Peavento, Iones Kelanemer Holban
分类: econ.EM
相关性 9/10 · novelty: new_method
摘要: 在宏观经济学结构动态模型框架下，目标是估计非线性脉冲响应函数（如含非线性变换回归元、状态依赖系数及冲击-状态交互的模型）。本文提出基于 doubly robust moment condition 的半参数局部投影估计器，将平均响应函数识别为非参数条件均值的线性泛函，并附加密度比以捕捉目标冲击的偏移效应。结合处理序列依赖的 cross-fitting 程序，所得估计器具有 √T-consistency 与渐近正态性。模拟与实证验证了有限样本表现。对您可能有用：该 doubly robust + cross-fitting 构造直接连接 semiparametric efficiency 与 longitudinal/动态因果推断的估计理论。
关键技术: doubly robust moment condition, semiparametric local projection, density ratio weighting, cross-fitting for serial dependence, nonlinear impulse response function
为什么对您有用: 直接连接 causal inference 的 longitudinal/动态设定与 semiparametric efficiency 理论：doubly robust moment + density ratio 的构造是典型的 semiparametric functional estimation 路径，cross-fitting 处理序列依赖可对照 DML 的标准做法。用 very_familiar 的 estimation theory in causal inference 可直接审视其 DR moment 的 influence function 推导与 √T-CAN 假设是否紧；若想进一步验证其效率界是否达到 semiparametric efficiency bound，需在 moderately_familiar 的 semiparametric theory 上长肌肉。立即可做：用现有因果估计理论工具分析其 DR moment 与 identification 条件。

2. 2606.13281 — Causal invariance in graphical models with latent variables¶

作者: Marco Borriero, Monia Lupparelli, Giovanni M. Marchetti, Veronica Vinciotti
分类: stat.ME
相关性 8/10 · novelty: new_theory
摘要: 在因果发现设定下，目标是利用因果不变性原理（causal invariance）跨实验环境识别目标变量的因果父变量。当部分父变量为潜在变量时，观测变量上的诱导图不再是 DAG 且可能不唯一，导致识别复杂化。本文针对潜在父变量的关键构型，刻画了诱导图的结构，并形式化了因果不变性在识别观测父变量时得以保留的条件。对多元高斯目标，作者建立了检验该不变性的必要与充分条件。理论结果将 latent-variable DAG 的图论性质与参数识别的充要条件精确对接，对您在因果识别理论方向的工作有直接参考价值。
关键技术: causal invariance principle, latent variable DAG, induced graph characterization, necessary and sufficient identification conditions, multivariate Gaussian testing
为什么对您有用: 直接连接因果推断的 identification theory 子方向，聚焦 latent-variable 下因果不变性的充要条件，与您 moderately_familiar 中的 identification theory in causal inference 精确对接。用 very_familiar 的 nonparametric statistics 与 minimax bounds 视角，可审视其高斯假设下检验的 power 界与更一般分布的推广（当前论文仅限高斯）。立即可做：用现有武器库即可复现其高斯检验并尝试向 semiparametric 模型推广识别条件。

3. 2606.13305 — Semiparametric Bayesian inference for causal mediation in cluster randomized trials¶

作者: Woojung Bae, Michael Daniels, Joseph Hogan, Rajesh Vedanthan, Stavroula Chrysanthopoulou
分类: stat.ME · stat.AP · stat.CO
相关性 8/10 · novelty: new_method
摘要: 在集群随机试验（CRT）设定下，目标是当中介变量处于集群层面且集群数有限时的自然直接/间接效应（NDE/NIE）的 identification 与估计。传统基于大样本渐近的方差估计在小样本集群下失效，导致置信区间覆盖不足。本文提出半参数贝叶斯推断框架：对结局与中介指定参数贝叶斯模型以保证计算效率与可解释性，同时提出新颖的 similarity-weighted Bayesian bootstrap（SWBB）量化不确定性，通过集群间距离度量从相近集群借信息，避免了严格的参数分布假设。结合观测数据模型与因果假设，该方法在有限集群数下仍能准确估计 NDE/NIE。模拟显示该方法在多种场景下达到名义覆盖概率，并在肯尼亚 CRT 数据中实证应用。对您有用：本文处理了因果中介分析中有限集群的推断难题，其 SWBB 借信息机制与半参数思路可启发 longitudinal/clustered 数据下的 sensitivity 分析。
关键技术: causal mediation analysis, cluster randomized trials, Bayesian bootstrap, similarity-weighted borrowing of information, natural direct/indirect effects, semiparametric Bayesian inference
为什么对您有用: 本文直接连接 causal inference 的 mediation 子方向，处理 CRT 有限集群下 NDE/NIE 的推断——这是经典 semiparametric efficiency bound 难以直接覆盖的小样本场景。SWBB 的距离加权借信息机制，可用您 moderately_familiar 的 semiparametric theory 视角审视其与 HOIF / one-step estimator 在有限样本下的优劣对比。中期可做：需先在 moderately_familiar 的 semiparametric theory 上长肌肉，具体是推导 CRT 下 NDE/NIE 的 semiparametric efficiency bound，以判断 SWBB 是否逼近该界或在何种偏离下仍有稳健覆盖。

高维统计 / 随机矩阵 (high_dim_rmt, 2 篇)¶

1. 2606.12943 — Phase transition of Schott's statistic for high-dimensional heavy-tailed data¶

作者: Hantao Chen, Guangming Pan, Cheng Wang
分类: math.ST · stat.TH
相关性 8/10 · novelty: new_theory
摘要: 在 α-regularly varying 重尾分布设定下，研究 Schott's statistic（样本相关矩阵 Frobenius 范数）的渐近分布，关键 regularity 假设为尾部指数 α 与维度 p、样本量 n 的关系。核心发现是相变现象：轻尾（α>3）时重现不依赖 α 的经典渐近分布，并放宽了对 p/n 比率的限制；重尾（α<3）时导出新的渐近正态分布，其方差显式依赖 α。作者进一步提出渐近方差的一致估计量，使标准化 Schott's test statistic 在未知位置参数和所有 α>0 下均可适用。对您可能有用：该相变阈值 α=3 的刻画直接丰富了高维 RMT 在非标准矩条件下的检验理论。
关键技术: Schott's statistic, regularly varying heavy-tailed distribution, phase transition in high dimensions, sample correlation matrix Frobenius norm, asymptotic normality under heavy tails, variance estimation for unknown location
为什么对您有用: 直接连接到 primary interest 中的 high-dimensional statistics 与 Random matrix theory，特别是 RMT 在非标准矩条件（重尾）下的假设检验与相变刻画。您武器库中 very_familiar 的 high-dimensional asymptotics 可直接用来审视其 p/n 放宽条件下的矩截断论证，moderately_familiar 的 M-estimation theory 也可为重尾下 variance estimator 的一致性提供替代视角。Follow-up 判断：立即可做——用 very_familiar 的高维渐近工具即可验证其相变边界是否紧，或尝试将此重尾 RMT 检验框架迁移到您熟悉的 inverse problems with random noise 设定中。

2. 2606.12943 — Phase transition of Schott's statistic for high-dimensional heavy-tailed data¶

作者: Hantao Chen, Guangming Pan, Cheng Wang
分类: math.ST · stat.TH
相关性 8/10 · novelty: new_theory
摘要: 在 α-regularly varying 重尾分布设定下，研究 Schott 统计量（样本相关矩阵 Frobenius 范数平方）的渐近分布，关键参数为维数 p、样本量 n 与尾部系数 α。发现该统计量存在相变现象：当 α>3（轻尾）时，渐近分布不依赖 α，且放宽了对 p/n 比率的限制；当 α<3（重尾）时，渐近正态分布的方差显式依赖于 α。作者进一步提出了渐近方差的一致估计量，使得标准化后的 Schott 检验统计量在位置参数未知且对所有 α>0 的情况下仍可实用。对您可能有用：该相变阈值 α=3 与四阶矩条件直接对应，为高维重尾假设检验提供了精确的渐近刻画。
关键技术: Schott's statistic, regularly varying distribution, phase transition, high-dimensional asymptotics, sample correlation matrix, asymptotic variance estimation
为什么对您有用: 直接连接到 primary interest 中的 high-dimensional statistics / Random matrix theory，特别是高维重尾数据下假设检验的渐近理论。technical_arsenal 中的 'high-dimensional asymptotics' 可直接用来验证其相变阈值 α=3 与四阶矩条件的对应关系是否可推广至其他 RMT 检验统计量。判断：立即可做——用 very_familiar 的高维渐近工具即可动手探索该相变现象在其他谱统计量上的普适性。

非参数 / 半参数 (nonparam_semipara, 1 篇)¶

1. 2606.12884 — Volterra--Wiener--Kunchenko Orthogonalization: From Wiener--Hermite to Distribution-Matched Volterra Bases¶

作者: Serhii Zabolotnii
分类: stat.ME · eess.SP
相关性 8/10 · novelty: new_theory
摘要: 在有限记忆 Volterra 系统辨识设定下，当输入非高斯时，传统单项式参数化面临病态问题，Wiener-Hermite 展开仅对高斯白噪声消除病态。本文通过在 \(L^2(P)\) 中对单项式做定向 Gram-Schmidt 正交化，构造了分布匹配的 Volterra-Wiener-Kunchenko (VWK) 基，将其作为任意多项式混沌坐标系用于数据驱动的 Volterra 辨识。核心理论结果是一个二阶误设惩罚定理：在对角方差匹配基中的自规范化对角估计器，其超额 \(L^2(P)\) 风险由偏度系数 \(\delta=\mu_3/\sigma^2\) 控制，且在输入对称时精确为零。数值实验表明，在 \(n=2000\) 时，中心指数输入的经验 VWK Gram 条件数远优于幂基 Gram，尽管随阶数退化；此外，Lean 4 机器检验证明了 Binomial\((N,p)\) Krawtchouk 行的构造。全文分析基于矩方法、有限记忆且限于乘积输入律。对您可能有用：VWK 基的正交化与误设风险界为非高斯设定下的半参数/非参数估计提供了新的正交基选择与风险控制视角。
⚠️ 摘要不完整，待重跑（python -m research_news.rerun）
关键技术: Gram-Schmidt orthogonalization in L2(P), Volterra-Wiener-Kunchenko basis, generalized polynomial chaos, order-2 misspecification-penalty theorem, condition number analysis of empirical Gram matrix, Lean 4 machine-checked proof
为什么对您有用: 本文连接到非参数理论中的正交基构造与 M-estimation 病态问题：(1) 直接对应 nonparametric statistics 与 M-estimation theory 中基选择对估计稳定性的影响；(2) 您武器库中 minimax bounds for estimation problems 与 high-dimensional asymptotics 可用于分析 VWK 对角估计器的有限样本风险是否达到 minimax 界，以及经验 Gram 条件数退化阶的精确刻画；(3) 中期可做：需先在 moderately_familiar 的 M-estimation theory 上长肌肉，以将二阶误设惩罚定理推广到非乘积输入律或无限记忆设定，并建立更紧的有限样本界。

📌 中相关论文（按主题分组）¶

因果推断 (causal_inference, 2 篇)¶

1. 2606.13531 — When Representative Samples Produce Worse Outcomes: Scale-up Decisions and Testing in Small-Budget RCTs¶

作者: Hannah Li, Hongseok Namkoong, Isaac Scheinfeld
分类: stat.ME
相关性 6/10 · novelty: new_theory
摘要: 在预算受限的小规模 RCT 用于筛选干预并决定是否扩大规模的设定下，目标是最大化下游期望收益。核心挑战在于显著性检验作为筛选门槛时，代表性样本未必最优。作者证明：大预算极限下最优设计收敛于目标人群的代表性样本；但在小预算情形下，最大化期望影响的最优策略是仅从单一同质子人群中抽样，具体选择取决于抽样成本与对异质性处理效应的先验信念。小预算情形的证明更一般化：当 RCT 与显著性检验决定是否获得非自适应下游收益时均成立。对您可能有用：该结论为小样本 RCT 的设计选择提供了理论依据，直接关联因果推断中实验设计与异质性处理效应的 identification 问题。
⚠️ 摘要不完整，待重跑（python -m research_news.rerun）
关键技术: optimal pilot design, heterogeneous treatment effects, significance testing threshold, downstream expected improvement, large-budget limit convergence, small-budget homogeneous sampling
为什么对您有用: 直接关联因果推断中 RCT 实验设计与异质性处理效应的子群 identification 问题。可用 minimax bounds for estimation problems 的工具分析小预算下估计-检验联合决策的最优性界，验证其声称的"单一子群最优"结论是否紧。中期可做：需先在 moderately_familiar 的 identification theory in causal inference 上长肌肉，以将异质性先验与子群 identification 形式化。

2. 2606.13025 — Diagnostics-guided variance-inflated Fay-Herriot estimation from non-probability samples¶

作者: Andrius \v{C}iginas
分类: stat.ME
相关性 4/10 · novelty: new_method
摘要: 在非概率样本的小区域估计设定下，目标是有限总体域总量；核心问题是标准 Fay-Herriot (FH) 模型将 IPW 估计的方差视为完整不确定性度量，忽略了弱覆盖、不稳定权重或辅助变量不平衡带来的选择偏差风险。本文提出 diagnostics-guided variance-inflated FH 估计量：先构建校准 IPW 域估计量，通过域级诊断指标汇总可靠性，再在 FH 观测方程中引入混合方差膨胀成分。诊断指标显示 IPW 信息较弱的域会被更强地平滑至域级回归均值。基于立陶宛企业伪真实总体的验证表明，该方法相对于校准 IPW 显著降低了估计误差。对您可能有用：该文将选择偏差风险通过方差膨胀嵌入 FH 模型的思路，为非概率样本因果推断中的 sensitivity analysis 提供了新视角。
关键技术: Fay-Herriot model, calibrated inverse probability weighting, domain-level diagnostics, mixture variance inflation, small area estimation
为什么对您有用: 本文直接连接因果推断中非概率样本的选择偏差与 IPW 权重不稳定问题，属于 estimation theory in causal inference 的延伸。您武器库中 very_familiar 的 inverse problems with random noise 与 minimax bounds 视角，可用来分析此方差膨胀机制在何种信号-噪声比下能真正收紧估计误差的 minimax rate。Follow-up 判断：中期可做——需先在 moderately_familiar 的 identification theory in causal inference 上补充非概率样本 identification 的最新文献，才能将此方差膨胀思路系统嵌入 proximal CI 或 sensitivity analysis 框架。

非参数 / 半参数 (nonparam_semipara, 10 篇)¶

1. 2606.13230 — Consistency of variational approximations under bounded Kullback--Leibler divergence¶

作者: Hien Duy Nguyen, Jacob Westerhout, Thomas Guilmeau, Julyan Arbel
分类: math.ST · stat.TH
相关性 7/10 · novelty: new_theory
摘要: 在贝叶斯变分推断框架下，研究变分近似序列何时继承目标后验的弱一致性。核心设定为：目标后验在一般度量空间上弱收敛到真参数的 Dirac mass，且变分近似序列对目标后验的 KL 散度被均匀控制。第一主定理证明，对紧致目标测度序列的 KL 散度均匀有界直接迫使变分序列也是紧致的，从而保证弱一致性。进一步给出对数矩条件来验证该 KL 有界性，并在光滑广义后验上展示其应用。对您可能有用：该结果为 semiparametric / nonparametric 贝叶斯推断中变分方法的渐近理论提供了基础一致性保证，可补充您在 semiparametric efficiency 与 M-estimation 理论中对贝叶斯近似推断的理论缺口。
关键技术: variational approximation, posterior consistency, Kullback-Leibler divergence bound, weak convergence to Dirac mass, logarithmic-moment condition, generalised posterior
为什么对您有用: 本文直接连接 semiparametric / nonparametric theory 子方向，为非参数/半参数贝叶斯变分推断提供 KL 有界性驱动的弱一致性保证。用您 very_familiar 的 minimax bounds 与 inverse problems with random noise 工具，可以分析该 KL 有界条件在具体非参数模型下的可达性，并进一步用 moderately_familiar 的 M-estimation theory 探究广义后验与 M-估计量的变分近似联系。立即可做：用 minimax 与经验过程工具验证文中对数矩条件在常见非参数先验下的紧性。

2. 2606.13230 — Consistency of variational approximations under bounded Kullback--Leibler divergence¶

作者: Hien Duy Nguyen, Jacob Westerhout, Thomas Guilmeau, Julyan Arbel
分类: math.ST · stat.TH
相关性 7/10 · novelty: new_theory
摘要: 在贝叶斯推断中，当精确后验计算不可行时，研究变分近似（variational approximations）是否继承真实后验的相合性（posterior consistency）。设定为一般度量空间上的概率测度族，目标测度序列弱收敛到真实参数处的 Dirac 测度。核心结果是：若变分近似序列到目标序列的 KL 散度一致有界，则近似序列必然 tight；进而当真实后验相合时，满足 KL 有界条件的变分序列也弱收敛到同一 Dirac 测度，即变分相合性。文中还给出验证 KL 有界性的对数矩条件（logarithmic-moment conditions），并在光滑广义后验（generalised posteriors）上做了示例验证。对您可能有用：该文为 semiparametric / nonparametric 模型中变分推断的理论可靠性提供了 KL 散度视角的充分条件，可补充您对 semipara efficiency 与贝叶斯相合性交叉的理解。
关键技术: variational approximation, posterior consistency, Kullback-Leibler divergence, weak convergence, tightness criterion, generalised posterior
为什么对您有用: 直接连接到 semiparametric and nonparametric theory 子方向：本文在一般度量空间上用 KL 散度一致有界刻画变分近似的相合性，属于非参数贝叶斯推断的理论基础。用您 very_familiar 的 minimax bounds for estimation problems 视角，可以审视该 KL 有界条件是否与已知非参数后验收敛率（如 Ghosal–van der Vaart 的 1/n^{α} rate）兼容，并探讨变分近似在 semipara efficiency bound 下的收敛率是否仍能保持。中期可做：需先在 moderately_familiar 的 semiparametric theory 上长肌肉——具体需补广义后验（generalised posterior）与 loss-based Bayesian 的相合性文献，才能将 KL 有界条件与 semipara influence function / efficient estimator 的收敛率对接。

3. 2606.12739 — Estimating Semiparametric and Nonparametric Fixed Effects Panel Data Models with mgcv¶

作者: Ivan Korolev
分类: econ.EM
相关性 7/10 · novelty: new_method
摘要: 本文在固定效应面板数据模型下，目标是半参数/非参数函数的估计与推断，关键假设是面板固定效应可被惩罚或差分消除。核心机制是利用 R 包 mgcv 的 penalized spline（通过 bam 实现）自适应未知光滑度，并与线性固定效应、first-differencing 及固定序列 spline 方法对比；推断方面提出 penalty-adjusted cluster-robust covariance estimator，保证有限维参数检验的 nominal size 及非参数函数 confidence band 的准确覆盖。Monte Carlo 显示 penalized spline 在所研究设定下函数估计精度优于固定序列方法，且惩罚调整后的聚类稳健方差估计使检验表现良好。对您可能有用：该文将半参数惩罚 spline 推断与面板固定效应结合，直接连接 semiparametric theory 与 econ_theory 的应用因果工作。
关键技术: penalized spline, fixed effects panel data, cluster-robust covariance, penalty-adjusted variance estimator, mgcv::bam, first-differencing
为什么对您有用: 本文连接 semiparametric theory（惩罚 spline 的光滑度自适应与 penalty-adjusted 推断）与 econ_theory（面板固定效应模型的应用因果设定）。用您 very_familiar 的 nonparametric statistics 与 software development 可以直接审视其 penalty-adjusted covariance 的理论细节与 mgcv 实现逻辑。Follow-up 判断：立即可做——您可用 M-estimation theory（moderately_familiar）验证其 penalty-adjusted cluster-robust variance 的渐近正当性，或用 minimax bounds 检验其声称的 adaptive smoothness 是否达到最优率。

4. 2606.13554 — Asymptotic regimes for maximum likelihood estimation in the Ewens--Pitman model: When the strength parameter matters¶

作者: Filippo Ascolani, Mario Beraha, Stefano Favaro
分类: math.ST · stat.ME · stat.TH
相关性 6/10 · novelty: new_theory
摘要: 在 Ewens–Pitman 随机划分模型下，研究折扣与强度参数 (α,θ) 的 MLE 大样本渐近行为，数据生成机制仅需温和假设。依据频谱的极限行为，严格刻画出四种不同的渐近 regime；特别地，与既有文献不同，发现 θ 可在渐近中起关键作用。指出既有文献因无限可交换性隐式施加了刚性结构关系，仅覆盖其中两种 regime；为突破此约束，提出 scaled Ewens–Pitman 模型，允许 θ 随样本量 n 增长。最后用真实数据展示经典框架无法覆盖的频谱需此扩展。对您可能有用：该文对离散结构参数 MLE 的多 regime 渐近分析，为 semiparametric theory 中参数随 n 增长的 drifting parameter 设定提供了新视角。
关键技术: Ewens-Pitman random partition, maximum likelihood estimation, frequency spectrum asymptotics, scaled parameter regime, infinite exchangeability constraint
为什么对您有用: 本文连接到 semiparametric theory 中 drifting/nuisance parameter 的渐近分析子方向；您可用 minimax bounds for estimation problems 的武器库来检验文中四种 regime 下 MLE 的收敛率是否达到 minimax optimal，或用 M-estimation theory 分析 scaled EP 模型中 θ_n→∞ 时 MLE 的 influence function 结构。Follow-up 粗判：中期可做——需先在 moderately_familiar 的 M-estimation theory 上长肌肉，以处理参数随 n 漂移的非标准渐近设定。

5. 2606.13554 — Asymptotic regimes for maximum likelihood estimation in the Ewens--Pitman model: When the strength parameter matters¶

作者: Filippo Ascolani, Mario Beraha, Stefano Favaro
分类: math.ST · stat.ME · stat.TH
相关性 6/10 · novelty: new_theory
摘要: 本文研究 Ewens–Pitman 随机划分模型中折扣参数 α 与强度参数 θ 的 MLE 大样本渐近行为，数据生成机制仅要求温和的频率谱条件。作者发现根据频率谱的极限行为，MLE 会出现四种截然不同的渐近 regime；特别地，θ 在某些 regime 下对渐近性质起关键作用，打破了以往文献认为 θ 可忽略的结论。以往文献隐含地仅覆盖了其中两种 regime，作者证明这源于无限可交换性假设对块数与频率谱施加的刚性结构约束。为克服该约束，作者提出 scaled Ewens–Pitman 模型，允许 θ 随样本量 n 增长，从而覆盖更广泛的频率谱结构。最后，真实数据实证表明经典框架之外的频率谱确实存在，需要该扩展模型。对您可能有用：本文展示了参数增长（θ ∝ n）如何改变 MLE 渐近 regime，这与 semiparametric efficiency 中考虑 drifting parameter / contiguous sequence 的经典思路有结构相似性。
关键技术: Ewens-Pitman random partition model, MLE asymptotic regimes, frequency spectrum limiting behavior, scaled parameter model (theta proportional to n), infinite exchangeability structural constraint, contiguous parameter sequence
为什么对您有用: 本文连接到 semiparametric/nonparametric theory 子方向中 drifting parameter / contiguous sequence 对渐近性质的影响这一经典话题。用您 very_familiar 的高维渐近工具可以直接分析 scaled EP 模型中 θ ∝ n 时 MLE 的极限分布与收敛率，验证作者声称的四种 regime 划分是否紧。立即可做：用 minimax bounds 框架对 scaled EP 模型的估计问题定出最优收敛率，并与本文 MLE 的四种 regime 对比。

6. 2606.13242 — Least Absolute Deviations Estimation for Sinusoidal Models¶

作者: Zehaan Naik, Debasis Kundu
分类: stat.ME · stat.CO
相关性 5/10 · novelty: new_method
摘要: 在正弦回归模型设定下，本文研究基于最小绝对偏差（LAD）的频率与振幅参数的稳健估计问题，假设噪声可具有重尾或离群点。核心方法是将 LAD 目标函数直接最小化，利用目标函数的部分凸性设计模块化坐标下降算法：振幅参数通过加权中位数更新，频率参数则采用基于周期图的网格搜索加局部精化。理论方面，在常规正则条件下证明了估计量的强一致性与渐近正态性，避免了传统 L1 线性规划（simplex）方法的计算瓶颈。实证部分在合成数据与 Mauna Loa CO2、航空乘客等真实时间序列上验证了方法对非高斯噪声的稳健性。对您可能有用：本文的 LAD 渐近正态性推导与部分凸性坐标下降策略，为非参数/半参数 M-估计中的稳健计算提供了一个可借鉴的具体案例。
关键技术: least absolute deviations (LAD), coordinate descent with partial convexity, weighted median update, periodogram-inspired grid search, strong consistency, asymptotic normality under heavy-tailed noise
为什么对您有用: 本文连接到 semiparametric & nonparametric theory 子方向中的 M-estimation 理论与 robust estimation。您武器库中 moderately_familiar 的 M-estimation theory 可直接攻本文的渐近正态性与一致性证明口子，验证其 regularity 条件是否可进一步放松。follow-up 粗判：立即可做——用 very_familiar 的非参数统计与 minimax bound 视角，可检验该 LAD 估计量在重尾噪声下是否达到 minimax rate，并对比 HOIF 框架下更高阶的稳健修正。

7. 2606.12857 — Discrepancy Modeling with Intermediate Variables: A New Framework for Robust Gaussian Process Calibration¶

作者: Henry Shaowu Yuchi, Michael Grosskopf, Aman Sharma, Nicolas Schunck, Jared O'Neal, Matt Menickelly, Stefan M. Wild
分类: stat.ME · stat.CO
相关性 4/10 · novelty: new_method
摘要: 在计算机实验校准（calibration）设定下，目标是利用中间变量（intermediate variables）对不完美模型（imperfect model）的偏差（discrepancy）进行建模与估计，以解决传统将 emulator 与 discrepancy 分离拟合导致的 identifiability 困难。核心方法提出一个鲁棒 GP 校准框架：通过结构化中间变量选择、离散化 scaled Gaussian stochastic process（S-GaSP）约束偏差项，并结合 space-filling 设计选取约束点，实现 emulator 与 discrepancy 的联合建模。理论贡献在于通过 S-GaSP 的尺度离散化机制与约束点设计，缓解了偏差-参数 identifiability 风险并改善了 UQ；实证在核物理结合能数据集上超越 baseline。对您可能有用：该框架的联合建模与 identifiability 缓解思路可迁移至因果推断中 proxy/IV 的 identification 问题，S-GaSP 约束机制也可作为非参偏差建模的新工具。
关键技术: Gaussian process calibration, discrepancy modeling, scaled Gaussian stochastic process (S-GaSP), intermediate variable selection, space-filling design, joint emulator-discrepancy modeling
为什么对您有用: 本文连接到非参数/半参数理论中的 GP 建模与 identifiability 问题，其利用中间变量缓解 identifiability 的思路与 proximal CI 中利用 negative control / proxy 变量实现 identification 的逻辑高度相似。用您 very_familiar 的非参数统计与 moderately_familiar 的 identification theory 可以直接审视其 S-GaSP 约束偏差项的数学机制是否真正解决了 identifiability，并探索将中间变量选择逻辑迁移到因果 proxy 设定的可能性——属于立即可做的方向。

8. 2606.13593 — Smoothed Rank-Based Regression Estimation Using Wilcoxon Score Functions¶

作者: Feridun Tasdan
分类: stat.ME
相关性 4/10 · novelty: new_method
摘要: 在简单与多重线性回归设定下，本文提出用基于平滑经验 CDF 的连续秩替代经典 Wilcoxon 秩回归中的整数秩，以估计斜率参数。核心机制是将指示函数替换为非递减可微核分布函数 H，从而构造 smoothed Wilcoxon score estimator；该估计量保留了秩回归的稳健性，同时在重尾误差下提升了效率，并更自然地处理了 ties。理论上，作者推导了回归系数的 Wald 型检验并建立了其渐近正态性。模拟显示，在正态、Laplace、Cauchy 及污染正态误差下，新估计量的相对效率一致不低于经典秩回归，在异常值与重尾场景下优势显著。对您可能有用：该平滑秩方法可视作对经典 M-estimator 的非参数修正，其渐近正态性与效率分析直接落入您的 semiparametric / nonparametric theory 与 hypothesis testing 兴趣范围。
关键技术: smoothed empirical CDF, Wilcoxon rank-score regression, kernel distribution function, Wald-type hypothesis test, asymptotic normality, robust relative efficiency
为什么对您有用: 本文连接到 nonparametric theory 与 hypothesis testing 子方向：用核平滑经验 CDF 替换指示函数构造秩估计量，属于非参数光滑修正经典 M-estimation 的范畴。您的 very_familiar 武器库中 nonparametric statistics 与 minimax bounds 可直接用来审视该估计量声称的效率提升是否达到 minimax optimal，以及其渐近方差是否逼近 semiparametric efficiency bound。Follow-up 判断：立即可做——用 minimax bound 验证其效率声明，或用 M-estimation theory 推导其 influence function 以确认是否为 semiparametric efficient。

9. 2606.13084 — Characterizing metric-space-valued processes: separating classes and weak invariance principles for measure-theoretic inference¶

作者: Anne van Delft
分类: math.ST · math.PR · stat.TH
相关性 4/10 · novelty: new_theory
摘要: 本文研究取值于缺乏拓扑向量空间结构的度量空间中的随机过程，核心设定是度量空间的几何与时间依赖结构的交互作用。作者严格证明：允许等距 Hilbert 嵌入的空间是具有球性质（ball property）的更广泛度量空间类的严格子类；后者利用 Borel 概率测度被球上的值唯一决定（separating classes）这一结构性质，绕过了传统核方法在无法等距 Hilbert 嵌入时的几何扭曲问题。基于 separating classes，推导了时间依赖随机测度族的均匀收敛及对应非平稳随机场的弱不变原理，显式揭示了依赖与几何复杂度对样本路径正则性的影响。针对小球概率快速衰减导致基于上确界的差异度量无极限分布的问题，发展了 L^p 替代方案，直接利用前述收敛结果，从而规避了高阶 U-process 的构造。对于允许等距 Hilbert 嵌入的空间（此时 U-process 自然出现），建立了退化与非退化多参数 U-process 的极限理论，并证明局部差异检验在动态参数下保持渐近稳定。
关键技术: metric space ball property, separating classes of Borel measures, weak invariance principle for nonstationary random fields, L^p discrepancy measures, multi-parameter U-process limit theory, isometric Hilbertian embedding
为什么对您有用: 直接连接到 higher-order U-statistics / nonparametric theory：本文后半部分对多参数 U-process（退化与非退化）的极限理论是您熟悉领域的新拓展，且前半部分用 L^p 差异度量绕过 U-process 构造的思路，为高阶 U-stat 的计算复杂度瓶颈提供了理论上的替代视角。立即可做：用 very_familiar 的 higher-order U-statistics（treewidth / tensor contraction）工具审视其多参数 U-process 极限理论中隐含的计算代价，判断 L^p 替代方案是否在计算上真正优于高阶 U-process。

10. 2606.13084 — Characterizing metric-space-valued processes: separating classes and weak invariance principles for measure-theoretic inference¶

作者: Anne van Delft
分类: math.ST · math.PR · stat.TH
相关性 4/10 · novelty: new_theory
摘要: 本文研究取值于缺乏拓扑向量空间结构的度量空间中的随机过程，核心设定是度量空间的几何与时间依赖结构的交互作用，关键假设是空间具有球性质（ball property）。作者严格证明，允许等距 Hilbert 嵌入的空间是具有球性质空间的严格子类；传统核方法在无法等距嵌入 Hilbert 空间时会产生几何扭曲，而本文利用球性质下 Borel 概率测度被球上的值唯一决定的分离类性质，绕过了这一限制。基于此分离类，推导了时间依赖随机测度族的均匀收敛及对应非平稳随机场的弱不变原理，显式揭示了依赖与几何复杂度对样本路径正则性的影响。由于小球概率的快速衰减会阻碍基于上确界的差异度量存在极限分布，作者发展了 L^p 替代方案，直接利用前述收敛结果，从而规避了高阶 U-过程表述的需要。最后，在允许等距 Hilbert 嵌入的空间中，建立了退化与非退化多参数 U-过程的极限理论，并证明局部差异检验在动态参数下保持渐近稳定性。对您有用：本文对高阶 U-过程的极限理论及规避 U-过程表述的 L^p 方案，直接触及您在 higher-order U-statistics 与 nonparametric theory 的核心兴趣。
关键技术: metric-space ball property, separating class of Borel measures, weak invariance principle, L^p discrepancy measure, multi-parameter U-process limit theory, isometric Hilbertian embedding
为什么对您有用: 本文直接连接您 primary interest 中的 higher-order U-statistics 与 nonparametric theory 子方向：它在 Hilbert 嵌入空间中建立了多参数 U-过程的极限理论（退化与非退化），这与您 technical_arsenal 中 moderately_familiar 的 higher-order U-statistics theory 直接对话；同时，它提出的 L^p 差异度量方案声称可规避高阶 U-过程表述，您可以用 very_familiar 的 higher-order U-statistics computation (treewidth / tensor contraction / einsum) 视角审视其计算复杂度是否真的更优，或用 minimax bounds 验证其收敛率是否紧。Follow-up 判断：立即可做——用您 very_familiar 的 U-statistic 计算工具对比 L^p 方案与 U-过程方案的计算成本，并检查其极限理论中 influence function / projection 的结构是否与 HOIF 框架兼容。

数理统计 / 假设检验 (hypothesis_testing, 1 篇)¶

1. 2606.13629 — Valid Inference with Synthetic Data via Task Exchangeability¶

作者: Lezhi Tan, Tijana Zrnic
分类: stat.ME · cs.AI · cs.LG · stat.ML
相关性 5/10 · novelty: new_method
摘要: 在合成数据（如 LLM 生成的硅样本、autorater 评分）参与科学研究的前提下，本文研究如何在数据可能存在偏差、噪声和误设时仍保证推断的 validity。核心 estimand 是基于合成数据的总体参数，关键假设是“任务可交换性”（task exchangeability）：研究者能找到有真实数据的历史任务，使得当前任务与历史任务在适当数学意义下可交换。基于此假设，作者构造了利用历史真实数据校准合成数据偏差的推断方法，并给出超越可交换性的扩展保证；实证部分在公共意见调查（硅样本）和 AI 评估（autorater）上验证了框架。对您可能有用：该框架本质上是在非经典数据源下做 valid inference，与您在因果推断中处理 proxy/negative control 的 identification 逻辑有结构相似性。
关键技术: task exchangeability, valid inference with synthetic data, conformal-style calibration, bias correction via historical tasks, selective inference extensions
为什么对您有用: 本文直接触及您 primary interest 中的 hypothesis testing / valid inference：在合成数据这种非标准数据源下如何保证推断有效性，其 task exchangeability 假设与 proximal CI 中用 negative control 做 identification 的逻辑有结构对应（都是用辅助/代理数据校准偏差）。您在 causal inference 的 identification theory 方面是 moderately_familiar，可以从此角度切入审视其 exchangeability 假设的实质与局限。Follow-up 判断：中期可做——需先在 moderately_familiar 的 identification theory 上长肌肉，具体是将 proximal CI 的 negative control identification 逻辑与本文的 task exchangeability 做形式化对比，看能否在更一般的 proxy-data 设定下统一两者的 validity 保证。

统计计算 / 算法 (stat_computing, 1 篇)¶

1. 2606.12654 — Computationally tractable robust differentially private mean estimation¶

作者: Kelly Ramsay
分类: stat.ME · cs.LG · stat.ML
相关性 7/10 · novelty: new_method
摘要: 在差分隐私（DP）约束下，目标是估计重尾和污染椭圆分布的均值，同时保证算法的计算可行性与鲁棒性。本文提出 balloon mean estimator，基于在扩张的 Mahalanobis 球（"balloons"）上执行迭代裁剪（iterative clipping）程序。该估计器满足 zero-concentrated DP，依赖少量可解释调参，且计算复杂度低（tractable）。理论方面，在重尾与污染椭圆模型下刻画了其统计性能与鲁棒界，明确了收敛率与异常值容忍度。模拟表明在污染设定下优于现有 DP 均值估计器。对您有用：该工作在 DP 约束下实现了计算-统计权衡的显式刻画，属于 stat-computing tradeoff 的具体算法实例，其迭代裁剪与 Mahalanobis 球扩张机制可启发高维鲁棒估计的复杂度分析。
⚠️ 摘要不完整，待重跑（python -m research_news.rerun）
关键技术: iterative clipping, Mahalanobis ball expansion, zero-concentrated differential privacy, robust estimation under contamination, elliptical distribution theory, computational tractability
为什么对您有用: 本文直接触及 stat-computing tradeoff 的具体实例：在 DP 约束下显式给出计算可行（polynomial-time）且鲁棒的均值估计器，并刻画统计收敛率，属于您 gateway reading 的范畴。您可用 very_familiar 中的 minimax bounds 工具验证其声称的统计率是否紧，或用 moderately_familiar 中的 M-estimation theory 分析迭代裁剪的收敛性质。Follow-up 判断：中期可做——需先在 moderately_familiar 的 M-estimation theory 上长肌肉，以将 balloon mean 的鲁棒界推广到更一般的半参数模型或高维设定。

天体统计 (astrostats, 3 篇)¶

1. 2606.12496 — Artifact-Conditioned Interval Diagnostics for Flow-Matching Neural Posterior Estimation in a Controlled Gravitational-Wave Benchmark¶

作者: Zhi Luo, Qi-Qin Jing
分类: astro-ph.IM
相关性 7/10 · novelty: application
摘要: 本文研究引力波双黑洞参数推断中，神经后验估计器（flow-matching）在数据含伪迹（glitch、频域掩码、PSD失配）时的边际区间校准问题。核心estimator是support-aware conditional flow-matching posterior，辅以circular phase表示；校准方法比较了全局重缩放、oracle分层重缩放、硬标签重缩放与软标签LAIR（learned artifact-aware interval rescaling）。在1024-bin受控基准下，全局重缩放在频域掩码情形MA90CE=0.1195，LAIR降至0.0672，但LAIR并非一致优于原始FMPE区间；多seed审计表明此现象非单次分裂伪迹。分类器对频域掩码与PSD失配识别可靠，但对glitch召回率低；波形分辨率、先验与高斯基线、受控似然参考后验探测表明边际覆盖须与后验宽度、几何及似然诊断联合解读。LAIR在此基准中更适合作为伪迹结构化的区间诊断工具，而非后验验证替代品；对您而言，这是astrostats gateway reading，展示了simulation-based inference校准在非理想数据下的具体挑战。
关键技术: conditional flow-matching posterior estimation, marginal interval calibration, artifact-aware interval rescaling (LAIR), simulation-based inference, coverage diagnostics
为什么对您有用: 本文是astrostats方向的gateway reading：对引力波推断的simulation-based inference校准问题给出了清晰的数据-模型 exposition（频域掩码、glitch、PSD失配等伪迹结构），适合外行入门。武器库中的software development与high-dimensional asymptotics可支撑理解其FMPE实现与多seed审计流程，但核心的normalizing-flow / flow-matching训练细节需额外补课。值得花时间读全文以了解astrostats中neural posterior的校准实践，但方法学novelty有限（LAIR本质是soft stratified rescaling，非新理论）。

2. 2606.13600 — Feasibility of up-the-ramp sampling under variable sky for ground-based spectrographs¶

作者: Gaia Gaspar, Marcin Sawicki, Nelson Nunes, Rub\'en J. D\'iaz, James E. H. Turner
分类: astro-ph.IM
相关性 6/10 · novelty: application
摘要: 本文研究地面近红外光谱仪在可变天空背景下采用 up-the-ramp (UTR) 采样读出模式的可行性，核心 estimand 是线性 ramp 拟合在 K-band 天光 3-10% 分钟级波动下的 SNR 与系统偏差。方法上，基于 GIRMOS 数据模拟器与 Gemini-NIRI 实测天光经验波动（10-20s cadence），采用 Monte Carlo 模拟量化线性拟合偏差、信噪比损失及宇宙线 (CR) 误判率。结果表明：在 read-noise-limited 且避开天光发射线的区域，UTR 相比 Fowler 采样可节省 4-10% 观测时间；但在天光发射线区域，高信号水平导致 SNR 下降与 CR 误判率激增，天光波动本身并非主因，可通过自适应 CR 阈值缓解。对您而言，这是一篇典型的天文数据模拟与系统偏差评估案例，展示了统计模拟在仪器读出策略决策中的直接应用。
关键技术: Monte Carlo simulation, linear ramp fitting, signal-to-noise ratio analysis, systematic bias quantification, cosmic ray rejection threshold
为什么对您有用: 本文属于 astrostats gateway reading：(1) 作为入门读物非常合适，不依赖天文术语，清晰展示了观测策略（UTR vs Fowler）如何转化为线性模型拟合偏差与 SNR 的统计评估问题；(2) 武器库完全支撑进入此方向，very_familiar 的 Monte Carlo 模拟与 inverse problems with random noise 视角可直接理解其线性 ramp 拟合与 read-noise 分析；(3) 值得花时间读全文，以了解天文光谱仪数据的具体结构（多维 FITS、时间序列 reads）与系统误差建模方式，为未来可能的统计方法介入（如非线性 ramp 拟合或自适应阈值理论）建立场景认知。

3. 2606.13398 — A robust super-time-stepping scheme for Ohmic and ambipolar diffusion¶

作者: Giancarlo Mattia, Mario Flock, David Melon Fuksman, Anastasia Tzouvanou, Vittoria Berta, Daniele Crocco
分类: astro-ph.IM · physics.plasm-ph
相关性 5/10 · novelty: application
摘要: 本文针对非理想磁流体力学（MHD）中欧姆扩散与双极扩散的数值求解问题，旨在克服显式方法的严苛时间步长限制与传统子步进方法在强磁场梯度边界处的截断误差不稳定性。核心方法是基于 Gegenbauer 多项式稳定性构造的 Runge-Kutta-Gegenbauer (RKG) 超时间步进格式，该格式在强各向异性电阻率下保持鲁棒性，并降低了对边界截断误差的敏感性。作者将此方案嵌入 PLUTO 代码，通过磁重联与磁旋转不稳定性等经典天体物理基准测试验证了其精度与效率。作为天体物理数值计算的专业论文，其方法学 novelty 属于应用层面，但对关注统计计算与数值方法的研究者而言，是了解天体物理大规模 PDE 模拟计算约束的入门读物。
关键技术: super-time-stepping, Runge-Kutta-Gegenbauer scheme, non-ideal MHD diffusion, anisotropic resistivity, PLUTO code
为什么对您有用: 本文属于 astrostats gateway reading：(1) 作为入门读物尚可，它清晰展示了天体物理大规模模拟中 PDE 数值求解的计算瓶颈（时间步长约束与各向异性扩散），但未涉及统计推断或数据建模问题；(2) 研究者的武器库（software development, numerical methods）足以支撑理解本文的数值方案细节，但该方向（天体物理 MHD 模拟）与研究者核心的统计计算兴趣（矩阵/张量运算、U-statistic 复杂度）交集极浅；(3) 不值得花时间读全文，除非研究者未来有意涉足天体物理 PDE 数值软件的开发与优化。

经济理论 / 应用 (econ_theory, 2 篇)¶

1. 2606.12596 — Extending Prais-Winsten Regression to Panel Data with Higher-Order Autoregressive Errors: A Simulation Study¶

作者: Ariel Linden
分类: stat.ME
相关性 5/10 · novelty: application
摘要: 本文将 Prais-Winsten AR(k) 广义最小二乘变换扩展至面板数据，嵌入 Beck-Katz 面板校正标准误（PCSE）框架，并开发了 Stata 包 xtpraisk。核心 estimand 为面板回归系数，关键假设为误差项服从 AR(k) 过程；方法通过 GLS 变换消除序列相关性，保留首期观测的 Prais-Winsten 初始化。Monte Carlo 模拟覆盖 AR(1–3)、多种自相关强度、面板规模与序列长度组合（2000 次重复），系统比较 xtpraisk 与 Driscoll-Kraay 估计器（xtscc）的有限样本表现。结果显示 xtpraisk 在保持名义 Type I error 与置信区间覆盖率的同时具有更高 power，而 xtscc 在短序列下系统性低估标准误且 Type I error 膨胀，随 AR 阶数增加恶化；两者均无偏。对您而言，本文提供了面板数据高阶自相关校正的具体计算实现与模拟范式，可作为经济理论应用中处理时间序列依赖的参考。
关键技术: Prais-Winsten GLS transformation, panel-corrected standard errors (PCSE), Driscoll-Kraay estimator, Monte Carlo simulation design, AR(k) error process
为什么对您有用: 本文属于经济理论（面板数据计量）的应用与计算实现，直接展示了在短面板、高阶自相关下 GLS 变换与 Newey-West 型 HAC 估计的有限样本差异。对您而言，xtpraisk 的 GLS 变换矩阵构造与 Stata 实现涉及统计计算中的矩阵运算优化，属于您熟悉的软件开发与数值计算范畴；但本文纯为模拟比较与软件包展示，无理论 novelty（novelty_flag = application）。作为入门读物，本文清晰易懂、数据设定明确，但武器库中无需额外长肌肉即可理解全部技术细节；若您对面板自相关校正的数值实现感兴趣，值得花时间读全文以了解计算细节，否则快速浏览模拟结论即可。

2. 2606.13555 — Price Elasticity of Gas Demand on L1 and L2: Evidence from Ethereum and Arbitrum¶

作者: Pranay Anchuri, Akaki Mamageishvili
分类: econ.EM · cs.GT
相关性 5/10 · novelty: application
摘要: 本文在以太坊主网（L1）与 Arbitrum（L2）的区块链 gas 需求面板数据设定下，目标是估计因果价格弹性，关键假设是各钱包滞后 base fee 可作为有效 IV 以消除拥堵导致的内生性。核心方法为 two-way fixed effects panel IV 回归，辅以 per-resource 分解与行为聚类，纠正了 naive OLS 对需求敏感度的严重低估。理论性质上，IV 估计给出了 L1 弹性 -0.006 与 L2 弹性 -0.036 的显著估计，两者均近乎无弹性但 L2 更敏感；L2 资源分解显示计算 -0.027 至退款 -0.27 的异质性。实证结果为 fee mechanism 设计与模拟提供了经验基准，对您有用在于它展示了 IV 方法在数字经济学中的具体应用及异质性弹性分解模式。
关键技术: two-way fixed effects panel regression, lagged base fee instrumental variable, per-resource elasticity decomposition, behavioral clustering
为什么对您有用: 直接连接到经济理论中的 IV 与因果推断应用：用滞后 base fee 作 IV 解决拥堵内生性，是经典 panel IV 在新数据场景的实例。用您 very_familiar 的 identification theory in causal inference 可审视其 IV 有效性假设（exclusion/relevance）是否合理，并评估 per-resource 分解的 identification 逻辑。立即可做：用现有因果推断武器检查其 IV 假设与弹性估计的 robustness/sensitivity。

🗂 其他论文（仅 LLM 评分，未生成摘要）¶

未生成中文摘要的论文，按 LLM 评分由高到低排列，仅保留评分与简评，便于回溯查全。一般为相关性低于展示阈值者；个别历史页也含当时因单日摘要上限未展开的高分篇目（评分仍清楚标着）。

1. 2606.12842 — Connecting Polarization to Exoplanet Yield Calculations for HWO¶

作者: Jaren N. Ashcraft, Scott D. Will, Maxwell A. Millar-Blanchaer, Breann Sitarski, Christopher C. Stark, Ewan S. Douglas, John Krist, Brian Kern, Manuel Quijada
分类: astro-ph.IM · astro-ph.EP
相关性 4/10 · novelty: application
评分理由: Astrostats gateway: discusses data/model challenges for exoplanet yield and polarization aberrations, but abstract is thin on statistical methodology and assumes instrument-specific jargon.

2. 2606.12566 — Inferring resource selection and utilization distributions from irregular and error-prone animal tracking data¶

作者: Fanny Dupont, Brett T. McClintock, Jan-Ole Fischer, Marianne Marcoux, Nigel E. Hussey, Marie Auger-M\'eth\'e
分类: stat.ME
相关性 3/10
评分理由: Ecological tracking methodology; tangential to researcher's primary interests in causal inference, high-dim stats, and semiparametric theory.

3. 2606.12701 — Bayesian machine learning approach for recurrent events studies using Soft Bayesian Additive Regression Trees (SBART)¶

作者: MengXing Chen, Debajyoti Sinha, Antonio Linero
分类: stat.ME
相关性 3/10
评分理由: Nonparametric Bayesian ensemble for recurrent events; lacks connection to researcher's specific semiparametric efficiency or causal inference interests.

4. 2606.13213 — Calibrating simplified vine copulas with a noise contrastive estimation approach¶

作者: Michael Denis Kraus, David Huk, Claudia Czado
分类: stat.ME · stat.ML
相关性 3/10
评分理由: Vine copula estimation is tangential to the researcher's primary interests in causal inference, semiparametric theory, and high-dimensional statistics.

5. 2606.13280 — Generalization Bounds for Transformer-Based Next-Token Prediction in a Language Model¶

作者: Insung Kong, Niklas Dexheimer, Johannes Schmidt-Hieber
分类: math.ST · stat.TH
相关性 3/10
评分理由: Generalization bounds for transformers/LLMs; tangential to the researcher's primary interests in causal inference, high-dim asymptotics, or stat-comp tradeoffs.

6. 2606.12677 — Restricted Multivariate Spatial Modeling¶

作者: Jihyeon Kwon, Harrison Quick
分类: stat.ME
相关性 2/10
评分理由: Spatial epidemiology modeling using CAR/MCAR; purely domain-specific with no clear methodological overlap with researcher's primary or secondary interests.

7. 2606.13433 — Smoothed-KL Reweighting: A Principled Account and Matching Rule for SNR-Based Diffusion Training¶

作者: Lei Li
分类: stat.ME
相关性 2/10
评分理由: Diffusion model training weights are unrelated to the researcher's focus on causal inference, mathematical statistics, and computational tradeoffs.

8. 2606.12653 — What I Wish I had Known When I Began Building Astronomical Instruments¶

作者: Daniel Fabricant
分类: astro-ph.IM
相关性 2/10
评分理由: Personal essay on building astronomical hardware; lacks data/model exposition or statistical methodology, failing the astrostats gateway rubric entirely.

9. 2606.13440 — Japan's Possible Contributions for Coronagraph of the Habitable Worlds Observatory (HWO)¶

作者: Keigo Enya, Kenta Yoneta, Naoshi Murakami, Jun Nishikawa, Satoshi Itoh, Taro Matsuo, Reiki Kojima, Takayuki Kotani, Olivier Guyon, Takahiro Sumi, Satoshi Miyazaki, Toru Yamada, Aoi Takahashi, Hajime Kawahara, Shota Miyazaki, Iona Kondo, Nana Higashio, Noriko Yamasaki, Masayuki Kuzuhara, Motohide Tamura, Masahiro Ikoma, Norio Narita, Julien Lozi
分类: astro-ph.IM
相关性 2/10
评分理由: Pure hardware/instrumentation contribution paper for a coronagraph; no statistical data analysis or modeling question present, inaccessible as gateway reading.

10. 2606.13327 — Disclosure risk in a geo-spatial setting¶

作者: Peter-Paul de Wolf
分类: stat.ME · stat.OT
相关性 1/10
评分理由: Geo-spatial disclosure risk and the MAUP are unrelated to the researcher's primary and secondary interests.

Maintained by 陈星宇 · Homepage · Source on GitHub

2026-06-13 每日 arXiv 资讯¶

⭐ 高相关论文（按主题分组）¶

因果推断 (causal_inference, 3 篇)¶

1. 2606.13519 — Semiparametric Local Projections¶

2. 2606.13281 — Causal invariance in graphical models with latent variables¶

3. 2606.13305 — Semiparametric Bayesian inference for causal mediation in cluster randomized trials¶

高维统计 / 随机矩阵 (high_dim_rmt, 2 篇)¶

1. 2606.12943 — Phase transition of Schott's statistic for high-dimensional heavy-tailed data¶

2. 2606.12943 — Phase transition of Schott's statistic for high-dimensional heavy-tailed data¶

非参数 / 半参数 (nonparam_semipara, 1 篇)¶

1. 2606.12884 — Volterra--Wiener--Kunchenko Orthogonalization: From Wiener--Hermite to Distribution-Matched Volterra Bases¶

📌 中相关论文（按主题分组）¶

因果推断 (causal_inference, 2 篇)¶

1. 2606.13531 — When Representative Samples Produce Worse Outcomes: Scale-up Decisions and Testing in Small-Budget RCTs¶

2. 2606.13025 — Diagnostics-guided variance-inflated Fay-Herriot estimation from non-probability samples¶

非参数 / 半参数 (nonparam_semipara, 10 篇)¶

1. 2606.13230 — Consistency of variational approximations under bounded Kullback--Leibler divergence¶

2. 2606.13230 — Consistency of variational approximations under bounded Kullback--Leibler divergence¶

3. 2606.12739 — Estimating Semiparametric and Nonparametric Fixed Effects Panel Data Models with mgcv¶

4. 2606.13554 — Asymptotic regimes for maximum likelihood estimation in the Ewens--Pitman model: When the strength parameter matters¶

5. 2606.13554 — Asymptotic regimes for maximum likelihood estimation in the Ewens--Pitman model: When the strength parameter matters¶

6. 2606.13242 — Least Absolute Deviations Estimation for Sinusoidal Models¶

7. 2606.12857 — Discrepancy Modeling with Intermediate Variables: A New Framework for Robust Gaussian Process Calibration¶

8. 2606.13593 — Smoothed Rank-Based Regression Estimation Using Wilcoxon Score Functions¶

9. 2606.13084 — Characterizing metric-space-valued processes: separating classes and weak invariance principles for measure-theoretic inference¶

10. 2606.13084 — Characterizing metric-space-valued processes: separating classes and weak invariance principles for measure-theoretic inference¶

数理统计 / 假设检验 (hypothesis_testing, 1 篇)¶

1. 2606.13629 — Valid Inference with Synthetic Data via Task Exchangeability¶

统计计算 / 算法 (stat_computing, 1 篇)¶

1. 2606.12654 — Computationally tractable robust differentially private mean estimation¶

天体统计 (astrostats, 3 篇)¶

1. 2606.12496 — Artifact-Conditioned Interval Diagnostics for Flow-Matching Neural Posterior Estimation in a Controlled Gravitational-Wave Benchmark¶

2. 2606.13600 — Feasibility of up-the-ramp sampling under variable sky for ground-based spectrographs¶

3. 2606.13398 — A robust super-time-stepping scheme for Ohmic and ambipolar diffusion¶

经济理论 / 应用 (econ_theory, 2 篇)¶

1. 2606.12596 — Extending Prais-Winsten Regression to Panel Data with Higher-Order Autoregressive Errors: A Simulation Study¶

2. 2606.13555 — Price Elasticity of Gas Demand on L1 and L2: Evidence from Ethereum and Arbitrum¶

🗂 其他论文（仅 LLM 评分，未生成摘要）¶

1. 2606.12842 — Connecting Polarization to Exoplanet Yield Calculations for HWO¶

2. 2606.12566 — Inferring resource selection and utilization distributions from irregular and error-prone animal tracking data¶

3. 2606.12701 — Bayesian machine learning approach for recurrent events studies using Soft Bayesian Additive Regression Trees (SBART)¶

4. 2606.13213 — Calibrating simplified vine copulas with a noise contrastive estimation approach¶

5. 2606.13280 — Generalization Bounds for Transformer-Based Next-Token Prediction in a Language Model¶

6. 2606.12677 — Restricted Multivariate Spatial Modeling¶

7. 2606.13433 — Smoothed-KL Reweighting: A Principled Account and Matching Rule for SNR-Based Diffusion Training¶

8. 2606.12653 — What I Wish I had Known When I Began Building Astronomical Instruments¶

9. 2606.13440 — Japan's Possible Contributions for Coronagraph of the Habitable Worlds Observatory (HWO)¶

10. 2606.13327 — Disclosure risk in a geo-spatial setting¶

评论