2026-05-29 每日 arXiv 资讯¶

高相关论文 11 篇 · 中相关 13 篇 · 其他 15 篇 · 会议/Seminar 事件 0 条

⭐ 高相关论文（按主题分组）¶

因果推断 (causal_inference, 7 篇)¶

1. 2605.28341 — Identification and Inference for Structural Accelerated Failure Time Models via Instrument Interactions¶

作者: Qiushi Bu, Wen Su, Xinyu Zhang, Xingqiu Zhao, Zhonghua Liu
分类: stat.ME · math.ST · stat.TH
相关性 9/10 · novelty: new_method
摘要: 在右删失生存数据与未测量混杂下，本文针对结构加速失效时间模型（SAFTM），利用工具变量间的交互项构建了新的识别与推断框架。该方法不依赖经典 IV 有效性假设，只要基于交互项的识别条件成立，即可在有效与无效工具变量共存时进行有效因果推断。为处理右删失，作者通过 augmented IPCW 构造了删失调整的观测数据矩函数，该矩函数对干扰函数满足 Neyman 正交性并具有双重稳健性，允许灵活的干扰函数估计。推断采用广义经验似然（GEL），以适应多且弱的交互矩条件；在 many weak moment asymptotics 下证明了估计量的一致性与渐近正态性，并开发了识别强度与过度识别的诊断工具。对您可能有用：本文将正交性、GEL 与 many weak moments 结合的推断框架，为纵向/生存数据因果推断中的弱 IV 问题提供了新解法。
关键技术: structural accelerated failure time model, instrument interaction identification, Neyman orthogonality, augmented inverse probability censoring weighting, generalized empirical likelihood, many weak moment asymptotics
为什么对您有用: 直接连接因果推断中的 IV 与生存分析设定，其 many weak moment asymptotics 与 GEL 推断框架属于数学统计与假设检验的交叉。您 very_familiar 的 M-estimation 理论可直接切入分析其 GEL 估计量的渐近性质，而 moderately_familiar 的 identification theory 可用于审视其交互项识别条件的本质与一般化可能。中期可做：需先在 moderately_familiar 的 semiparametric theory 上长肌肉，以彻底厘清其 double robustness 与 Neyman orthogonality 在非标准矩条件下的效率界问题。

2. 2605.28749 — IV regression with distribution-valued outcomes¶

作者: David Van Dijcke, Kaspar W\"uthrich
分类: econ.EM · math.ST · stat.ME · stat.TH
相关性 9/10 · novelty: new_method
摘要: 在内生性设定下，目标是对分布型结果（distribution-valued outcome）进行 IV 回归，estimand 为 2-Wasserstein 空间中的条件分布回归系数函数。方法将全局 Fréchet 回归扩展至 IV 框架（IVFR），先利用 IV 权重构造加权分位曲线，再将其投影回有效分布空间以保证拟合结果合法，该投影在有限样本下可降低估计误差。理论上证明 IVFR 估计量弱收敛至均值零高斯过程，并建立 multiplier bootstrap 的一致有效性以进行均匀推断。模拟中投影步骤使 IMSE 降低达 63%；实证重访中国进口竞争对工资分布及食品券项目对出生体重分布的因果效应，所得均匀置信带比现有方法窄 9-10%。对您有用：本文将 IV 方法拓展至非欧分布空间，与您 primary interest 中的 IV 估计及 semiparametric theory 直接相关。
关键技术: IV Fréchet regression, 2-Wasserstein space, quantile curve projection, Gaussian process weak convergence, multiplier bootstrap, uniform inference
为什么对您有用: 本文直接连接您 primary interest 中的 IV 方法与 semiparametric theory，处理分布型结果的因果推断问题。您 technical_arsenal 中的 semiparametric theory 与 M-estimation theory 可用于分析其 Wasserstein 空间 M-estimator 的效率性质（如是否达到该空间下的 semiparametric efficiency bound）。follow-up 粗判：中期可做——需先在 moderately_familiar 的 M-estimation theory 上长肌肉，以攻克 Fréchet 回归在非欧空间中的效率界与最优推断问题。

3. 2605.27655 — Implementing the principal stratum strategy for intercurrent events with survival outcomes: a tutorial¶

作者: Xiaoxiao Zhou, Joyce Chen, Pallavi Mishra-Kalyani, Xiaoxue Li, Yuan Li Shen, Shu Wang, Susan Halabi, Fan Li
分类: stat.ME
相关性 8/10 · novelty: survey
摘要: 在 ICH E9 (R1) 估计量框架下，本文针对生存结局与二值并发事件（如治疗中断），目标是定义并估计主层因果效应（principal stratum causal effect），核心识别假设为 SUTVA、主层可忽略性及参数/半参数结构假设。方法上，文章系统梳理了两大估计路径：基于 mixture model 的参数极大似然估计（依赖 EM 算法），以及基于加权法的半参数估计（依赖 IPW 及 sensitivity analysis 中的 tipping-point/tilting 参数）。理论层面未推导新效率界或收敛率，重点在于实操：给出每种方法的 sensitivity analysis 方案及配套 R 代码，并通过模拟肿瘤临床试验比较两种方法的操作特征。对您而言，本文是主层策略在生存数据上应用的入门参考，适合快速了解该设定下的 identification 与 sensitivity analysis 现状，但方法学 novelty 较低。
关键技术: principal stratification, mixture model estimation, inverse probability weighting, sensitivity analysis, survival analysis
为什么对您有用: 本文直接涉及因果推断中的 principal stratification 识别与估计问题，属于您 primary interest 中 causal inference 的子方向。您武器库中的 semiparametric theory 与 M-estimation theory（moderately_familiar）可直接切入本文加权法的效率改进与 mixture model 的半参数化拓展，例如推导该设定下的 efficient influence function 或构造 one-step estimator。中期可做：需先在 moderately_familiar 的 semiparametric theory 上长肌肉，以突破现有方法仅提供参数/加权估计而缺乏半参数效率理论的局限。

4. 2605.28105 — Identifying Direct Causal Effects in Latent Factor Models by Accounting for Unidentified Parents¶

作者: Tom Hochsprung, Nils Sturma, Jakob Runge, Mathias Drton, Andreas Gerhardus
分类: stat.ME
相关性 8/10 · novelty: new_method
摘要: 在线性结构方程模型（含显式潜变量）设定下，目标是识别观测变量间的直接因果效应，提供基于观测协方差的（有理式）识别公式。现有基于潜变量投影（latent projection）的方法在观测变量被少数潜变量密集混淆时，往往无法认证多数效应的可识别性；而近期显式利用潜变量的算法在更密集的因果图上仍常给出无定论结果。本文提出新识别准则，核心洞见是递归识别方案可推广至显式计入具有（尚未识别直接效应的）因果父节点的情况，从而更好处理密集图。准则中的组合搜索问题可转化为网络流（network-flow）计算，由此构建了实用算法并提供了软件实现。对您可能有用：本文在潜变量线性SEM下推进了直接效应的识别理论，其网络流组合优化思路为高维因果图识别的计算代数方法提供了新切入点。
关键技术: linear structural equation models with latent variables, causal identification via rational formulas, latent projection framework, recursive identification schemes, network-flow computation, covariance constraints
为什么对您有用: 直接连接因果推断中的 identification theory 子方向，处理潜变量密集混淆下直接效应的识别问题。您 very_familiar 的 identification theory in causal inference 可直接评估其递归识别准则的完备性与局限，且 moderately_familiar 的 M-estimation theory 可用于后续将识别公式转化为估计量并推导渐近性质。立即可做：用现有因果识别理论工具审视其网络流算法在何种图结构下达到识别极限，并尝试构造反例。

5. 2605.28785 — Beyond Exchangeability: Distribution-Shift-Aware Integration of External Control Data in Randomized Trials¶

作者: Jiawei Shan, Yiteng Tu, Guanbo Wang, Chao Ying, Jiwei Zhao
分类: stat.ME
相关性 8/10 · novelty: new_method
摘要: 在 RCT 借用外部对照数据的设定下，目标是在 exchangeability 假设可能失效（即存在分布偏移）时，仍对试验人群的 ATE 进行有效估计。本文通过校准方程显式建模 RCT 与外部对照间的分布偏移，将仅基于试验的 efficient influence function 进行适配，构建了 augmented estimator 以充分利用外部信息。进一步提出 adaptive shrinkage estimator，在保证一致性的同时，理论上确保效率优于仅用 RCT 的 benchmark。仿真与真实数据验证了该方法在分布偏移下的实际优势。对您有用之处：该工作将 semiparametric efficiency theory 与 data integration 结合，是处理外部有效性与分布偏移权衡的典型范例。
关键技术: efficient influence function, distribution shift calibration, adaptive shrinkage estimation, external control borrowing, semiparametric efficiency bound
为什么对您有用: 直接连接因果推断中的 data integration / external validity 子方向，核心是 semiparametric efficiency theory 在分布偏移下的适配。您 very_familiar 的 estimation theory in causal inference 可直接用来审视其 influence function 适配逻辑，moderately_familiar 的 semiparametric theory 可用于验证其效率优势声称的紧致性。立即可做：用 very_familiar 的因果估计理论复现其 augmented estimator 并在典型分布偏移设定下检查其效率界是否可达。

6. 2605.28762 — Deep Neural Networks for Doubly Robust Estimation with Nonprobability Survey Samples¶

作者: Yufang Dai, Shihua Luo, Wendy Lou, Zilin Wang, Xuewen Lu
分类: math.ST · stat.AP · stat.CO · stat.ME · stat.ML · stat.TH
相关性 8/10 · novelty: new_method
摘要: 在非概率抽样与概率抽样数据融合设定下，目标是有限总体均值的双重稳健估计，假设抽样得分为未知非参数函数。本文提出 DNN 辅助的双重稳健框架：首先通过最大化结合两数据源的伪似然估计非参数 logit 抽样得分，并用 ADAM 优化 DNN 参数；随后将估计的抽样得分嵌入 DNN 辅助的逆概率加权（IPW）估计器与深度双重稳健（DR）估计器。在正则条件下建立了估计器的一致性与收敛速率，理论保证依赖于 DNN 的非参数收敛性质。模拟与 Pew/BRFSS 实证表明，在真实选择机制非线性时，该方法对倾向得分模型误设具有更强鲁棒性。对您有用：此工作将 DNN 嵌入 survey sampling 的 DR 估计，与您主攻的因果推断 DR 估计及半参数效率理论直接相关。
关键技术: doubly robust estimation, pseudo-likelihood estimation, deep neural network nonparametric regression, inverse probability weighting, convergence rate of DNN
为什么对您有用: 直接连接因果推断中的 DR 估计与半参数效率理论（主兴趣），特别是处理倾向得分非参数误设的鲁棒性。您可用 very_familiar 的高维渐近理论或 moderately_familiar 的半参数理论审视其 DNN 收敛速率与 DR 估计器的半参数效率界是否可达（当前论文仅证明一致性，未讨论效率）。立即可做：用半参数效率界验证其 DR 估计器是否达到 n^{-1/2}-CAN 及 semiparametric efficiency bound。

7. 2605.28762 — Deep Neural Networks for Doubly Robust Estimation with Nonprobability Survey Samples¶

作者: Yufang Dai, Shihua Luo, Wendy Lou, Zilin Wang, Xuewen Lu
分类: math.ST · stat.AP · stat.CO · stat.ME · stat.ML · stat.TH
相关性 8/10 · novelty: new_method
摘要: 在非概率抽样与概率抽样数据融合设定下，目标是有限总体均值的双重稳健估计，假设抽样得分为未知非参数函数。本文提出 DNN 辅助的双重稳健框架：首先通过最大化结合两数据源的伪似然估计非参数 logit 抽样得分，并用 ADAM 优化 DNN 参数；随后将估计的抽样得分嵌入 DNN 辅助的逆概率加权（IPW）估计器与深度双重稳健（DR）估计器。在正则条件下建立了估计器的一致性与收敛速率，理论保证依赖于 DNN 的非参数收敛性质。模拟与 Pew/BRFSS 实证表明，在真实选择机制非线性时，该方法对倾向得分模型误设具有更强鲁棒性。对您有用：此工作将 DNN 嵌入 survey sampling 的 DR 估计，与您主攻的因果推断 DR 估计及半参数效率理论直接相关。
关键技术: doubly robust estimation, pseudo-likelihood estimation, deep neural network nonparametric regression, inverse probability weighting, convergence rate of DNN
为什么对您有用: 直接连接因果推断中的 DR 估计与半参数效率理论（主兴趣），特别是处理倾向得分非参数误设的鲁棒性。您可用 very_familiar 的高维渐近理论或 moderately_familiar 的半参数理论审视其 DNN 收敛速率与 DR 估计器的半参数效率界是否可达（当前论文仅证明一致性，未讨论效率）。立即可做：用半参数效率界验证其 DR 估计器是否达到 n^{-1/2}-CAN 及 semiparametric efficiency bound。

天体统计 (astrostats, 2 篇)¶

1. 2605.27527 — Probabilistic Data-Driven Modelling of Astrophysical Transients: The Neural Process Family for Ultrafast and Class-Agnostic Light Curve Reconstruction with NightLANP¶

作者: Siddharth Chaini, Federica B. Bianco, Ashish Mahabal
分类: astro-ph.IM · cs.LG
相关性 8/10 · novelty: new_method
摘要: 在 Rubin 天文台 LSST 稀疏不规则多波段 cadence 设定下，目标是实现瞬变天体光变曲线的快速概率插值与重建。核心方法是将 Attentive Neural Processes (ANP) 应用于光变曲线重建，通过在多样化模拟瞬变信号上进行 meta-learning，将计算开销转移至训练阶段，从而实现 amortized inference。ANP 同时对六个波段进行联合插值，避免了 Gaussian Process (GP) 需逐条拟合、难以处理跨波段相关性及需预设核函数的缺陷。在 15 个瞬变类型的真实 Rubin cadence 模拟数据上，ANP 在回归质量、天体物理特征恢复及概率校准上均优于 GP 与标准神经网络基准。推理速度达微秒级，比 GP 快五个数量级，且避免了标准神经网络的过度自信与 GP 的欠自信问题，提供校准良好的不确定性量化。作为 astrostats 入门，本文清晰展示了大规模不规则天文数据的核心挑战与概率建模方案，对您关注的不规则观测下的统计计算与概率推断有直接参考价值。
关键技术: Attentive Neural Processes, amortized inference, meta-learning, probabilistic calibration, multi-band light curve interpolation, Gaussian Process benchmark
为什么对您有用: (1) 本文是优秀的 astrostats 入门读物，清晰展示了 LSST 时代稀疏不规则多波段观测的数据结构（噪声、选择效应、规模）与概率建模挑战，不假设天文先验知识；(2) 武器库中的软件工程能力与高维渐近理论足以支撑进入此方向，但 Neural Processes 的深度学习训练细节需额外补充；(3) 值得花时间读全文，特别是其概率校准与跨波段相关性的处理思路，对理解大规模不规则数据的统计计算权衡很有启发。

2. 2605.27538 — VROOM-SBI: A Fast Simulation-Based Bayesian Inference Methodology for QU-Fitting¶

作者: Arpan Pal, Preshanth Jagannathan
分类: astro-ph.IM · astro-ph.CO · astro-ph.GA
相关性 8/10 · novelty: application
摘要: 本文针对射电天文中的视线Faraday旋转QU-fitting问题，传统逐像素Bayesian推断因高维积分计算代价过大而无法用于巡天尺度。作者提出VROOM-SBI，用simulation-based inference（具体为neural posterior estimation）替代MCMC，将Stokes Q/U光谱的参数后验估计转化为条件密度回归问题。方法在四种标准退偏振模型上训练，推断精度与Faraday合成及经典QU-fitting可比，同时实现约500倍加速。模型与代码已开源并提供HuggingFace预训练权重。对您而言，这是一篇将SBI/NPE应用于天文数据的好入门读物，展示了替代MCMC的快速推断范式。
关键技术: simulation-based inference, neural posterior estimation, Faraday rotation QU-fitting, Bayesian inference acceleration, Stokes Q/U spectra modeling
为什么对您有用: (1) 作为astrostats的gateway reading，本文对天文背景（Faraday旋转、退偏振模型）和统计模型（Stokes光谱似然、参数后验）交代清晰，适合统计学者入门；(2) 武器库中的software development与Bayesian推断知识足以支撑理解并复现此SBI流程，无需额外长肌肉；(3) 值得花时间读全文以了解SBI在天文巡天尺度推断中的实际计算加速效果与精度折中。

其他 (other, 2 篇)¶

1. 2605.28429 — The 'Right' Extension of Type-I Error to Data-Dependent Levels¶

作者: Nick W. Koning
分类: math.ST · stat.TH
相关性 8/10
摘要: 在数据依赖（data-dependent）显著性水平下，本文研究如何将经典第一类错误概念合理推广至随机阈值设定。核心目标是证明现有文献中基于启发式与 E-value 联系的推广定义是唯一的"正确"形式。作者提出三条公理（嵌套经典常水平有效性、保持数据依赖水平下的经典有效性、关于拒绝声明强度的单调性），证明该推广是满足此三条公理的唯一延拓。进一步，将此结果应用于支撑 E-value 的常见定义：证明 E-value 恰是广义假设检验（可在数据驱动水平下拒绝）有效性的"正确"数值表征。对您有用：为数据依赖水平下的假设检验提供了严格的公理化基础，直接补充了您在 mathematical statistics & hypothesis testing 方向的理论工具箱。
关键技术: post-hoc hypothesis testing, data-dependent significance level, E-value, axiomatic characterization, generalized hypothesis test

2. 2605.28429 — The 'Right' Extension of Type-I Error to Data-Dependent Levels¶

作者: Nick W. Koning
分类: math.ST · stat.TH
相关性 8/10
摘要: 在数据依赖（data-dependent）显著性水平下，本文研究如何将经典第一类错误概念合理推广至随机阈值设定。核心目标是证明现有文献中基于启发式与 E-value 联系的推广定义是唯一的"正确"形式。作者提出三条公理（嵌套经典常水平有效性、保持数据依赖水平下的经典有效性、关于拒绝声明强度的单调性），证明该推广是满足此三条公理的唯一延拓。进一步，将此结果应用于支撑 E-value 的常见定义：证明 E-value 恰是广义假设检验（可在数据驱动水平下拒绝）有效性的"正确"数值表征。对您有用：为数据依赖水平下的假设检验提供了严格的公理化基础，直接补充了您在 mathematical statistics & hypothesis testing 方向的理论工具箱。
关键技术: post-hoc hypothesis testing, data-dependent significance level, E-value, axiomatic characterization, generalized hypothesis test

📌 中相关论文（按主题分组）¶

因果推断 (causal_inference, 1 篇)¶

1. 2605.28471 — The Modified Egger Intercept Tests for Detecting Horizontal Pleiotropy in Two-Sample Summary-Data Mendelian Randomization¶

作者: Yilei Ma, Youpeng Su, Xin Liu, Xuanye Cui, Ping Yin, Peng Wang
分类: stat.ME
相关性 6/10 · novelty: new_method
摘要: 在两样本汇总数据孟德尔随机化（MR）设定下，目标是检验水平多效性（即IV假设的违反），estimand为Egger截距（EI）。经典EI检验因Egger回归中存在的测量误差与winner's curse导致估计有偏，从而造成第一类错误失准。本文提出修正Egger截距（MEI）检验，利用近期发展的rerandomized IVW估计量在无方向性或相关性多效性原假设下构建偏差校正的EI估计量，并证明了其在实际条件下的渐近性质。进一步发现检验功效受SNP编码方向影响，为此结合两种特定等位基因编码方案下的MEI统计量以提升功效的稳健性。模拟与真实数据表明，组合检验在控制第一类错误与功效上均优于传统EI检验。对您可能有用：本文处理MR中IV假设的敏感性检验问题，属于流行病学因果推断的常见设定。
关键技术: Mendelian randomization, Egger intercept test, rerandomized IVW estimator, measurement error correction, winner's curse adjustment, instrumental variable assumption testing
为什么对您有用: 本文直接涉及流行病学因果推断中IV假设的敏感性检验（水平多效性检测），是IV方法在基因数据中的具体应用。研究者可用semiparametric theory与M-estimation theory中的偏差校正工具审视其渐近性质证明，或用higher-order U-statistics视角分析其组合检验的功效性质。属于中期可做：需先在moderately_familiar的M-estimation theory上补充测量误差模型与winner's curse的局部渐近理论，方能严格审视其偏差校正的更高阶性质。

非参数 / 半参数 (nonparam_semipara, 2 篇)¶

1. 2605.27718 — Robust Moment-Based Estimation via Spectral Gradient Reweighting¶

作者: Liu Zhang, Amit Singer
分类: math.ST · cs.LG · stat.ME · stat.TH
相关性 6/10 · novelty: new_method
摘要: 在广义矩估计（GMM）框架下，本文研究当似然不可用或计算不便时，如何对含离群值的污染数据进行稳健参数推断，设定为强污染模型（strong contamination）。核心方法 SGR-GMM 通过谱梯度重加权（SGR）原语对逐样本梯度进行软重加权：在固定中心点下，SGR 被建模为样本权重玩家与密度矩阵玩家之间的熵正则化谱博弈，并利用矩阵乘法权重（Matrix Multiplicative Weights）的 regret 界进行分析。理论层面，作者证明了 SGR 原语更新的显式收敛半径与有限步终止界，并给出了局部有限样本参数估计误差界，该界对污染比例、内点梯度稳定性、局部 GMM 识别强度及优化精度有显式依赖。应用上，将 SGR-GMM 特化为稳健对角加权 GMM（DGMM），用于加性高斯噪声与强污染下的异方差低秩高斯混合估计。对您可能有用：其逐样本梯度重加权与谱博弈框架，可为高维 M-estimation 的稳健性分析或 influence function 的重加权视角提供新工具。
关键技术: generalized method of moments, spectral gradient reweighting, matrix multiplicative weights, entropy-regularized spectral game, robust estimation under contamination, diagonally-weighted GMM
为什么对您有用: 本文连接到 semiparametric & nonparametric theory 中的 M-estimation 稳健性及 efficiency theory 中的 influence function 视角（逐样本梯度重加权本质是对 influence function 的谱修正）。用您 very_familiar 的 M-estimation theory 可以直接审视其局部识别强度与误差界的紧致性，且 moderately_familiar 的 semiparametric theory 可用于探讨该重加权机制是否逼近稳健 semiparametric efficiency bound。Follow-up 判断：立即可做——用 M-estimation 与 influence function 工具验证其有限样本界在经典 GMM 设定下是否 sharp，并探索重加权估计量的 asymptotic efficiency。

2. 2605.27718 — Robust Moment-Based Estimation via Spectral Gradient Reweighting¶

作者: Liu Zhang, Amit Singer
分类: math.ST · cs.LG · stat.ME · stat.TH
相关性 6/10 · novelty: new_method
摘要: 在广义矩估计（GMM）框架下，本文研究当似然不可用或计算不便时，如何对含离群值的污染数据进行稳健参数推断，设定为强污染模型（strong contamination）。核心方法 SGR-GMM 通过谱梯度重加权（SGR）原语对逐样本梯度进行软重加权：在固定中心点下，SGR 被建模为样本权重玩家与密度矩阵玩家之间的熵正则化谱博弈，并利用矩阵乘法权重（Matrix Multiplicative Weights）的 regret 界进行分析。理论层面，作者证明了 SGR 原语更新的显式收敛半径与有限步终止界，并给出了局部有限样本参数估计误差界，该界对污染比例、内点梯度稳定性、局部 GMM 识别强度及优化精度有显式依赖。应用上，将 SGR-GMM 特化为稳健对角加权 GMM（DGMM），用于加性高斯噪声与强污染下的异方差低秩高斯混合估计。对您可能有用：其逐样本梯度重加权与谱博弈框架，可为高维 M-estimation 的稳健性分析或 influence function 的重加权视角提供新工具。
关键技术: generalized method of moments, spectral gradient reweighting, matrix multiplicative weights, entropy-regularized spectral game, robust estimation under contamination, diagonally-weighted GMM
为什么对您有用: 本文连接到 semiparametric & nonparametric theory 中的 M-estimation 稳健性及 efficiency theory 中的 influence function 视角（逐样本梯度重加权本质是对 influence function 的谱修正）。用您 very_familiar 的 M-estimation theory 可以直接审视其局部识别强度与误差界的紧致性，且 moderately_familiar 的 semiparametric theory 可用于探讨该重加权机制是否逼近稳健 semiparametric efficiency bound。Follow-up 判断：立即可做——用 M-estimation 与 influence function 工具验证其有限样本界在经典 GMM 设定下是否 sharp，并探索重加权估计量的 asymptotic efficiency。

数理统计 / 假设检验 (hypothesis_testing, 4 篇)¶

1. 2605.27625 — Admissibility of Adaptive Monotone Step-Down Multiple Testing Procedures Under Arbitrary Covariance Dependence¶

作者: Prasenjit Ghosh, Arijit Chakrabarti
分类: math.ST · stat.TH
相关性 7/10 · novelty: new_theory
摘要: 在多元正态均值 \(\boldsymbol{X}\sim N_n(\boldsymbol{\theta}, \boldsymbol{\Sigma})\) 且协方差阵 \(\boldsymbol{\Sigma}\) 已知但任意的设定下，本文研究同时检验 \(H_{0i}:\theta_i=0\) 的 step-down 多重检验程序的容许性（admissibility）。核心方法考虑一类基于残差的单调 step-down 程序，其通过条件正态分布下标准化残差统计量的局部自适应严格单调变换来迭代排序活跃假设。主要定理证明，所有此类程序在以各分量 0-1 损失构成的向量值损失函数下均是容许的。证明依赖对诱导接受域的精细几何分析以及自适应阶段拒绝指标的结构不变性。该结果将 Cohen et al. (2009) 仅针对最大残差下降程序的容许性理论大幅推广，揭示出依赖下的容许性本质上由残差统计量诱导的单调排序结构驱动，而非检验规则的具体函数形式。对您有用：本文在任意协方差依赖下推进了多重检验的容许性理论，直接关联您在 mathematical statistics 中 hypothesis testing 的兴趣。
关键技术: monotone step-down multiple testing, admissibility under dependence, residual-based test statistics, conditional normal distribution, vector-valued loss function, geometric analysis of acceptance regions
为什么对您有用: 本文直接关联您 primary interest 中的 mathematical statistics (hypothesis testing) 子方向，在任意协方差依赖下给出了 step-down 多重检验的容许性刻画。您武器库中 very_familiar 的 minimax bounds for estimation problems 与 moderately_familiar 的 M-estimation theory 中的容许性/极小极大分析逻辑可与此处向量值损失下的容许性证明对话，特别是接受域几何分析可能用高维渐近视角重新审视。Follow-up 判断：中期可做——需先在 moderately_familiar 的 M-estimation theory 上补充多重检验容许性与向量值损失决策理论的细节，方可尝试将此处的残差单调排序结构推广至半参数或高维设定。

2. 2605.27625 — Admissibility of Adaptive Monotone Step-Down Multiple Testing Procedures Under Arbitrary Covariance Dependence¶

作者: Prasenjit Ghosh, Arijit Chakrabarti
分类: math.ST · stat.TH
相关性 7/10 · novelty: new_theory
摘要: 在多元正态均值 \(\boldsymbol{X}\sim N_n(\boldsymbol{\theta}, \boldsymbol{\Sigma})\) 且协方差阵 \(\boldsymbol{\Sigma}\) 已知但任意的设定下，本文研究同时检验 \(H_{0i}:\theta_i=0\) 的 step-down 多重检验程序的容许性（admissibility）。核心方法考虑一类基于残差的单调 step-down 程序，其通过条件正态分布下标准化残差统计量的局部自适应严格单调变换来迭代排序活跃假设。主要定理证明，所有此类程序在以各分量 0-1 损失构成的向量值损失函数下均是容许的。证明依赖对诱导接受域的精细几何分析以及自适应阶段拒绝指标的结构不变性。该结果将 Cohen et al. (2009) 仅针对最大残差下降程序的容许性理论大幅推广，揭示出依赖下的容许性本质上由残差统计量诱导的单调排序结构驱动，而非检验规则的具体函数形式。对您有用：本文在任意协方差依赖下推进了多重检验的容许性理论，直接关联您在 mathematical statistics 中 hypothesis testing 的兴趣。
关键技术: monotone step-down multiple testing, admissibility under dependence, residual-based test statistics, conditional normal distribution, vector-valued loss function, geometric analysis of acceptance regions
为什么对您有用: 本文直接关联您 primary interest 中的 mathematical statistics (hypothesis testing) 子方向，在任意协方差依赖下给出了 step-down 多重检验的容许性刻画。您武器库中 very_familiar 的 minimax bounds for estimation problems 与 moderately_familiar 的 M-estimation theory 中的容许性/极小极大分析逻辑可与此处向量值损失下的容许性证明对话，特别是接受域几何分析可能用高维渐近视角重新审视。Follow-up 判断：中期可做——需先在 moderately_familiar 的 M-estimation theory 上补充多重检验容许性与向量值损失决策理论的细节，方可尝试将此处的残差单调排序结构推广至半参数或高维设定。

3. 2605.27664 — BOOST: Power-Optimal Strong-FWER Testing for Block-Structured Multiplicity¶

作者: Prasanjit Dubey, Xiaoming Huo
分类: stat.ME
相关性 5/10 · novelty: new_method
摘要: 在具有设计预设块结构的多重检验问题（如 gatekeeping 试验、eQTL 映射、捆绑 A/B 测试）中，目标是在强 FWER 控制下实现功效最优。现有方法（Bonferroni, Holm, 闭检验等）在块可分类中均非功效最优；本文提出 BOOST 方法，针对块大小为 3 的情形给出了功效最优的强 FWER 检验程序。核心机制包括：基于等边际 KKT 条件在异质块间进行最优功效分配，可通过二分搜索在 O(B log(1/ε)) 内求解；有限样本下保证 O(K) 计算成本的强 FWER 有效性（无需独立性假设），且在跨块独立时严格优于 Sidak；针对未知备择密度 g，提出样本分割 plug-in 变体，α 水平膨胀为 O(B_T E||g-ĝ||_∞)，单假设功效损失不依赖于总块数 B_T。模拟显示在独立、等相关、稀疏及误设设定下，功效较最强基线提升 1.4-1.7 倍；在 BLUEPRINT eQTL 与 Upworthy A/B 实验数据上，全块发现数提升一个数量级。对您可能有用：本文为多重检验中的功效最优性与计算约束提供了精确的数学刻画，直接关联 hypothesis testing 与统计计算（O(K) vs O(K^2)）的交叉研究。
关键技术: strong FWER control, power-optimal testing, closed testing, equalized-marginal KKT condition, sample-splitting plug-in, block-separable multiple testing
为什么对您有用: 直接关联 primary interest 中的 hypothesis testing 子方向，特别是有限样本下多重检验的功效最优性与计算复杂度权衡（O(K) vs O(K^2) 闭检验）。您 very_familiar 的 minimax bounds for estimation problems 与软件开发的组合可以攻本文的口子：例如用 minimax 视角审视 plug-in 变体中 O(B_T E||g-ĝ||_∞) 膨胀项在非参数备择下的紧性，或将 O(K) 算法实现为高效 einsum/tensor-contraction 软件包。follow-up 粗判：立即可做——用 very_familiar 的非参估计理论与软件开发能力即可复现并拓展其算法实现与理论紧性分析。

4. 2605.28349 — Robust Inference for Dyadic Data with Dependent Ordered Nodes¶

作者: Ulrich Hounyo, Jiahao Lin, Xiaojun Song
分类: econ.EM · stat.AP
相关性 5/10 · novelty: new_method
摘要: 在 dyadic regression 设定下，传统假设认为仅共享节点的 dyad 才存在依赖，本文研究当节点有序且相邻节点受共同潜冲击时该范式失效的推断问题。此时无公共端点但节点序相近的 dyad 对仍具协方差，虽单对协方差弱，但此类对的数量随样本量发散，其聚合贡献对渐近方差不可忽略。作者提出两种方法：一是保留近邻节点 dyad 间协方差项的 dependent-node dyadic CRVE；二是行-列移动块 Jackknife，删除相邻节点块及关联 dyad。在沿节点序的弱依赖条件下，证明了两种程序的渐近有效性；MC 模拟表明考虑有序节点依赖可显著改善 size control，且 Jackknife 版本有限样本表现更稳定。对您有用：本文将依赖结构从经典 dyadic 扩展至有序近邻，其 CRVE 与 block jackknife 的渐近方差分析直接关联您在假设检验与高阶 U-statistic 投影中的技术储备。
关键技术: dyadic cluster-robust variance estimator, moving-block jackknife, ordered-node dependence, weak dependence asymptotics, dyadic regression
为什么对您有用: 本文直接处理 dyadic 数据在有序节点依赖下的假设检验与方差估计问题，属于您 primary interest 中的假设检验与数学统计方向。您 very_familiar 的高阶 U-statistic 计算与理论可用于剖析此处 CRVE 的聚合方差项结构（近邻 dyad 对的协方差求和本质上是 U-statistic 的 Hoeffding 分解问题），moderately_familiar 的 M-estimation 理论可支撑对估计量渐近性质的严格验证。立即可做：用 very_familiar 的高阶 U-statistic 投影与 treewidth 视角，分析其 CRVE 中近邻协方差项求和的计算复杂度与渐近贡献，验证其声称的方差估计一致性。

天体统计 (astrostats, 4 篇)¶

1. 2605.27562 — A Semi-Supervised Variational Autoencoder for Generating Neutron Star Equations of State¶

作者: Tianqi Zhao, Fanglida Yan, Alex Ross, James M. Lattimer
分类: astro-ph.IM · astro-ph.HE
相关性 7/10 · novelty: application
摘要: 本文在半监督变分自编码器（SSVAE）框架下，研究如何重构与生成中子星状态方程（EOS）。模型设定将高维 EOS 数据映射至低维潜空间，其中潜变量分为两部分：有物理观测监督的变量（最大质量 M_max 与标准半径 R_1.4）以及自动学习 EOS 额外特征（如壳-核转变区附近性质）的无监督变分潜变量。在 Skyrme EOS 数据集上训练后，仅需两个监督观测变量与一个无监督潜变量即可高保真重构 EOS，M_max 与 R_1.4 的平均绝对百分比误差低于 0.14%。从潜空间采样新 EOS 满足因果律与热力学稳定性，且与物理先验约束一致，可直接用于多信使贝叶斯推断。对您而言，这是一篇不错的 astrostats 入门读物，展示了深度生成模型如何为物理状态方程提供紧凑且可解释的参数化。
关键技术: semi-supervised variational autoencoder, latent variable model, Bayesian inference, multimessenger astronomy, equation of state reconstruction
为什么对您有用: (1) 本文属于 astrostats 方向的 gateway reading：物理背景交代清晰，数据（Skyrme EOS）与模型（SSVAE 潜空间结构）对应关系明确，适合统计学者入门中子星 EOS 这一数据建模场景。(2) 研究者的武器库（software development, high-dimensional asymptotics）足以支撑理解本文的 VAE 训练与潜空间推断，但若想在此方向做方法学改进（如对潜空间估计量建立半参效率界或 minimax 率），需先在 moderately_familiar 的 semiparametric theory 上长肌肉。(3) 值得花时间读全文，以评估将 debiased ML / semiparametric efficiency 引入 EOS 贝叶斯推断的可行性。

2. 2605.28572 — Unsupervised Morphological Characterization of Gravitational-Wave Glitches in LIGO O4a Using Frozen DINOv2 Features¶

作者: Luca Cirfeta
分类: astro-ph.IM
相关性 6/10 · novelty: application
摘要: 本文研究 LIGO O4a 观测轮次中引力波探测器噪声瞬变（glitches）的无监督形态学表征问题，核心设定为判断 O4a 是否引入了此前未见的新 glitch 形态。方法上，从 Q-transform 时频图的 384 维视觉嵌入出发，使用冻结的 DINOv2 ViTS/14 提取特征（无需标注数据），经 PCA 与 cosine 度量 UMAP 降维后，由 Dirichlet Process Mixture Model (DPMM) 进行聚类。通过消融实验、超参数扰动稳定性分析及与 Gravity Spy O3b 参考集的交叉验证系统评估聚类鲁棒性，并用 time-slide 背景测试排除 H1-L1 统计显著巧合。对 188,000+ 张时频图的分析得到零结果：所有异常聚类均以 >0.98 的余弦相似度映射到已知 Gravity Spy 类别，且 L1 嵌入鲁棒性（ARI > 0.90）显著高于 H1（ARI ~ 0.68-0.90），暗示 H1 噪声流形在 DINOv2 特征提取下存在结构性差异。作为天文统计入门读物，本文清晰展示了引力波数据中非平稳噪声的形态学分析全流程及零结果基线的建立。
关键技术: DINOv2 Vision Transformer, Q-transform spectrogram, Dirichlet Process Mixture Model, UMAP dimensionality reduction, time-slide background test, Adjusted Rand Index (ARI)
为什么对您有用: (1) 本文是天文统计的优秀入门读物，清晰展示了引力波数据中 glitch 形态学分析的数据结构（Q-transform 时频图）、模型假设（DPMM 聚类）与统计检验（time-slide 背景测试），无需天文先验知识即可理解。(2) 研究者的武器库（高维渐近理论、非参统计、软件开发）完全足以支撑进入此方向，甚至可对 DPMM 聚类的高维渐近性质或 UMAP 降维的统计保证进行理论深化。(3) 值得花时间读全文，以了解现代无监督深度特征在天文大数据中的实际统计表现及零结果的严谨验证范式。

3. 2605.28457 — Noise Suppression and Radio Frequency Interference Rejection for Self-Triggered Radio Detectors of Extensive Air Showers¶

作者: Pengfei Zhang, Xin Xu, Hanrui Wang, Xing Xu, Bohao Duan, Feng Wei, Hongwei Pan, Xishui Tian, Yi Zhang, Pengxiong Ma, Olivier Martineau-Huynh
分类: astro-ph.IM
相关性 5/10 · novelty: application
摘要: 本文针对下一代自触发射电探测器在真实射频干扰（RFI）环境下的灵敏度与触发可靠性问题，提出了一套端到端的系统级设计方法。核心设定是在银河系噪声主导的频段下，将宇宙线或中微子的广延大气簇射（EAS）信号与外部天空背景、内部器件噪声及人为非平稳干扰区分开。方法上采用系统级整合，将天空噪声建模、射频链路噪声预算、电磁兼容（EMC）抑制与测量驱动的验证相干结合，并引入基于ADC级差分内部噪声测量的间接噪声量化方法来估计低噪声放大器贡献。实验验证表明，该探测器单元在核心频段内可运行于接近银河系噪声极限的状态，且能在系统响应层面区分EAS信号与人为干扰。作为天文统计入门读物，本文清晰展示了射电天文数据中噪声结构、选择效应与硬件响应的耦合，对关注天文数据建模的统计学者有参考价值。
关键技术: sky-noise modeling, RF-chain noise budgeting, differential internal-noise measurement, EMC mitigation, self-triggered radio detection
为什么对您有用: 本文是面向统计学者的优秀天文数据入门读物，清晰阐述了射电天文中的核心数据挑战（非平稳人为干扰、银河系背景噪声与内部噪声的权衡），而非单纯的天体物理结果。研究者的武器库足以支撑进入此方向的数据建模部分（如非参数噪声建模、逆问题），但本文核心是硬件与信号处理层面的系统设计，统计方法学novelty极低。是否值得花时间读全文：若对射电天文数据结构好奇可速读引言与方法论框架，但无需深究硬件细节。

4. 2605.28373 — FARSim: a fast RF-chain-aware trigger-screening surrogate for radio detection of ultra-high-energy cosmic rays¶

作者: Xin Xu, Pengfei Zhang, Fufu Yang, Pengxiong Ma, Ramesh Koirala, Chao Zhang
分类: astro-ph.IM
相关性 4/10 · novelty: application
摘要: 在射电阵列探测超高能宇宙线的设定下，针对全 Monte Carlo 模拟（ZHAireS）在触发与阵列布局研究中计算代价过高的问题，本文提出快速代理框架 FARSim。核心机制是复用精简的参考射电足迹库，通过地磁场与电荷过剩场分解、地磁角与能量缩放、几何投影及基于等高线的核心位置采样来重构地面射电辐射，并估计触发观测量。进一步，将峰值场代理扩展到时域电场合成：结合预测的三分量峰值场向量与几何依赖的归一化脉冲模板，再通过 RF 链路响应传播实现电压域阈值与 L1 触发诊断。验证表明，时域扩展在真实峰值幅度下对 2112 条保留轨迹的中位向量波形 R^2 达 0.986，且相比重复全模拟大幅降低计算量。对您而言，这是一篇优秀的 astrostats 入门读物，清晰展示了物理模拟降维与统计代理模型的结合，但方法学 novelty 有限。
关键技术: physics-informed surrogate model, vector field decomposition, template-based time-domain synthesis, RF-chain response propagation, contour-based core sampling
为什么对您有用: (1) 本文是极好的 astrostats 入门读物，对宇宙线射电探测的物理设定、数据结构（足迹、脉冲模板）和计算瓶颈（全模拟代价）交代清晰，不预设天文背景；(2) 武器库足以支撑进入此方向，其核心是数值计算与软件优化，研究者 very_familiar 的软件设计与高维渐近理论可无缝对接代理模型的构建与误差分析；(3) 值得花时间读全文以了解天文实验中统计代理模型的具体需求，但若寻求方法学突破（如非参代理的 minimax 界），需另寻更偏统计的切入点。

其他 (other, 2 篇)¶

1. 2605.27711 — Improving Power in Randomized Controlled Trials with Time-to-Event Endpoints: A Risk-Free Approach¶

作者: Junyi Zhou, Qing Liu, May Mo, Amy Xia
分类: stat.ME
相关性 7/10 · novelty: new_method
摘要: 在RCT时间-事件终点设定下，目标是利用外部/历史数据提升边际hazard ratio (HR) 的估计效率且不引入偏差或膨胀Type I error。由于边际HR的non-collapsibility，连续终点PROCOVA的协变量调整范式无法直接移植。本文提出两步法：先在外部数据上用灵活supervised learning将基线协变量回归至martingale残差以构建prognostic score；再将该score作为额外协变量嵌入Ye et al. [2024]的非参数协变量调整log-rank检验与边际HR估计量。该方法在prognostic模型误设或外部与试验人群异质性下仍保证Type I error控制与边际HR的渐近无偏估计。方差缩减量近似等于prognostic score与试验中martingale pseudo-outcome相关系数的平方，并可自然扩展至分层随机化。对您有用：该文在RCT生存分析中实现了类似debiased ML / cross-fitting的"risk-free"效率提升，其martingale pseudo-outcome构造与您熟悉的semiparametric efficiency及influence function理论直接相通。
⚠️ 摘要不完整，待重跑（python -m research_news.rerun）
关键技术: nonparametric covariate-adjusted log-rank test, martingale pseudo-outcome, prognostic score covariate adjustment, non-collapsibility of marginal hazard ratio, variance reduction via external prediction
为什么对您有用: 本文直接连接因果推断RCT中的协变量调整与效率理论：在non-collapsible边际HR设定下，用martingale pseudo-outcome做prognostic score调整实现risk-free效率增益，与您primary interest中的semiparametric efficiency bound及estimation theory高度相关。您可用very_familiar中的estimation theory in causal inference分析其influence function结构，并用moderately_familiar的semiparametric theory审视其声称的"任意模型误设下渐近无偏"是否触及效率下界。Follow-up判断：立即可做——用您熟悉的influence function / minimax bound工具即可展开对其效率增益理论边界的验证与拓展。

2. 2605.28332 — Hybrid neural denoising for resource-efficient near- and sub-threshold radio triggering of extensive air showers¶

作者: Alperen Aksoy, Ilja Bekman, Markus Cristinziani, Eric-Teunis de Boone, Vesselin Dimitrov, Qader Dorosti, Chimezie Eguzo, Stefan Heidbrink, Stefan van Waasen, Andre Zambanini
分类: astro-ph.IM · hep-ex
相关性 5/10
摘要: 本文研究在强射频干扰环境下，如何为广延大气簇射（EAS）的射电自触发系统设计资源高效的近/亚阈值信号触发方案。核心方法是将波形恢复与信号分类视为单一受部署约束的联合问题，提出混合神经触发器：先用轻量卷积去噪器将含噪单通道迹映射为清洁脉冲估计，再用紧凑分类器评估。模型选择与部署通过超参优化、量化感知训练、定点量化及hls4ml固件导出实现端到端打通。在Pierre Auger模拟链的真实干扰背景与探测器折叠脉冲数据上，去噪器使简单峰值包络判据即可成为高效弱脉冲触发器；在10^-4误报率下，混合触发器保留约41%弱信号而经典方法保留0%。固件在FPGA上满足微秒级延迟与紧凑算力需求。对您而言，这是一篇极佳的astrostats入门读物，清晰展示了物理实验中"数据结构-噪声模型-计算约束-统计决策"的完整链条。
关键技术: convolutional denoiser, quantisation-aware training, fixed-point quantisation, hls4ml firmware export, FPGA deployment

🗂 其他论文（仅 LLM 评分，未生成摘要）¶

未生成中文摘要的论文，按 LLM 评分由高到低排列，仅保留评分与简评，便于回溯查全。一般为相关性低于展示阈值者；个别历史页也含当时因单日摘要上限未展开的高分篇目（评分仍清楚标着）。

1. 2605.27463 — When prompt perturbations break your A/B test: A valid statistical test for generative surveying¶

作者: Hayden Helm, Carey Priebe
分类: stat.ME · cs.AI · stat.AP
相关性 3/10
评分理由: Focuses on hypothesis testing for LLM prompt sensitivity, but the core machinery is classical rank/sign tests rather than high-dim or semiparametric theory.

2. 2605.28099 — A computationally-tractable measure of global sensitivity for sampling-based Bayesian inference¶

作者: Arina Odnoblyudova, Charita Dellaporta, Fran\c{c}ois-Xavier Briol
分类: stat.ME · stat.CO
相关性 3/10
评分理由: Bayesian sensitivity via Fisher divergence is adjacent to inverse problems but lacks direct overlap with primary causal/high-dim interests.

3. 2605.27859 — Near-Unit-Root Theory for Affine Processes¶

作者: Gael Anne, Yang Lu, Xuewen Yu, Xiaowen Zhou
分类: math.ST · stat.TH
相关性 3/10
评分理由: Time-series econometrics with local-to-unity asymptotics; tangential to primary interests in causal inference and high-dimensional statistics.

4. 2605.28339 — From nonstationarity to stationarity via \(1/f\) noise: discrete Fourier transforms and sample mean asymptotics for testing¶

作者: Mohamedou Ould Haye, Anne Philippe
分类: math.ST · stat.ME · stat.TH
相关性 3/10
评分理由: Long-memory time series asymptotics; unrelated to the researcher's core areas of causal inference, high-dim stats, and efficiency theory.

5. 2605.27844 — A Parameterization-Invariant DIC¶

作者: Xingyao Xiao (Stanford University), Sophia Rabe-Hesketh (University of California, Berkeley)
分类: stat.ME
相关性 2/10
评分理由: Reparameterization-invariant DIC deals with Bayesian model selection identifiability; tangential to the researcher's core theoretical arsenal.

6. 2605.28653 — Adaptive clinical trials based on design-optimal e-values with automatic curtailment: An application to single-arm trials with binary data¶

作者: Stef Baas, Judith ter Schure, Joost van Rosmalen
分类: stat.ME
相关性 2/10
评分理由: E-values and adaptive clinical trials are tangential to the researcher's core mathematical statistics and causal inference focus.

7. 2605.28406 — Sharp inequalities between variance-based dependent sensitivity indices and Shapley effects: upper-bounds¶

作者: Matieyendou Lamboni
分类: math.ST · math.PR · stat.TH
相关性 2/10
评分理由: Variance-based sensitivity analysis for dependent inputs; tangential to the researcher's causal inference and high-dimensional interests.

8. 2605.27516 — DebrisWatch II: Digging deeper for geosynchronous debris¶

作者: James A. Blake, Benjamin F. Cooke, Cristina Paragini, William Feline, Christopher A. Onken, Don Pollacco, Grant Privett, Toshifumi Yanagisawa, Robert Airey, Ioannis Apergis, Roberto Armellin, Lily Beesley, Paul Chote, Anna-Maria Cutolo, Stuart Eves, Tomoko Fujiwara, Daisuke Kuroda, Isobel S. Lockley, Alexander MacManus, James McCormac, Morgan A Mitchell, Tokuhiro Nimura, Kota Nishiyama, Shin-Ichiro Okumura, Thomas Schildknecht, Billy Shrive, Seitaro Urakawa, Dimitri Veras, Phineas Whitlock, Christian Wolf
分类: astro-ph.IM · astro-ph.EP
相关性 2/10
评分理由: Pure astrophysical survey result with no clear data/model exposition for a statistician; fails gateway rubric.

9. 2605.28768 — The unique ability of the Hubble Space Telescope to characterize young exoplanet environments¶

作者: Keighley E. Rockcliffe (University of Maryland Baltimore County, NASA Goddard Space Flight Center), Allison Youngblood (NASA Goddard Space Flight Center), Kevin France (University of Colorado), Cynthia Froning (Southwest Research Institute), P. C. Schneider (Kiel University), Elisabeth Newton (Dartmouth College), David J. Wilson (University of Colorado), Vighnesh Nagpal (University of Chicago, NSF Graduate Research Fellow), Sarah Peacock (University of Maryland Baltimore County, NASA Goddard Space Flight Center), Seth Redfield (Wesleyan University), Mayumi Liz de Andrade Miyazato (Colorado State University), Hans-R. M\"uller (Dartmouth College), Aylin Garc\'ia Soto (Boise State University)
分类: astro-ph.IM · astro-ph.EP
相关性 2/10
评分理由: Pure astrophysics result on UV irradiance and exoplanet atmospheres; lacks accessible data/model exposition for a statistician and has no clear statistical methodology question.

10. 2605.27496 — Model--based clustering for spherical and hyper--spherical data using elliptically symmetric distributions¶

作者: Theodoros Perdikis, Nader Alharbi, Michail Tsagris
分类: stat.ME
相关性 1/10
评分理由: Model-based clustering for directional data using EM; tangential to the researcher's primary interests in causal inference and high-dimensional theory.

11. 2605.27967 — Multi-Teacher Knowledge Distillation via Teacher-Informed Mixture Priors¶

作者: Luyang Fang, Yongkai Chen, Jiazhang Cai, Ping Ma, Wenxuan Zhong
分类: stat.ME · cs.AI · cs.LG · stat.ML
相关性 1/10
评分理由: Multi-teacher Bayesian knowledge distillation for deep learning; unrelated to the researcher's primary methodological interests.

12. 2605.28559 — Sequential generalized kernel equating: Providing comparable scores across multiple test forms with nonequivalent groups and differently measured covariates¶

作者: Michaela Va\v{r}ejkov\'a (Institute of Computer Science of the Czech Academy of Sciences, Prague, Czech Republic, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic), Patr\'icia Martinkov\'a (Institute of Computer Science of the Czech Academy of Sciences, Prague, Czech Republic, Faculty of Education, Charles University, Prague, Czech Republic), Eva Potu\v{z}n\'ikov\'a (Institute of Computer Science of the Czech Academy of Sciences, Prague, Czech Republic, Faculty of Education, Charles University, Prague, Czech Republic)
分类: stat.ME
相关性 1/10
评分理由: Test equating with covariates is unrelated to the researcher's primary or secondary interests.

13. 2605.27481 — Astronomy Open Science Competence Centre in Europe¶

作者: Marco Molinaro, Mark Allen, Joachim Wambsganns, Enrique Solano, Baptiste Cecconi, Markus Demleitner, Andr\'e Schaaff, Hendrik Heinl, Sara Bertocco
分类: astro-ph.IM
相关性 1/10
评分理由: Administrative project description for open science infrastructure; fails all gateway reading criteria for astrostatistics.

14. 2605.27687 — The Ultraviolet View of Star and Planet Formation: Disks, Accretion, and Outflows with the Hubble Space Telescope into the 2030s¶

作者: Kevin France, Eric Gaidos, Catherine Espaillat, Carlo F. Manara, Edwin Bergin
分类: astro-ph.IM · astro-ph.EP · astro-ph.SR
相关性 1/10
评分理由: Pure astrophysics review paper with no statistical methodology or data analysis problem exposed for outsiders.

15. 2605.27650 — Bayesian Imputation for Unplayed Games in Round-Robin Chess Tournaments: Application to Grand Chess Tour, Bucharest 2026¶

作者: Ravi Varadhan
分类: stat.ME
相关性 0/10
评分理由: Bayesian imputation for unplayed chess games; unrelated to the researcher's methodological interests.

Maintained by 陈星宇 · Homepage · Source on GitHub

2026-05-29 每日 arXiv 资讯¶

⭐ 高相关论文（按主题分组）¶

因果推断 (causal_inference, 7 篇)¶

1. 2605.28341 — Identification and Inference for Structural Accelerated Failure Time Models via Instrument Interactions¶

2. 2605.28749 — IV regression with distribution-valued outcomes¶

3. 2605.27655 — Implementing the principal stratum strategy for intercurrent events with survival outcomes: a tutorial¶

4. 2605.28105 — Identifying Direct Causal Effects in Latent Factor Models by Accounting for Unidentified Parents¶

5. 2605.28785 — Beyond Exchangeability: Distribution-Shift-Aware Integration of External Control Data in Randomized Trials¶

6. 2605.28762 — Deep Neural Networks for Doubly Robust Estimation with Nonprobability Survey Samples¶

7. 2605.28762 — Deep Neural Networks for Doubly Robust Estimation with Nonprobability Survey Samples¶

天体统计 (astrostats, 2 篇)¶

1. 2605.27527 — Probabilistic Data-Driven Modelling of Astrophysical Transients: The Neural Process Family for Ultrafast and Class-Agnostic Light Curve Reconstruction with NightLANP¶

2. 2605.27538 — VROOM-SBI: A Fast Simulation-Based Bayesian Inference Methodology for QU-Fitting¶

其他 (other, 2 篇)¶

1. 2605.28429 — The 'Right' Extension of Type-I Error to Data-Dependent Levels¶

2. 2605.28429 — The 'Right' Extension of Type-I Error to Data-Dependent Levels¶

📌 中相关论文（按主题分组）¶

因果推断 (causal_inference, 1 篇)¶

1. 2605.28471 — The Modified Egger Intercept Tests for Detecting Horizontal Pleiotropy in Two-Sample Summary-Data Mendelian Randomization¶

非参数 / 半参数 (nonparam_semipara, 2 篇)¶

1. 2605.27718 — Robust Moment-Based Estimation via Spectral Gradient Reweighting¶

2. 2605.27718 — Robust Moment-Based Estimation via Spectral Gradient Reweighting¶

数理统计 / 假设检验 (hypothesis_testing, 4 篇)¶

1. 2605.27625 — Admissibility of Adaptive Monotone Step-Down Multiple Testing Procedures Under Arbitrary Covariance Dependence¶

2. 2605.27625 — Admissibility of Adaptive Monotone Step-Down Multiple Testing Procedures Under Arbitrary Covariance Dependence¶

3. 2605.27664 — BOOST: Power-Optimal Strong-FWER Testing for Block-Structured Multiplicity¶

4. 2605.28349 — Robust Inference for Dyadic Data with Dependent Ordered Nodes¶

天体统计 (astrostats, 4 篇)¶

1. 2605.27562 — A Semi-Supervised Variational Autoencoder for Generating Neutron Star Equations of State¶

2. 2605.28572 — Unsupervised Morphological Characterization of Gravitational-Wave Glitches in LIGO O4a Using Frozen DINOv2 Features¶

3. 2605.28457 — Noise Suppression and Radio Frequency Interference Rejection for Self-Triggered Radio Detectors of Extensive Air Showers¶

4. 2605.28373 — FARSim: a fast RF-chain-aware trigger-screening surrogate for radio detection of ultra-high-energy cosmic rays¶

其他 (other, 2 篇)¶

1. 2605.27711 — Improving Power in Randomized Controlled Trials with Time-to-Event Endpoints: A Risk-Free Approach¶

2. 2605.28332 — Hybrid neural denoising for resource-efficient near- and sub-threshold radio triggering of extensive air showers¶

🗂 其他论文（仅 LLM 评分，未生成摘要）¶

1. 2605.27463 — When prompt perturbations break your A/B test: A valid statistical test for generative surveying¶

2. 2605.28099 — A computationally-tractable measure of global sensitivity for sampling-based Bayesian inference¶

3. 2605.27859 — Near-Unit-Root Theory for Affine Processes¶

4. 2605.28339 — From nonstationarity to stationarity via \(1/f\) noise: discrete Fourier transforms and sample mean asymptotics for testing¶

5. 2605.27844 — A Parameterization-Invariant DIC¶

6. 2605.28653 — Adaptive clinical trials based on design-optimal e-values with automatic curtailment: An application to single-arm trials with binary data¶

7. 2605.28406 — Sharp inequalities between variance-based dependent sensitivity indices and Shapley effects: upper-bounds¶

8. 2605.27516 — DebrisWatch II: Digging deeper for geosynchronous debris¶

9. 2605.28768 — The unique ability of the Hubble Space Telescope to characterize young exoplanet environments¶

10. 2605.27496 — Model--based clustering for spherical and hyper--spherical data using elliptically symmetric distributions¶

11. 2605.27967 — Multi-Teacher Knowledge Distillation via Teacher-Informed Mixture Priors¶

12. 2605.28559 — Sequential generalized kernel equating: Providing comparable scores across multiple test forms with nonequivalent groups and differently measured covariates¶

13. 2605.27481 — Astronomy Open Science Competence Centre in Europe¶

14. 2605.27687 — The Ultraviolet View of Star and Planet Formation: Disks, Accretion, and Outflows with the Hubble Space Telescope into the 2030s¶

15. 2605.27650 — Bayesian Imputation for Unplayed Games in Round-Robin Chess Tournaments: Application to Grand Chess Tour, Bucharest 2026¶

评论