AI资讯日报 - 2025/11/2

👨‍🔬 Mantas Mazeika, Alice Gatti, Cristina Menghini, Udari Madhushani Sehwag, Shivam Singhal, Yury Orlovskiy, Steven Basart, Manasi Sharma, Denis Peskoff, Elaine Lau, Jaehyuk Lim, Lachlan Carroll, Alice Blair, Vinaya Sivakumar, Sumana Basu, Brad Kenstler, Yuntao Ma, Julian Michael, Xiaoke Li, Oliver Ingebretsen, Aditya Mehta, Jean Mottola, John Teichmann, Kevin Yu, Zaina Shaik, Adam Khoja, Richard Ren, Jason Hausenloy, Long Phan, Ye Htet, Ankit Aich, Tahseen Rabbani, Vivswan Shah, Andriy Novykov, Felix Binder, Kirill Chugunov, Luis Ramirez, Matias Geralnik, Hernán Mesura, Dean Lee, Ed-Yeremai Hernandez Cardona, Annette Diamond, Summer Yue, Alexandr Wang, Bing Liu, Ernesto Hernandez, Dan Hendrycks

The Oversight Game: Learning to Cooperatively Balance an AI Agent's Safety and Autonomy

学术论文 ArXiv 重要度: 8

提出监督博弈框架，使AI代理学习在风险时请示、安全时自主行动，实现部署后安全控制。

👨‍🔬 William Overman, Mohsen Bayati

Defeating the Training-Inference Mismatch via FP16

学术论文 ArXiv 重要度: 7

发现BF16精度导致RL微调不稳定，改用FP16可消除训练-推理失配，提升稳定性和性能。

👨‍🔬 Penghui Qi, Zichen Liu, Xiangxin Zhou, Tianyu Pang, Chao Du, Wee Sun Lee, Min Lin

Clone Deterministic 3D Worlds with Geometrically-Regularized World Models

学术论文 ArXiv 重要度: 7

提出几何正则化世界模型，通过改进表示学习提升对确定性3D环境的克隆和长程预测能力。

👨‍🔬 Zaishuo Xia, Yukuan Lu, Xinyi Li, Yifan Xu, Yubei Chen

A General Incentives-Based Framework for Fairness in Multi-agent Resource Allocation

学术论文 ArXiv 重要度: 7

提出GIFF框架，利用标准价值函数实现多智能体资源分配中的公平性，无需额外训练。

👨‍🔬 Ashwin Kumar, William Yeoh

LLMs Process Lists With General Filter Heads

学术论文 ArXiv 重要度: 6

发现LLM通过“过滤头”机制执行列表处理任务，其编码的过滤谓词表示具有可移植性和泛化性。

👨‍🔬 Arnab Sen Sharma, Giordano Rogers, Natalie Shapira, David Bau

STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization

学术论文 ArXiv 重要度: 6

提出STaMP量化方法，通过序列维变换和混合精度保持低比特激活量化下的模型精度。

👨‍🔬 Marco Federici, Riccardo Del Chiaro, Boris van Breugel, Paul Whatmough, Markus Nagel

Gistify! Codebase-Level Understanding via Runtime Execution

学术论文 ArXiv 重要度: 6

提出Gistify任务，要求LLM从代码库中提取最小自包含文件复现特定功能，当前模型表现不佳。

👨‍🔬 Hyunji Lee, Minseon Kim, Chinmay Singh, Matheus Pereira, Atharv Sonwane, Isadora White, Elias Stengel-Eskin, Mohit Bansal, Zhengyan Shi, Alessandro Sordoni, Marc-Alexandre Côté, Xingdi Yuan, Lucas Caccia

Faithful and Fast Influence Function via Advanced Sampling

学术论文 ArXiv 重要度: 5

提出基于特征和logits的先进采样方法，提升影响函数估计的准确性并减少计算资源消耗。

👨‍🔬 Jungyeon Koh, Hyeonsu Lyu, Jonggyu Jang, Hyun Jong Yang

Deep sequence models tend to memorize geometrically; it is unclear why

学术论文 ArXiv 重要度: 5

研究发现序列模型以几何方式记忆事实，而非简单关联查找，这种几何记忆源于光谱偏差。

👨‍🔬 Shahriar Noroozizadeh, Vaishnavh Nagarajan, Elan Rosenfeld, Sanjiv Kumar

🤖 AI资讯日报

📊 今日趋势总结

Why Boring Businesses Outlast AI Hype Cycles

Ask HN: Anyone concerned about NYC Local Law 144?

Ask HN: What's the pain using current AI algorithms?

Ask HN: Is the rate of progress in AI exponential?

NLP, AI, ML, bots – a passing trend or much more? What's your take on this?

The AI Crackpot Index

Ask HN: What would you read to learn about "artificial intelligence"?

Ask HN: Dipping my toes with artificial intelligence and what to expect? (CS)

Common Lisp + Machine Learning Internship at Google (Mountain View, CA)

Bioinformatician

Show HN: Startup Raising capital through Book Sales

The Next Bill Gates or Albert Einstein in AI "Chris Clark" – Yourobot

Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark

AMO-Bench: Large Language Models Still Struggle in High School Math Competitions

Remote Labor Index: Measuring AI Automation of Remote Work

The Oversight Game: Learning to Cooperatively Balance an AI Agent's Safety and Autonomy

Defeating the Training-Inference Mismatch via FP16

Clone Deterministic 3D Worlds with Geometrically-Regularized World Models

A General Incentives-Based Framework for Fairness in Multi-agent Resource Allocation

LLMs Process Lists With General Filter Heads

STaMP: Sequence Transformation and Mixed Precision for Low-Precision Activation Quantization

Gistify! Codebase-Level Understanding via Runtime Execution

Faithful and Fast Influence Function via Advanced Sampling

Deep sequence models tend to memorize geometrically; it is unclear why

📅 历史日报目录