AI News Daily - 2025/10/13

👨‍🔬 Chengyu Wang, Paria Rashidinejad, DiJia Su, Song Jiang, Sid Wang, Siyan Zhao, Cai Zhou, Shannon Zejiang Shen, Feiyu Chen, Tommi Jaakkola, Yuandong Tian, Bo Liu

Mitigating Overthinking through Reasoning Shaping

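Academic paper ArXiv Importance: 6

Proposes the GRSP method, which uses a segment-level penalty mechanism to reduce overthinking in large reasoning models, balancing computational efficiency and accuracy.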

👨‍🔬 Feifan Song, Shaohang Wei, Bofei Gao, Yejie Wang, Wen Luo, Wei Li, Linli Yao, Weimin Xiong, Liang Chen, Tianyu Liu, Houfeng Wang

Autonomous Soft Robotic Guidewire Navigation via Imitation Learning

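Academic paper ArXiv Importance: 6

Develops a Transformer-based imitation learning framework that lets a soft robotic guidewire navigate blood vessels autonomously, reaching an 83% success rate on unseen geometries.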

👨‍🔬 Noah Barnes, Ji Woong Kim, Lingyun Di, Hannah Qu, Anuruddha Bhattacharjee, Miroslaw Janowski, Dheeraj Gandhi, Bailey Felix, Shaopeng Jiang, Olivia Young, Mark Fuge, Ryan D. Sochol, Jeremy D. Brown, Axel Krieger

A methodology for clinically driven interactive segmentation evaluation

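Academic paper ArXiv Importance: 5

Proposes a clinically driven methodology for evaluating interactive segmentation, and finds that minimizing the loss of interaction information and using adaptive zooming are critical to model robustness.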

👨‍🔬 Parhom Esmaeili, Virginia Fernandez, Pedro Borges, Eli Gibson, Sebastien Ourselin, M. Jorge Cardoso

Safe, Untrusted, "Proof-Carrying" AI Agents: toward the agentic lakehouse

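Academic paper ArXiv Importance: 5

Proposes a safe AI agent framework based on proof-carrying code that lets untrusted agents run safely on production data, advancing the agentic lakehouse.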

👨‍🔬 Jacopo Tagliabue, Ciro Greco

Titans Revisited: A Lightweight Reimplementation and Critical Analysis of a Test-Time Memory Model

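Academic paper ArXiv Importance: 4

Presents a lightweight reimplementation and evaluation of the Titans test-time memory model, finding that its neural memory component consistently improves performance while its chunking strategy has limitations.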

👨‍🔬 Gavriel Di Nepi, Federico Siciliano, Fabrizio Silvestri

📊 Today's Trend Summary

Why Boring Businesses Outlast AI Hype Cycles

Ask HN: What's the pain using current AI algorithms?

The AI Crackpot Index

Ask HN: Is the rate of progress in AI exponential?

NLP, AI, ML, bots – a passing trend or much more? What's your take on this?

Ask HN: Anyone concerned about NYC Local Law 144?

Common Lisp + Machine Learning Internship at Google (Mountain View, CA)

Ask HN: What would you read to learn about "artificial intelligence"?

Ask HN: Dipping my toes with artificial intelligence and what to expect? (CS)

Bioinformatician

Show HN: Startup Raising capital through Book Sales

The Next Bill Gates or Albert Einstein in AI “Chris Clark” – Yourobot

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Prompting Test-Time Scaling Is A Strong LLM Reasoning Data Augmentation

LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?

GraphMERT: Efficient and Scalable Distillation of Reliable Knowledge Graphs from Unstructured Data

Dyna-Mind: Learning to Simulate from Experience for Better AI Agents

BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Mitigating Overthinking through Reasoning Shaping

Autonomous Soft Robotic Guidewire Navigation via Imitation Learning

A methodology for clinically driven interactive segmentation evaluation

Safe, Untrusted, "Proof-Carrying" AI Agents: toward the agentic lakehouse

Titans Revisited: A Lightweight Reimplementation and Critical Analysis of a Test-Time Memory Model

📅 Past Daily Reports Index