AI News Daily - 2026/4/28

👨‍🔬 Zhou Ziheng, Huacong Tang, Jinyuan Zhang, Haowei Lin, Bangcheng Yang, Qian Long, Fang Sun, Yizhou Sun, Yitao Liang, Ying Nian Wu, Demetri Terzopoulos, Xiaofeng Gao

Learning to Think from Multiple Thinkers

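Academic Paper ArXiv Importance: 7

Studies learning from multiple chains of thought, finds that relying on passively collected data is difficult, and proposes an active learning algorithm.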

👨‍🔬 Nirmit Joshi, Roey Magen, Nathan Srebro, Nikolaos Tsilivis, Gal Vardi

Learning to Rotate: Temporal and Semantic Rotary Encoding for Sequential Modeling

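Academic Paper ArXiv Importance: 7

Proposes SIREN-RoPE, which makes the rotation space learnable and increases the expressiveness of the attention mechanism.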

👨‍🔬 Hailing Cheng, Daqi Sun, Xinyu Lu

Defective Task Descriptions in LLM-Based Code Generation: Detection and Analysis

Academic Paper ArXiv Importance: 7

Develops SpecValidator to detect defects in task descriptions for code generation; it outperforms large models.

👨‍🔬 Amal Akli, Mike Papadakis, Maxime Cordy, Yves Le Traon

Green Shielding: A User-Centric Approach Towards Trustworthy AI

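Academic Paper ArXiv Importance: 7

Proposes a user-centric approach to trustworthy AI that uses benchmarking and perturbation analysis to guide safe deployment.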

👨‍🔬 Aaron J. Li, Nicolas Sanchez, Hao Huang, Ruijiang Dong, Jaskaran Bains, Katrin Jaradeh, Zhen Xiang, Bo Li, Feng Liu, Aaron Kornblith, Bin Yu

Governing What You Cannot Observe: Adaptive Runtime Governance for Autonomous AI Agents

Academic Paper ArXiv Importance: 7

Proposes the Agent Viability framework and RiskGate to achieve adaptive runtime governance of autonomous agents.

👨‍🔬 German Marin, Jatin Chaudhary

Scalable Hyperparameter-Divergent Ensemble Training with Automatic Learning Rate Exploration for Large Models

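Academic Paper ArXiv Importance: 6

Proposes HDET, which uses multiple replicas to explore learning rates and automatically adjusts hyperparameters to improve training results.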

👨‍🔬 Hailing Cheng, Tao Huang, Chen Zhu, Antonio Alonso

Leveraging LLMs for Multi-File DSL Code Generation: An Industrial Case Study

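Academic Paper ArXiv Importance: 6

In a BMW case study, uses LLMs to generate multi-file DSL code; fine-tuning substantially improves accuracy and structural fidelity.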

👨‍🔬 Sivajeet Chand, Kevin Nguyen, Peter Kuntz, Alexander Pretschner

The Price of Agreement: Measuring LLM Sycophancy in Agentic Financial Applications

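Academic Paper ArXiv Importance: 6

Evaluates LLM sycophancy in financial applications; finds that the performance drop is small, but that preference information leads to failures.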

👨‍🔬 Zhenyu Zhao, Aparna Balagopalan, Adi Agrawal, Dilshoda Yergasheva, Waseem Alshikh, Daniel M. Bikel

Benchmarking Source-Sensitive Reasoning in Turkish: Humans and LLMs under Evidential Trust Manipulation

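Academic Paper ArXiv Importance: 5

Studies how source trustworthiness affects evidential morphology in Turkish; humans are sensitive to it, while LLMs are inconsistent.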

👨‍🔬 Sercan Karakaş, Yusuf Şimşek

🤖 AI News Daily

📊 Today's Trend Summary

MIT Non-AI License

Ask HN: Anyone concerned about NYC Local Law 144?

Ask HN: Is the rate of progress in AI exponential?

Why Boring Businesses Outlast AI Hype Cycles

Ask HN: What's the pain using current AI algorithms?

The AI Crackpot Index

Ask HN: What would you read to learn about "artificial intelligence"?

Show HN: Startup Raising capital through Book Sales

The Next Bill Gates or Albert Einstein in AI “Chris Clark” – Yourobot

Common Lisp + Machine Learning Internship at Google (Mountain View, CA)

Bioinformatician

NLP, AI, ML, bots – a passing trend or much more? What's your take on this?

Personalized Worked Example Generation from Student Code Submissions using Pattern-based Knowledge Components

Case-Specific Rubrics for Clinical AI Evaluation: Methodology, Validation, and LLM-Clinician Agreement Across 823 Encounters

Can Current Agents Close the Discovery-to-Application Gap? A Case Study in Minecraft

📅 Archive of Past Daily Reports