AI资讯日报 - 2026/2/10

👨‍🔬 Shiyang Feng, Runmin Ma, Xiangchao Yan, Yue Fan, Yusong Hu, Songtao Huang, Shuaiyu Zhang, Zongsheng Cao, Tianshuo Peng, Jiakang Yuan, Zijie Guo, Zhijie Zhong, Shangheng Du, Weida Wang, Jinxin Shi, Yuhao Zhou, Xiaohan He, Zhiyin Yu, Fangchen Yu, Qihao Zheng, Jiamin Wu, Mianxin Liu, Chi Zhang, Shaowei Hou, Shuya Li, Yankai Jiang, Wenjie Lou, Lilong Wang, Zifu Wang, Jiong Wang, Wanghan Xu, Yue Deng, Dongrui Liu, Yiheng Wang, Wenlong Zhang, Fenghua Ling, Shufei Zhang, Xiaosong Wang, Shuangjia Zheng, Xun Huang, Siqi Sun, Shuyue Hu, Peng Ye, Chunfeng Song, Bin Wang, Conghui He, Yihao Liu, Xin Li, Qibin Hou, Tao Chen, Xiangyu Yue, Bin Wang, Liang He, Dahua Lin, Bowen Zhou, Bo Zhang, Lei Bai

Data Science and Technology Towards AGI Part I: Tiered Data Management

学术论文 ArXiv 重要度: 8

提出分层数据管理框架，支持LLM全训练周期，通过数据-模型协同进化提升训练效率和模型性能。

👨‍🔬 Yudong Wang, Zixuan Fu, Hengyu Zhao, Chen Zhao, Chuyue Zhou, Xinle Lin, Hongya Lyu, Shuaikang Xue, Yi Yi, Yingjiao Wang, Zhi Zheng, Yuzhou Zhang, Jie Zhou, Chaojun Xiao, Xu Han, Zhiyuan Liu, Maosong Sun

ARO: A New Lens On Matrix Optimization For Large Models

学术论文 ArXiv 重要度: 8

提出自适应旋转优化框架ARO，通过梯度旋转加速LLM训练，在基准测试中优于AdamW和正交化方法。

👨‍🔬 Wenbo Gong, Javier Zazo, Qijun Luo, Puqian Wang, James Hensman, Chao Ma

ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation

学术论文 ArXiv 重要度: 8

Diffusion models have achieved remarkable generation quality, but they suffer from significant inference cost due to their reliance on multiple sequential denoising steps, motivating recent efforts to distill this inference process into a few-step regime. However, existing distillation methods typically approximate the teacher trajectory by using linear shortcuts, which makes it difficult to match its constantly changing tangent directions as velocities evolve across timesteps, thereby leading to quality degradation. To address this limitation, we propose ArcFlow, a few-step distillation framework that explicitly employs non-linear flow trajectories to approximate pre-trained teacher trajectories. Concretely, ArcFlow parameterizes the velocity field underlying the inference trajectory as a mixture of continuous momentum processes. This enables ArcFlow to capture velocity evolution and extrapolate coherent velocities to form a continuous non-linear trajectory within each denoising step. Importantly, this parameterization admits an analytical integration of this non-linear trajectory, which circumvents numerical discretization errors and results in high-precision approximation of the teacher trajectory. To train this parameterization into a few-step generator, we implement ArcFlow via trajectory distillation on pre-trained teacher models using lightweight adapters. This strategy ensures fast, stable convergence while preserving generative diversity and quality. Built on large-scale models (Qwen-Image-20B and FLUX.1-dev), ArcFlow only fine-tunes on less than 5% of original parameters and achieves a 40x speedup with 2 NFEs over the original multi-step teachers without significant quality degradation. Experiments on benchmarks show the effectiveness of ArcFlow both qualitatively and quantitatively.

👨‍🔬 Zihan Yang, Shuyuan Tu, Licheng Zhang, Qi Dai, Yu-Gang Jiang, Zuxuan Wu

iGRPO: Self-Feedback-Driven LLM Reasoning

学术论文 ArXiv 重要度: 7

提出迭代组相对策略优化方法iGRPO，通过自反馈增强LLM数学推理能力，在多个基准上达到最先进水平。

👨‍🔬 Ali Hatamizadeh, Shrimai Prabhumoye, Igor Gitman, Ximing Lu, Seungju Han, Wei Ping, Yejin Choi, Jan Kautz

From Obstacles to Etiquette: Robot Social Navigation with VLM-Informed Path Selection

学术论文 ArXiv 重要度: 7

提出社交机器人导航框架，结合几何规划和视觉语言模型进行社交推理，在多种交互场景中实现最优性能。

👨‍🔬 Zilin Fang, Anxing Xiao, David Hsu, Gim Hee Lee

ANCRe: Adaptive Neural Connection Reassignment for Efficient Depth Scaling

学术论文 ArXiv 重要度: 7

提出自适应神经连接重分配框架ANCRe，通过优化残差连接提升网络深度利用效率，加速收敛并提高性能。

👨‍🔬 Yilang Zhang, Bingcong Li, Niao He, Georgios B. Giannakis

CIC-Trap4Phish: A Unified Multi-Format Dataset for Phishing and Quishing Attachment Detection

学术论文 ArXiv 重要度: 6

发布多格式钓鱼攻击数据集CIC-Trap4Phish，包含五种常见文件类型，支持静态特征提取和轻量级模型检测。

👨‍🔬 Fatemeh Nejati, Mahdi Rabbani, Mansur Mirani, Gunjan Piya, Igor Opushnyev, Ali A. Ghorbani, Sajjad Dadkhah

Next-Gen CAPTCHAs: Leveraging the Cognitive Gap for Scalable and Diverse GUI-Agent Defense

学术论文 ArXiv 重要度: 6

提出新一代验证码框架，利用人机认知差距设计动态任务，为高级GUI代理时代提供可扩展的防御机制。

👨‍🔬 Jiacheng Liu, Yaxin Luo, Jiacheng Cui, Xinyi Shang, Xiaohan Zhao, Zhiqiang Shen

GEBench: Benchmarking Image Generation Models as GUI Environments

学术论文 ArXiv 重要度: 6

推出GUI生成基准GEBench，评估图像生成模型的动态交互和时间一致性，并提出五维评估指标GE-Score。

👨‍🔬 Haodong Li, Jingwei Wu, Quan Sun, Guopeng Li, Juanxi Tian, Huanyu Zhang, Yanlin Lai, Ruichuan An, Hongbo Peng, Yuhong Dai, Chenxi Li, Chunmei Qing, Jia Wang, Ziyang Meng, Zheng Ge, Xiangyu Zhang, Daxin Jiang

Improving Detection of Rare Nodes in Hierarchical Multi-Label Learning

学术论文 ArXiv 重要度: 5

提出加权损失目标，结合节点不平衡加权和焦点加权，提升分层多标签学习中稀有节点的检测性能。

👨‍🔬 Isaac Xu, Martin Gillis, Ayushi Sharma, Benjamin Misiuk, Craig J. Brown, Thomas Trappenberg

🤖 AI资讯日报

📊 今日趋势总结

Why Boring Businesses Outlast AI Hype Cycles

Ask HN: What's the pain using current AI algorithms?

Ask HN: Anyone concerned about NYC Local Law 144?

NLP, AI, ML, bots – a passing trend or much more? What's your take on this?

Ask HN: Is the rate of progress in AI exponential?

The AI Crackpot Index

Ask HN: What would you read to learn about "artificial intelligence"?

MIT Non-AI License

Common Lisp + Machine Learning Internship at Google (Mountain View, CA)

The Next Bill Gates or Albert Einstein in AI “Chris Clark” – Yourobot

Show HN: Startup Raising capital through Book Sales

Bioinformatician

Robustness Is a Function, Not a Number: A Factorized Comprehensive Study of OOD Robustness in Vision-Based Driving

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Data Science and Technology Towards AGI Part I: Tiered Data Management

ARO: A New Lens On Matrix Optimization For Large Models

ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation

iGRPO: Self-Feedback-Driven LLM Reasoning

From Obstacles to Etiquette: Robot Social Navigation with VLM-Informed Path Selection

ANCRe: Adaptive Neural Connection Reassignment for Efficient Depth Scaling

CIC-Trap4Phish: A Unified Multi-Format Dataset for Phishing and Quishing Attachment Detection

Next-Gen CAPTCHAs: Leveraging the Cognitive Gap for Scalable and Diverse GUI-Agent Defense

GEBench: Benchmarking Image Generation Models as GUI Environments

Improving Detection of Rare Nodes in Hierarchical Multi-Label Learning

📅 历史日报目录