行业动态
Hacker News
重要度: 9
探讨传统企业如何比AI炒作周期更持久,强调可持续商业模式的重要性。
行业动态
Hacker News
重要度: 8
讨论当前AI算法在实际应用中的痛点与局限性。
行业动态
Hacker News
重要度: 8
Ask HN: Anyone concerned about NYC Local Law 144?
行业动态
Hacker News
重要度: 7
探讨NLP、AI、ML和机器人技术是短暂趋势还是具有深远影响。
行业动态
Hacker News
重要度: 7
讨论AI技术进步速度是否呈指数级增长。
行业动态
Hacker News
重要度: 6
介绍AI领域的“疯狂指数”,用于评估不切实际的AI主张。
行业动态
Hacker News
重要度: 6
征集学习人工智能的推荐阅读材料和资源。
行业动态
Hacker News
重要度: 5
介绍MIT非AI许可证,涉及AI模型使用的法律框架。
行业动态
Hacker News
重要度: 5
谷歌招聘Common Lisp与机器学习结合的实习生职位。
行业动态
Hacker News
重要度: 4
宣传Chris Clark作为AI领域潜在领军人物,探讨“上帝算法”概念。
行业动态
Hacker News
重要度: 3
展示一家通过图书销售筹集资金的初创公司。
行业动态
Hacker News
重要度: 3
生物信息学家的招聘信息,涉及AI在生物领域的应用。
学术论文
ArXiv
重要度: 9
研究自动驾驶视觉策略的分布外鲁棒性,分解环境因素并比较不同模型,发现ViT和基础模型特征显著提升鲁棒性。
👨🔬 Amir Mallak, Alaa Maalouf
学术论文
ArXiv
重要度: 9
提出统一代理框架InternAgent-1.5,支持端到端科学发现,在计算和实验领域实现自主研究并取得领先性能。
👨🔬 Shiyang Feng, Runmin Ma, Xiangchao Yan, Yue Fan, Yusong Hu, Songtao Huang, Shuaiyu Zhang, Zongsheng Cao, Tianshuo Peng, Jiakang Yuan, Zijie Guo, Zhijie Zhong, Shangheng Du, Weida Wang, Jinxin Shi, Yuhao Zhou, Xiaohan He, Zhiyin Yu, Fangchen Yu, Qihao Zheng, Jiamin Wu, Mianxin Liu, Chi Zhang, Shaowei Hou, Shuya Li, Yankai Jiang, Wenjie Lou, Lilong Wang, Zifu Wang, Jiong Wang, Wanghan Xu, Yue Deng, Dongrui Liu, Yiheng Wang, Wenlong Zhang, Fenghua Ling, Shufei Zhang, Xiaosong Wang, Shuangjia Zheng, Xun Huang, Siqi Sun, Shuyue Hu, Peng Ye, Chunfeng Song, Bin Wang, Conghui He, Yihao Liu, Xin Li, Qibin Hou, Tao Chen, Xiangyu Yue, Bin Wang, Liang He, Dahua Lin, Bowen Zhou, Bo Zhang, Lei Bai
学术论文
ArXiv
重要度: 8
提出分层数据管理框架,支持LLM全训练周期,通过数据-模型协同进化提升训练效率和模型性能。
👨🔬 Yudong Wang, Zixuan Fu, Hengyu Zhao, Chen Zhao, Chuyue Zhou, Xinle Lin, Hongya Lyu, Shuaikang Xue, Yi Yi, Yingjiao Wang, Zhi Zheng, Yuzhou Zhang, Jie Zhou, Chaojun Xiao, Xu Han, Zhiyuan Liu, Maosong Sun
学术论文
ArXiv
重要度: 8
提出自适应旋转优化框架ARO,通过梯度旋转加速LLM训练,在基准测试中优于AdamW和正交化方法。
👨🔬 Wenbo Gong, Javier Zazo, Qijun Luo, Puqian Wang, James Hensman, Chao Ma
学术论文
ArXiv
重要度: 8
Diffusion models have achieved remarkable generation quality, but they suffer from significant inference cost due to their reliance on multiple sequential denoising steps, motivating recent efforts to distill this inference process into a few-step regime. However, existing distillation methods typically approximate the teacher trajectory by using linear shortcuts, which makes it difficult to match its constantly changing tangent directions as velocities evolve across timesteps, thereby leading to quality degradation. To address this limitation, we propose ArcFlow, a few-step distillation framework that explicitly employs non-linear flow trajectories to approximate pre-trained teacher trajectories. Concretely, ArcFlow parameterizes the velocity field underlying the inference trajectory as a mixture of continuous momentum processes. This enables ArcFlow to capture velocity evolution and extrapolate coherent velocities to form a continuous non-linear trajectory within each denoising step. Importantly, this parameterization admits an analytical integration of this non-linear trajectory, which circumvents numerical discretization errors and results in high-precision approximation of the teacher trajectory. To train this parameterization into a few-step generator, we implement ArcFlow via trajectory distillation on pre-trained teacher models using lightweight adapters. This strategy ensures fast, stable convergence while preserving generative diversity and quality. Built on large-scale models (Qwen-Image-20B and FLUX.1-dev), ArcFlow only fine-tunes on less than 5% of original parameters and achieves a 40x speedup with 2 NFEs over the original multi-step teachers without significant quality degradation. Experiments on benchmarks show the effectiveness of ArcFlow both qualitatively and quantitatively.
👨🔬 Zihan Yang, Shuyuan Tu, Licheng Zhang, Qi Dai, Yu-Gang Jiang, Zuxuan Wu
学术论文
ArXiv
重要度: 7
提出迭代组相对策略优化方法iGRPO,通过自反馈增强LLM数学推理能力,在多个基准上达到最先进水平。
👨🔬 Ali Hatamizadeh, Shrimai Prabhumoye, Igor Gitman, Ximing Lu, Seungju Han, Wei Ping, Yejin Choi, Jan Kautz
学术论文
ArXiv
重要度: 7
提出社交机器人导航框架,结合几何规划和视觉语言模型进行社交推理,在多种交互场景中实现最优性能。
👨🔬 Zilin Fang, Anxing Xiao, David Hsu, Gim Hee Lee
学术论文
ArXiv
重要度: 7
提出自适应神经连接重分配框架ANCRe,通过优化残差连接提升网络深度利用效率,加速收敛并提高性能。
👨🔬 Yilang Zhang, Bingcong Li, Niao He, Georgios B. Giannakis
学术论文
ArXiv
重要度: 6
发布多格式钓鱼攻击数据集CIC-Trap4Phish,包含五种常见文件类型,支持静态特征提取和轻量级模型检测。
👨🔬 Fatemeh Nejati, Mahdi Rabbani, Mansur Mirani, Gunjan Piya, Igor Opushnyev, Ali A. Ghorbani, Sajjad Dadkhah
学术论文
ArXiv
重要度: 6
提出新一代验证码框架,利用人机认知差距设计动态任务,为高级GUI代理时代提供可扩展的防御机制。
👨🔬 Jiacheng Liu, Yaxin Luo, Jiacheng Cui, Xinyi Shang, Xiaohan Zhao, Zhiqiang Shen
学术论文
ArXiv
重要度: 6
推出GUI生成基准GEBench,评估图像生成模型的动态交互和时间一致性,并提出五维评估指标GE-Score。
👨🔬 Haodong Li, Jingwei Wu, Quan Sun, Guopeng Li, Juanxi Tian, Huanyu Zhang, Yanlin Lai, Ruichuan An, Hongbo Peng, Yuhong Dai, Chenxi Li, Chunmei Qing, Jia Wang, Ziyang Meng, Zheng Ge, Xiangyu Zhang, Daxin Jiang
学术论文
ArXiv
重要度: 5
提出加权损失目标,结合节点不平衡加权和焦点加权,提升分层多标签学习中稀有节点的检测性能。
👨🔬 Isaac Xu, Martin Gillis, Ayushi Sharma, Benjamin Misiuk, Craig J. Brown, Thomas Trappenberg