AI资讯日报 - 2025/7/30

👨‍🔬 Mohammed Baharoon, Luyang Luo, Michael Moritz, Abhinav Kumar, Sung Eun Kim, Xiaoman Zhang, Miao Zhu, Mahmoud Hussain Alabbad, Maha Sbayel Alhazmi, Neel P. Mistry, Kent Ryan Kleinschmidt, Brady Chrisler, Sathvik Suryadevara, Sri Sai Dinesh Jaliparthi, Noah Michael Prudlo, Mark David Marino, Jeremy Palacio, Rithvik Akula, Hong-Yu Zhou, Ibrahim Ethem Hamamci, Scott J. Adams, Hassan Rayhan AlOmaish, Pranav Rajpurkar

PHAX: A Structured Argumentation Framework for User-Centered Explainable AI in Public Health and Biomedical Sciences

学术论文 ArXiv 重要度: 8

介绍PHAX框架，利用结构化论证生成公共卫生和生物医学科学中用户中心的AI解释。

👨‍🔬 Bahar İlgen, Akshat Dubey, Georges Hattab

The Interspeech 2025 Speech Accessibility Project Challenge

学术论文 ArXiv 重要度: 7

2025年Interspeech语音无障碍项目挑战赛启动，旨在提升ASR系统对语音障碍者的识别性能。

👨‍🔬 Xiuwen Zheng, Bornali Phukon, Jonghwan Na, Ed Cutrell, Kyu Han, Mark Hasegawa-Johnson, Pan-Pan Jiang, Aadhrik Kuila, Colin Lea, Bob MacDonald, Gautam Mantena, Venkatesh Ravichandran, Leda Sari, Katrin Tomanek, Chang D. Yoo, Chris Zwilling

UserBench: An Interactive Gym Environment for User-Centric Agents

学术论文 ArXiv 重要度: 7

Large Language Models (LLMs)-based agents have made impressive progress in reasoning and tool use, enabling them to solve complex tasks. However, their ability to proactively collaborate with users, especially when goals are vague, evolving, or indirectly expressed, remains underexplored. To address this gap, we introduce UserBench, a user-centric benchmark designed to evaluate agents in multi-turn, preference-driven interactions. UserBench features simulated users who start with underspecified goals and reveal preferences incrementally, requiring agents to proactively clarify intent and make grounded decisions with tools. Our evaluation of leading open- and closed-source LLMs reveals a significant disconnect between task completion and user alignment. For instance, models provide answers that fully align with all user intents only 20% of the time on average, and even the most advanced models uncover fewer than 30% of all user preferences through active interaction. These results highlight the challenges of building agents that are not just capable task executors, but true collaborative partners. UserBench offers an interactive environment to measure and advance this critical capability.

👨‍🔬 Cheng Qian, Zuxin Liu, Akshara Prabhakar, Zhiwei Liu, Jianguo Zhang, Haolin Chen, Heng Ji, Weiran Yao, Shelby Heinecke, Silvio Savarese, Caiming Xiong, Huan Wang

XAI for Point Cloud Data using Perturbations based on Meaningful Segmentation

学术论文 ArXiv 重要度: 7

提出一种基于有意义分割的点云数据XAI方法，生成易于人类理解的解释。

👨‍🔬 Raju Ningappa Mulawade, Christoph Garth, Alexander Wiebel

Bridging Synthetic and Real-World Domains: A Human-in-the-Loop Weakly-Supervised Framework for Industrial Toxic Emission Segmentation

学术论文 ArXiv 重要度: 7

提出CEDANet框架，结合公民科学和弱监督域适应，实现工业有毒排放分割。

👨‍🔬 Yida Tao, Yen-Chia Hsu

Supervised Quantum Image Processing

学术论文 ArXiv 重要度: 6

比较四种量子图像表示法的压缩性能，探讨量子核在分类问题中的表现。

👨‍🔬 Marco Parigi, Mehran Khosrojerdi, Filippo Caruso, Leonardo Banchi

Exploring the Stratified Space Structure of an RL Game with the Volume Growth Transform

学术论文 ArXiv 重要度: 6

探索RL游戏嵌入空间的分层结构，提出体积增长变换方法分析代理行为。

👨‍🔬 Justin Curry, Brennan Lagasse, Ngoc B. Lam, Gregory Cox, David Rosenbluth, Alberto Speranzon

Staining and locking computer vision models without retraining

学术论文 ArXiv 重要度: 6

介绍无需重新训练即可染色和锁定计算机视觉模型的新方法，保护知识产权。

👨‍🔬 Oliver J. Sutton, Qinghua Zhou, George Leete, Alexander N. Gorban, Ivan Y. Tyukin

🤖 AI资讯日报

📊 今日趋势总结

Ask HN: Is the rate of progress in AI exponential?

Ask HN: Anyone concerned about NYC Local Law 144?

50% Cheaper GPUs for cloud-computing / Saving devs 50% compared to AWS

Ask HN: What's the pain using current AI algorithms?

NLP, AI, ML, bots – a passing trend or much more? What's your take on this?

The AI Crackpot Index

Common Lisp + Machine Learning Internship at Google (Mountain View, CA)

Ask HN: Dipping my toes with artificial intelligence and what to expect? (CS)

The Next Bill Gates or Albert Einstein in AI “Chris Clark” – Yourobot

Ask HN: Thoughts on grad school? (CS PhD)

Show HN: Startup Raising capital through Book Sales

Bioinformatician

google/trax

modelscope/modelscope

nl8590687/ASRT_SpeechRecognition

Olow304/memvid

zyddnys/manga-image-translator

google-deepmind/pysc2

netease-youdao/EmotiVoice

NVIDIA/cutlass

aladdinpersson/Machine-Learning-Collection

Secure Tug-of-War (SecTOW): Iterative Defense-Attack Training with Reinforcement Learning for Multimodal Model Security

UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding

Foundation Models for Demand Forecasting via Dual-Strategy Ensembling

ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports

PHAX: A Structured Argumentation Framework for User-Centered Explainable AI in Public Health and Biomedical Sciences

The Interspeech 2025 Speech Accessibility Project Challenge

UserBench: An Interactive Gym Environment for User-Centric Agents

XAI for Point Cloud Data using Perturbations based on Meaningful Segmentation

Bridging Synthetic and Real-World Domains: A Human-in-the-Loop Weakly-Supervised Framework for Industrial Toxic Emission Segmentation

Supervised Quantum Image Processing

Exploring the Stratified Space Structure of an RL Game with the Volume Growth Transform

Staining and locking computer vision models without retraining

📅 历史日报目录