AI资讯日报 - 2025/11/6

👨‍🔬 Xingyao Wang, Simon Rosenberg, Juan Michelini, Calvin Smith, Hoang Tran, Engel Nyst, Rohit Malhotra, Xuhui Zhou, Valerie Chen, Robert Brennan, Graham Neubig

Visualization Biases MLLM's Decision Making in Network Data Tasks

学术论文 ArXiv 重要度: 7

研究发现可视化技术会显著影响多模态大语言模型在网络数据分析中的判断，产生系统性偏见。

👨‍🔬 Timo Brand, Henry Förster, Stephen G. Kobourov, Jacob Miller

Structured Matrix Scaling for Multi-Class Calibration

学术论文 ArXiv 重要度: 6

提出结构化正则化方法解决多分类校准中的过拟合问题，显著提升概率估计准确性。

👨‍🔬 Eugène Berta, David Holzmüller, Michael I. Jordan, Francis Bach

Grounded Misunderstandings in Asymmetric Dialogue: A Perspectivist Annotation Scheme for MapTask

学术论文 ArXiv 重要度: 6

提出视角主义标注方案分析对话中的理解偏差，发现多重性差异是导致指称错位的主要原因。

👨‍🔬 Nan Li, Albert Gatt, Massimo Poesio

ChiMDQA: Towards Comprehensive Chinese Document QA with Fine-grained Evaluation

学术论文 ArXiv 重要度: 6

发布高质量中文多文档问答数据集，涵盖六大领域6068个问答对，提供细粒度评估体系。

👨‍🔬 Jing Gao, Shutiao Luo, Yumeng Liu, Yuanming Li, Hongji Zeng

Explaining Human Choice Probabilities with Simple Vector Representations

学术论文 ArXiv 重要度: 5

基于向量表示建立人类决策模型，发现匹配/反匹配和最大化/最小化两种策略足以解释随机环境中的选择行为。

👨‍🔬 Peter DiBerardino, Britt Anderson

DQN Performance with Epsilon Greedy Policies and Prioritized Experience Replay

学术论文 ArXiv 重要度: 5

系统研究epsilon贪婪策略和经验回放对DQN性能的影响，为资源受限环境提供强化学习实践建议。

👨‍🔬 Daniel Perkins, Oscar J. Escobar, Luke Green

🤖 AI资讯日报

📊 今日趋势总结

Why Boring Businesses Outlast AI Hype Cycles

Ask HN: What's the pain using current AI algorithms?

Ask HN: Anyone concerned about NYC Local Law 144?

The AI Crackpot Index

Ask HN: Is the rate of progress in AI exponential?

NLP, AI, ML, bots – a passing trend or much more? What's your take on this?

Ask HN: What would you read to learn about 'artificial intelligence'?

Ask HN: Dipping my toes with artificial intelligence and what to expect? (CS)

Common Lisp + Machine Learning Internship at Google (Mountain View, CA)

Bioinformatician

The Next Bill Gates or Albert Einstein in AI “Chris Clark” – Yourobot

Show HN: Startup Raising capital through Book Sales

Whisper Leak: a side-channel attack on Large Language Models

Watermarking Large Language Models in Europe: Interpreting the AI Act in Light of Technology

Outbidding and Outbluffing Elite Humans: Mastering Liar's Poker via Self-Play and Reinforcement Learning

LiveTradeBench: Seeking Real-World Alpha with Large Language Models

AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing

The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents

Visualization Biases MLLM's Decision Making in Network Data Tasks

Structured Matrix Scaling for Multi-Class Calibration

Grounded Misunderstandings in Asymmetric Dialogue: A Perspectivist Annotation Scheme for MapTask

ChiMDQA: Towards Comprehensive Chinese Document QA with Fine-grained Evaluation

Explaining Human Choice Probabilities with Simple Vector Representations

DQN Performance with Epsilon Greedy Policies and Prioritized Experience Replay

📅 历史日报目录