AI News Daily - 2025/12/28

Casting a SPELL: Sentence Pairing Exploration for LLM Limitation-breaking

Large language models (LLMs) have revolutionized software development through AI-assisted coding tools, enabling developers with limited programming expertise to create sophisticated applications. However, this accessibility extends to malicious actors who may exploit these powerful tools to generate harmful software. Existing jailbreaking research primarily focuses on general attack scenarios against LLMs, with limited exploration of malicious code generation as a jailbreak target. To address this gap, we propose SPELL, a comprehensive testing framework specifically designed to evaluate the weakness of security alignment in malicious code generation. Our framework employs a time-division selection strategy that systematically constructs jailbreaking prompts by intelligently combining sentences from a prior knowledge dataset, balancing exploration of novel attack patterns with exploitation of successful techniques. Extensive evaluation across three advanced code models (GPT-4.1, Claude-3.5, and Qwen2.5-Coder) demonstrates SPELL's effectiveness, achieving attack success rates of 83.75%, 19.38%, and 68.12% respectively across eight malicious code categories. The generated prompts successfully produce malicious code in real-world AI development tools such as Cursor, with outputs confirmed as malicious by state-of-the-art detection systems at rates exceeding 73%. These findings reveal significant security gaps in current LLM implementations and provide valuable insights for improving AI safety alignment in code generation applications.

👨‍🔬 Yifan Huang, Xiaojun Jia, Wenbo Guo, Yuqiang Sun, Yihao Huang, Chong Wang, Yang Liu

Measuring all the noises of LLM Evals

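Academic paper | arXiv | Importance: 7

Systematically defines and measures three types of noise in LLM evaluations and proposes an all-pairs method. It finds that prediction noise is typically larger than data noise, which provides a basis for efficient experimental design.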

👨‍🔬 Sida Wang
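
To make the two noise sources named in the summary concrete, here is a minimal sketch based on the law of total variance. It is not the paper's estimator or its all-pairs method. It assumes "prediction noise" means the spread across repeated samples on the same question and "data noise" means the spread of per-question mean scores across questions. The function name `decompose_noise` and the toy scores are hypothetical.

```python
from statistics import fmean, pvariance

def decompose_noise(scores_by_question):
    """Split score variance into a within-question and a between-question part.

    scores_by_question: one list per benchmark question, each holding the
    scores of repeated model samples on that question (equal lengths).
    """
    lengths = {len(s) for s in scores_by_question}
    if len(lengths) != 1:
        raise ValueError("every question needs the same number of samples")
    # Assumed "prediction noise": mean variance across repeated samples of a question.
    prediction = fmean(pvariance(s) for s in scores_by_question)
    # Assumed "data noise": variance of per-question mean scores across questions.
    data = pvariance([fmean(s) for s in scores_by_question])
    # Variance of all scores pooled together.
    total = pvariance([x for s in scores_by_question for x in s])
    return prediction, data, total

# Hypothetical 0/1 correctness scores: 4 questions x 4 samples each.
scores = [[1, 0, 1, 0], [1, 1, 0, 0], [1, 1, 1, 0], [0, 0, 1, 0]]
prediction, data, total = decompose_noise(scores)
print(prediction, data, total)
```

With equal sample counts per question, the two parts add up to the pooled variance, so the sketch can serve as a sanity check before trying a more careful estimator.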

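PhononBench: A Large-Scale Phonon-Based Benchmark for Dynamical Stability in Crystal Generation

Academic paper | arXiv | Importance: 7

Introduces the first large-scale benchmark, based on phonon calculations, for dynamical stability in crystal generation. It shows that crystals generated by current models have an average stability rate of only 25.83%, and it identifies 28,119 stable structures.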

👨‍🔬 Xiao-Qi Han, Ze-Feng Gao, Peng-Jie Guo, Zhong-Yi Lu

C2LLM Technical Report: A New Frontier in Code Retrieval via Adaptive Cross-Attention Pooling

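Academic paper | arXiv | Importance: 6

Proposes the C2LLM family of code embedding models, which uses a multi-head attention pooling module to generate sequence embeddings and sets a new record among models in its class on the MTEB-Code benchmark.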

👨‍🔬 Jin Qin, Zihan Liao, Ziyin Zhang, Hang Yu, Peng Di, Rui Wang
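
As a rough illustration of what pooling by attention means, the sketch below turns a sequence of token vectors into a single embedding using one query vector per head. It is a generic simplification, not C2LLM's actual module: it assumes no key/value projections, and the `queries` argument stands in for hypothetical learned parameters.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    z = sum(exps)
    return [e / z for e in exps]

def attention_pool(tokens, queries):
    """Pool token vectors into one embedding with one query vector per head.

    tokens:  list of T token vectors, each of length D.
    queries: list of H query vectors (hypothetical learned parameters),
             each of length D // H; head h attends over slice h of each token.
    """
    dim, heads = len(tokens[0]), len(queries)
    if dim % heads != 0:
        raise ValueError("hidden size must be divisible by the number of heads")
    head_dim = dim // heads
    pooled = []
    for h, q in enumerate(queries):
        start, stop = h * head_dim, (h + 1) * head_dim
        slices = [t[start:stop] for t in tokens]
        # Scaled dot-product score between this head's query and every token.
        scores = [sum(a * b for a, b in zip(q, s)) / math.sqrt(head_dim)
                  for s in slices]
        weights = softmax(scores)
        # The weighted average of token slices is this head's part of the embedding.
        pooled += [sum(w * s[i] for w, s in zip(weights, slices))
                   for i in range(head_dim)]
    return pooled

# Hypothetical input: 3 tokens with hidden size 4, pooled by 2 heads.
tokens = [[1.0, 0.0, 2.0, 0.0], [0.0, 1.0, 0.0, 2.0], [1.0, 1.0, 1.0, 1.0]]
embedding = attention_pool(tokens, [[0.0, 0.0], [0.0, 0.0]])
print(embedding)
```

With all-zero queries every token gets the same weight, so the result reduces to plain mean pooling. Non-zero queries let each head emphasize different tokens.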

SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance

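Academic paper | arXiv | Importance: 6

Proposes SMART, a small language model with a hierarchical processing structure and only 45.51M parameters. On engineering-manual assistance tasks it achieves 21.3% higher accuracy than GPT-2, with fewer hallucinations.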

👨‍🔬 Divij Dudeja, Mayukha Pal

LookPlanGraph: Embodied Instruction Following Method with VLM Graph Augmentation

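Academic paper | arXiv | Importance: 6

Proposes LookPlanGraph, a method that uses a vision-language model to dynamically update the scene graph as the environment changes. It outperforms static-graph methods on instruction-following tasks in both simulation and on real robots.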

👨‍🔬 Anatoly O. Onishchenko, Alexey K. Kovalev, Aleksandr I. Panov

Improving the Convergence Rate of Ray Search Optimization for Query-Efficient Hard-Label Attacks

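Academic paper | arXiv | Importance: 5

Proposes ARS-OPT, a momentum-based optimization algorithm that accelerates ray search in hard-label black-box adversarial attacks and outperforms 13 state-of-the-art methods on ImageNet and CIFAR-10.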

👨‍🔬 Xinjie Xu, Shuyu Cheng, Dongwei Xu, Qi Xuan, Chen Ma

Leveraging Lightweight Entity Extraction for Scalable Event-Based Image Retrieval

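Academic paper | arXiv | Importance: 5

Proposes a lightweight two-stage retrieval pipeline that combines event entity extraction with the BEiT-3 model. It achieves an average precision of 0.559 on the OpenEvents v1 benchmark, significantly outperforming the baselines.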

👨‍🔬 Dao Sy Duy Minh, Huynh Trung Kiet, Nguyen Lam Phu Quy, Phu-Hoa Pham, Tran Chi Nguyen

Learning Factors in AI-Augmented Education: A Comparative Study of Middle and High School Students

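Academic paper | arXiv | Importance: 4

Compares the key learning factors of middle school and high school students in AI-assisted programming education. Middle school students tend to evaluate holistically, while high school students' evaluations are more differentiated, which provides a basis for age-appropriate AI integration strategies.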

👨‍🔬 Gaia Ebli, Bianca Raimondi, Maurizio Gabbrielli
