📊 研究方向热度分析
推理能力研究关注思维链、多步推理、逻辑演绎等认知能力提升。
- VLM-SubtleBench: How Far Are VLMs from Human-Level Subtle Comparative Reasoning?
- Slumbering to Precision: Enhancing Artificial Neural Network Calibration Through Sleep-like Processes
- Is continuous CoT better suited for multi-lingual reasoning?
- DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding
- R2F: Repurposing Ray Frontiers for LLM-free Object Navigation
大模型智能体研究持续火热,涵盖多智能体协作、工具调用、自主决策等核心能力。
- Long-Short Term Agents for Pure-Vision Bronchoscopy Robotic Autonomy
- ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning
- PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents
- CMMR-VLN: Vision-and-Language Navigation via Continual Multimodal Memory Retrieval
- One Model Is Enough: Native Retrieval Embeddings from LLM Agent Hidden States
安全与对齐研究备受关注,涵盖模型安全、隐私保护、对抗攻击防御等关键议题。
- Alignment-Aware and Reliability-Gated Multimodal Fusion for Unmanned Aerial Vehicle Detection Across Heterogeneous Thermal-Visual Sensors
- DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding
- UNBOX: Unveiling Black-box visual models with Natural-language
- Visual Self-Fulfilling Alignment: Shaping Safety-Oriented Personas via Threat-Related Images
- Geometrically Constrained Outlier Synthesis
检索增强生成与记忆机制研究持续发展,关注动态索引、结构化存储、持续学习等。
- Slumbering to Precision: Enhancing Artificial Neural Network Calibration Through Sleep-like Processes
- DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding
- CMMR-VLN: Vision-and-Language Navigation via Continual Multimodal Memory Retrieval
- One Model Is Enough: Native Retrieval Embeddings from LLM Agent Hidden States
- IronEngine: Towards General AI Assistant
效率优化研究热度攀升,关注量化、剪枝、推理加速等模型压缩与部署技术。
- DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding
- R2F: Repurposing Ray Frontiers for LLM-free Object Navigation
- Geometrically Constrained Outlier Synthesis
- First Estimation of Model Parameters for Neutrino-Induced Nucleon Knockout Using Simulation-Based Inference
- Improving through Interaction: Searching Behavioral Representation Spaces with CMA-ES-IG
时序预测研究涵盖时空数据建模、多变量预测、异常检测等应用场景。
- GCGNet: Graph-Consistent Generative Network for Time Series Forecasting with Exogenous Variables
- Not All News Is Equal: Topic- and Event-Conditional Sentiment from Finetuned LLMs for Aluminum Price Forecasting
- Beyond Scaling: Assessing Strategic Reasoning and Rapid Decision-Making Capability of LLMs in Zero-sum Environments
- Cross-Domain Uncertainty Quantification for Selective Prediction: A Comprehensive Bound Ablation with Transfer-Informed Betting
- When to Lock Attention: Training-Free KV Control in Video Diffusion
强化学习研究热度高涨,涵盖策略优化、多任务学习、安全RL等前沿方向。
- DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding
- ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning
- A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation
- RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning
- Reviving ConvNeXt for Efficient Convolutional Diffusion Models
该研究方向持续发展,产出多篇高质量论文。
- Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL
- TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection
- Rescaling Confidence: What Scale Design Reveals About LLM Metacognition
- Abundant Intelligence and Deficient Demand: A Macro-Financial Stress Test of Rapid AI Adoption
- Dynamic Knowledge Fusion for Multi-Domain Dialogue State Tracking
机器人与具身智能研究涵盖导航、操控、人机交互等关键应用场景。
- Long-Short Term Agents for Pure-Vision Bronchoscopy Robotic Autonomy
- CMMR-VLN: Vision-and-Language Navigation via Continual Multimodal Memory Retrieval
- R2F: Repurposing Ray Frontiers for LLM-free Object Navigation
- IronEngine: Towards General AI Assistant
- Graph-Instructed Neural Networks for parametric problems with varying boundary conditions
视觉语言模型研究持续活跃,关注多模态对齐、跨模态推理等关键问题。
- VLM-SubtleBench: How Far Are VLMs from Human-Level Subtle Comparative Reasoning?
- Alignment-Aware and Reliability-Gated Multimodal Fusion for Unmanned Aerial Vehicle Detection Across Heterogeneous Thermal-Visual Sensors
- CMMR-VLN: Vision-and-Language Navigation via Continual Multimodal Memory Retrieval
- Visual Self-Fulfilling Alignment: Shaping Safety-Oriented Personas via Threat-Related Images
- R2F: Repurposing Ray Frontiers for LLM-free Object Navigation
🔥 本周亮点论文
Long-Short Term Agents for Pure-Vision Bronchoscopy Robotic Autonomy
ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning
PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents
DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding
A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation
RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning
Alignment-Aware and Reliability-Gated Multimodal Fusion for Unmanned Aerial Vehicle Detection Across Heterogeneous Thermal-Visual Sensors
UNBOX: Unveiling Black-box visual models with Natural-language
👥 作者关系图谱分析
以下展示了本周cs.AI论文中发表量最多的作者及其合作关系。节点大小表示论文数量,连线粗细表示合作频次。
🏆 高产作者榜单
Hao Wang (5篇)
本周在cs.AI领域发表多篇论文,研究方向涵盖多个前沿领域。
Arman Cohan (4篇)
本周在cs.AI领域发表多篇论文,研究方向涵盖多个前沿领域。
Lei Zhang (4篇)
本周在cs.AI领域发表多篇论文,研究方向涵盖多个前沿领域。
Mohamad Alkadamani (4篇)
本周在cs.AI领域发表多篇论文,研究方向涵盖多个前沿领域。
Halim Yanikomeroglu (4篇)
本周在cs.AI领域发表多篇论文,研究方向涵盖多个前沿领域。
Fan Yang (4篇)
本周在cs.AI领域发表多篇论文,研究方向涵盖多个前沿领域。
Mikhail Pautov (4篇)
本周在cs.AI领域发表多篇论文,研究方向涵盖多个前沿领域。
Yiran Zhao (3篇)
本周在cs.AI领域发表多篇论文,研究方向涵盖多个前沿领域。
📈 研究趋势洞察
🤖 智能体安全研究升温
多智能体系统的安全问题成为研究热点,MCP服务器风险评估、智能体攻击防御等方向涌现多篇重要工作。
🧠 持续学习与记忆机制
LLM上下文窗口的需求分页技术、持续学习的记忆机制等研究为智能体长期运行提供了新的解决方案。
🦾 具身智能安全评估
LABSHIELD、HomeSafe-Bench等基准测试的出现,标志着具身智能安全评估进入标准化阶段。
⚡ 效率优化深入架构
Expert Threshold Routing、状态空间模型编译器优化等工作推动了大模型效率优化的架构级创新。
🏥 医疗AI多模态发展
Meissa等多模态医疗智能体系统的出现,展示了轻量级医疗AI的实际应用潜力。
🎨 生成模型新范式
ConvNeXt扩散模型、视频到音乐生成等研究展示了生成模型架构创新的新方向。
评论