Ph.D. Student @ CAS
Ming Ma
Internship Experience
Alibaba Tongyi Lab 2026.01 - Present
Research Intern
Omni Base Team: Multi-modal Agents
Ant Group 2025.10 - 2026.01
Research Intern
Ling Base Team: Pre-training Quality
Microsoft Research Asia (MSRA) 2025.07 - 2025.10
Research Intern
Multi-Agent & Debugging
Papers
- DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems (ICLR 2026)A closed-loop intervention–validation framework that auto-debugs LLM multi-agent systems beyond passive failure-log analysis.
- A Tale of Two Graphs: Separating Knowledge Exploration from Outline Structure for Open-Ended Deep Research (ICML 2026)A co-evolving dual-graph architecture (outline + knowledge) for open-ended deep-research agents that detects gaps and steers retrieval.
- Label Words as Local Task Vectors in In-Context Learning (ACL 2026)Reveals that in-context learning relies on distributed local task vectors carried by label words rather than a single global encoding.
- Omni-Decision: A Progressive Evidence-State Agent System for Omni-Modal QAAn evidence-state-driven Planner–Critic–Reducer multi-agent system for long-video and omni-modal question answering.
- QAQ: Bidirectional Semantic Coherence for Selecting High-Quality Synthetic Code InstructionsA bidirectional question–answer coherence filter for selecting high-value synthetic code-instruction data.
- SimLens for Early Exit in Large Language Models: Eliciting Accurate Latent Predictions with One More TokenA training-free intermediate-layer decoding method enabling accurate early exit on single-token LLM tasks.
- From end-to-end to step-by-step: Learning to abstract via abductive reinforcement learning (IJCAI 2025)An abductive RL framework using drift-diffusion models to switch between deadlock and exploration in long-horizon sparse-reward tasks.
Education
Chinese Academy of Sciences (CAS)
2022.09 - Present
Shandong University
2019.03 - 2022.06
Beijing Institute of Technology
2019.09 - 2020.06
Naval Aviation University
2017.08 - 2019.03
Links
Life & Photography