Self-Improving and Self-Adapting Agents

We study test-time methods for self‑improving and self‑adapting agents which represent an emerging frontier of artificial intelligence in which autonomous systems can not only perform tasks but also learn from interactions, refine their own behaviors, and even modify their own code or learning processes over time.

As large language models (LLMs) are increasingly deployed in open-ended settings, understanding and improving their behavior through test-time methods has become a central challenge. In this work, we examine how LLMs can self-adapt and self-improve without additional training through structured test-time interventions. First, we propose a human- and property-centric evaluation framework grounded in a meta-analysis of over 150 papers, identifying 21 core properties of natural language prompts and analyzing their underexplored influence across models and tasks. Building on these insights, we introduce LongGuide, a method that automatically generates dual-stream guidelines to steer generation toward desired language and output distributions. We further present the first systematic study of format bias in LLMs and propose actionable strategies to reduce over-reliance on superficial input formats. To enhance reasoning and robustness, we propose Multi-expert input modeling, which simulates collaborative agent decision-making, and Adversarial In-Context Learning, a competitive setup that iteratively refines inputs through generator–discriminator interactions. Across 20+ tasks, these methods significantly outperform strong baselines in truthfulness, generalization, and robustness. Our findings offer a comprehensive foundation for building adaptable LLM agents that improve through interaction, feedback, and structure-aware test-time methods—without reliance on additional gradient-based training.

Xuan Long Do
Xuan Long Do
A*STAR Doctoral Student (Aug ‘23)
Co-Supervised by Kenji Kawaguchi

PhD Candidate August 2023 Intake

Hai N. Nguyen
Hai N. Nguyen
Research Intern (Jan ‘25)

My name is Hai, current AI Research Resident at VinAi. I have graduated from Vietnam National University, University of Science (Vietnam). I’m interested in Optimization, Optimal Transport and Large Language Models.

Duy C. Dinh
Duy C. Dinh
Research Intern (Jan ‘25)

My name is Duy. I am currently working as an AI Engineer at Creative Force and graduated from Hanoi University of Science and Technology (HUST). With a strong foundation in machine learning research and a growing passion for Generative AI, I seek opportunities to contribute to meaningful and impactful research.

Trong Xuan Do
Trong Xuan Do
Research Intern (Jan ‘25)

Trong recently graduated from Hanoi University of Science and Technology (HUST). His research focuses on deep learning and improving mathematical reasoning capalibities of large language models (LLMs).

Yiwen Wang
Yiwen Wang
Research Intern (June ‘25)

Research Intern

Duc Anh Nguyen
Duc Anh Nguyen
Research Intern (Jul ‘25)

My name is Duc Anh, I am from Vietnam. Currently, I am working at Qualcomm AI Research as an AI Resident. My research focuses on the intersection of theory and application of Large Language Models, with the goal of improving the efficiency, scalability and robustness of state-of-the-art models.

Min-Yen Kan
Min-Yen Kan
Associate Professor

WING lead; interests include Digital Libraries, Information Retrieval and Natural Language Processing.