Self-Improving and Self-Adapting Agents
We study test-time methods for self-improving and self-adapting agents, an emerging frontier of artificial intelligence in which autonomous systems not only perform tasks but also learn from interactions, refine their own behaviors, and even modify their own code or learning processes over time.
As large language models (LLMs) are increasingly deployed in open-ended settings, understanding and improving their behavior through test-time methods has become a central challenge. In this work, we examine how LLMs can self-adapt and self-improve through structured test-time interventions, without additional training. First, we propose a human- and property-centric evaluation framework grounded in a meta-analysis of over 150 papers, identifying 21 core properties of natural language prompts and analyzing their underexplored influence across models and tasks. Building on these insights, we introduce LongGuide, a method that automatically generates dual-stream guidelines to steer generation toward desired language and output distributions. We further present the first systematic study of format bias in LLMs and propose actionable strategies to reduce over-reliance on superficial input formats. To enhance reasoning and robustness, we propose Multi-expert input modeling, which simulates collaborative decision-making among multiple agents, and Adversarial In-Context Learning, a competitive setup in which generator and discriminator roles iteratively refine inputs. Across more than 20 tasks, these methods significantly outperform strong baselines in truthfulness, generalization, and robustness. Our findings offer a comprehensive foundation for building adaptable LLM agents that improve through interaction, feedback, and structure-aware test-time methods, without relying on additional gradient-based training.
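To make the generator–discriminator interaction concrete, the sketch below shows one plausible test-time refinement loop of the kind Adversarial In-Context Learning describes. It is a minimal illustration under stated assumptions: the `call_llm` function, the prompt templates, and the acceptance-based stopping rule are hypothetical placeholders, not the exact procedure or prompts used in our method.

```python
from typing import Callable

# Illustrative sketch of a generator-discriminator refinement loop at test time.
# Assumption: `call_llm` is a user-supplied function mapping a prompt string to a
# model completion string (e.g., a wrapper around any chat/completions API).
# The prompt wording and the 'ACCEPT' stopping convention are hypothetical.

def adversarial_refine(
    task_input: str,
    call_llm: Callable[[str], str],
    max_rounds: int = 3,
) -> str:
    """Iteratively refine an answer through generator/discriminator interplay."""
    # Generator: produce an initial candidate answer for the task.
    answer = call_llm(f"Task: {task_input}\nProvide your best answer.")

    for _ in range(max_rounds):
        # Discriminator: critique the current answer and decide if it is acceptable.
        critique = call_llm(
            f"Task: {task_input}\n"
            f"Candidate answer: {answer}\n"
            "Point out factual errors, missing steps, or weak reasoning. "
            "Reply with 'ACCEPT' if the answer needs no changes."
        )
        if critique.strip().upper().startswith("ACCEPT"):
            break

        # Generator: revise the answer using the discriminator's feedback.
        answer = call_llm(
            f"Task: {task_input}\n"
            f"Previous answer: {answer}\n"
            f"Critique: {critique}\n"
            "Rewrite the answer so it addresses every point in the critique."
        )
    return answer
```

Because both roles are played by prompted models at inference time, the loop requires no gradient updates; the quality gains come entirely from the structured interaction between the two roles.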