Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
Posts
Future Blog Post
Published:
This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.
Blog Post number 4
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 3
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 2
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 1
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
portfolio
Portfolio item number 1
Short description of portfolio item number 1
Portfolio item number 2
Short description of portfolio item number 2 
publications
Self-Adaptive Multi-Agent Systems for StarCraft: Brood War
arXiv preprint, 2021
Efficient Dual-Process Cognitive Recommender Balancing Accuracy and Diversity
International Conference on Database Systems for Advanced Applications (DASFAA 2022), 2022
Promoting Quality and Diversity in Population-Based Reinforcement Learning via Hierarchical Trajectory Space Exploration
ICRA 2022, 2022
Robust Multi-Agent Reinforcement Learning Driven by Correlated Equilibrium
arXiv preprint, 2022
Heterogeneous Graph Neural Network-based Imitation Learning for Gate Sizing Acceleration
ICCAD 2022, 2022
Multiagent Q-Learning with Sub-Team Coordination
NeurIPS 2022, 2022
Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System
Conference on Robot Learning (CoRL 2022), 2022
Learning to Shape Rewards Using a Game of Two Partners
AAAI 2023, 2023
Cooperative Multiagent Transfer Learning with Coalition Pattern Decomposition
IEEE Transactions on Games, vol. 16, no. 2, 2023
Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints
ICLR 2023, 2023
Leveraging Large Language Models for Optimised Coordination in Textual Multi-Agent Reinforcement Learning
arXiv preprint, 2023
Research and Applications of Game Intelligence
SCIENTIA SINICA Informationis, vol. 53, no. 10, 2023
Traj-MAE: Masked Autoencoders for Trajectory Prediction
ICCV 2023, 2023
ChessGPT: Bridging Policy Learning and Language Modeling
NeurIPS 2023, 2023
A Survey on Algorithms for Nash Equilibria in Finite Normal-Form Games
Computer Science Review, vol. 51, 2024
ROS-LLM: A ROS Framework for Embodied AI with Task Feedback and Structured Reasoning
arXiv preprint arXiv:2406.19741, 2024
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control
ICML 2024, 2024
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
arXiv preprint arXiv:2411.03562, 2024
GUI Agents with Foundation Models: A Comprehensive Survey
arXiv preprint arXiv:2411.04890, 2024
Pangu-Agent: A Fine-tunable Generalist Agent with Structured Reasoning
arXiv preprint arXiv:2312.14878, 2024
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
ICLR 2025, 2025
Lightweight Neural App Control
ICLR 2025, 2025
AppVLM: A Lightweight Vision Language Model for Online App Control
ICLR 2025 Workshop on Foundation Models in the Wild, 2025
Deep Research Agents: A Systematic Examination and Roadmap
arXiv preprint arXiv:2506.18096, 2025
PerPilot: Personalizing VLM-based Mobile Agents via Memory and Exploration
arXiv preprint arXiv:2508.18040, 2025
Memento: Fine-tuning LLM Agents Without Fine-tuning LLMs
arXiv preprint arXiv:2508.16153, 2025
Kolb-Based Experiential Learning for Generalist Agents With Human-Level Kaggle Data Science Performance
arXiv preprint, 2025
Hi-Agent: Hierarchical Vision-Language Agents for Mobile Device Control
arXiv preprint arXiv:2510.14388, 2025
Adapting Like Humans: A Metacognitive Agent with Test-Time Reasoning
arXiv preprint arXiv:2511.23262, 2025
See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm
arXiv preprint arXiv:2512.08629, 2025
Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control
NeurIPS 2025, 2025
Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Temporal Grounding
NeurIPS 2025, 2025
VideoAgent2: Enhancing the LLM-Based Agent System for Long-Form Video Understanding by Uncertainty-Aware CoT
NeurIPS 2025 Workshop on Scaling Environments for Agents, 2025
ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning
NeurIPS 2025, 2025
VSC-RL: Advancing Autonomous Vision-Language Agents with Variational Subgoal-Conditioned Reinforcement Learning
NeurIPS 2025 Workshop on VLM4RWD, 2025
Darwin Mobile Agent: A Roadmap for Self-Evolution
arXiv preprint, 2026
ScenDroid: A Scenario-Level Benchmark for Long-Horizon, Time-Evolving GUI Agents
ICLR 2026 Workshop on Lifelong Agents, 2026
ViMo: A Generative Visual GUI World Model for App Agents
ICLR 2026, 2026
TAME: A Trustworthy Test-Time Evolution of Agent Memory with Systematic Benchmarking
arXiv preprint arXiv:2602.03224, 2026
ResMAS: Resilience Optimization in LLM-based Multi-agent Systems
AAAI 2026, 2026
A Robot Operating System Framework for Using Large Language Models in Embodied AI
Nature Machine Intelligence, 2026
Are GUI Agents Focused Enough? Automated Distraction via Semantic-level UI Element Injection
arXiv preprint arXiv:2604.07831, 2026
Memory Intelligence Agent
arXiv preprint arXiv:2604.04503, 2026
InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking
arXiv preprint arXiv:2604.02971, 2026
AFE-Master: Enhancing LLM-Driven Autonomous Feature Engineering with Domain-Specific Language Parsing and Guided Local Search
Proceedings of the ACM Web Conference 2026 (WWW 2026), 2026
Beyond Syntax: Action Semantics Learning for App Agents
CVPR 2026, 2026
talks
Talk 1 on Relevant Topic in Your Field
Published:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
Conference Proceeding talk 3 on Relevant Topic in Your Field
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
teaching
Teaching experience 1
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Teaching experience 2
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.
