Sitemap

A list of all the posts and pages found on the site. For you robots out there, an XML version is available for digesting as well.

Pages

Page Not Found

Archive Layout with Content

Posts by Category

Posts by Collection

CV

Markdown

Page not in menu

Page Archive

Portfolio

Publications

Sitemap

Posts by Tags

Talk map

Talks and presentations

Teaching

Terms and Privacy Policy

Blog posts

Jupyter notebook markdown generator

Posts

Future Blog Post

less than 1 minute read

Published: January 01, 2199

This post will show up by default. To disable scheduling of future posts, edit _config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published: August 14, 2015

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published: August 14, 2014

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published: August 14, 2013

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published: August 14, 2012

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Portfolio

Portfolio item number 1

Short description of portfolio item number 1

Portfolio item number 2

Short description of portfolio item number 2

Publications

Self-Adaptive Multi-Agent Systems for StarCraft: Brood War

arXiv preprint, 2021

Efficient Dual-Process Cognitive Recommender Balancing Accuracy and Diversity

International Conference on Database Systems for Advanced Applications (DASFAA 2022), 2022

Promoting Quality and Diversity in Population-Based Reinforcement Learning via Hierarchical Trajectory Space Exploration

ICRA 2022, 2022

Robust Multi-Agent Reinforcement Learning Driven by Correlated Equilibrium

arXiv preprint, 2022

Heterogeneous Graph Neural Network-based Imitation Learning for Gate Sizing Acceleration

ICCAD 2022, 2022

Multiagent Q-Learning with Sub-Team Coordination

NeurIPS 2022, 2022

Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System

Conference on Robot Learning (CoRL 2022), 2022

Learning to Shape Rewards Using a Game of Two Partners

AAAI 2023, 2023

Cooperative Multiagent Transfer Learning with Coalition Pattern Decomposition

IEEE Transactions on Games, vol. 16, no. 2, 2023

Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints

ICLR 2023, 2023

Leveraging Large Language Models for Optimised Coordination in Textual Multi-Agent Reinforcement Learning

arXiv preprint, 2023

Research and Applications of Game Intelligence

SCIENTIA SINICA Informationis, vol. 53, no. 10, 2023

Traj-MAE: Masked Autoencoders for Trajectory Prediction

ICCV 2023, 2023

ChessGPT: Bridging Policy Learning and Language Modeling

NeurIPS 2023, 2023

A Survey on Algorithms for Nash Equilibria in Finite Normal-Form Games

Computer Science Review, vol. 51, 2024

ROS-LLM: A ROS Framework for Embodied AI with Task Feedback and Structured Reasoning

arXiv preprint arXiv:2406.19741, 2024

Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control

ICML 2024, 2024

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

arXiv preprint arXiv:2411.03562, 2024

GUI Agents with Foundation Models: A Comprehensive Survey

arXiv preprint arXiv:2411.04890, 2024

Pangu-Agent: A Fine-tunable Generalist Agent with Structured Reasoning

arXiv preprint arXiv:2312.14878, 2024

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents

ICLR 2025, 2025

SPA-Bench: A Comprehensive Benchmark for Smartphone Agent Evaluation

ICLR 2025, 2025

Lightweight Neural App Control

ICLR 2025, 2025

AppVLM: A Lightweight Vision Language Model for Online App Control

ICLR 2025 Workshop on Foundation Models in the Wild, 2025

Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction

AAMAS 2025, 2025

Deep Research Agents: A Systematic Examination and Roadmap

arXiv preprint arXiv:2506.18096, 2025

PerPilot: Personalizing VLM-based Mobile Agents via Memory and Exploration

arXiv preprint arXiv:2508.18040, 2025

Memento: Fine-tuning LLM Agents Without Fine-tuning LLMs

arXiv preprint arXiv:2508.16153, 2025

Kolb-Based Experiential Learning for Generalist Agents With Human-Level Kaggle Data Science Performance

arXiv preprint, 2025

Hi-Agent: Hierarchical Vision-Language Agents for Mobile Device Control

arXiv preprint arXiv:2510.14388, 2025

Learning Precise Affordances from Egocentric Videos for Robotic Manipulation

ICCV 2025, 2025

Adapting Like Humans: A Metacognitive Agent with Test-Time Reasoning

arXiv preprint arXiv:2511.23262, 2025

See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm

arXiv preprint arXiv:2512.08629, 2025

Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control

NeurIPS 2025, 2025

Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Temporal Grounding

NeurIPS 2025, 2025

VideoAgent2: Enhancing the LLM-Based Agent System for Long-Form Video Understanding by Uncertainty-Aware CoT

NeurIPS 2025 Workshop on Scaling Environments for Agents, 2025

ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning

NeurIPS 2025, 2025

VSC-RL: Advancing Autonomous Vision-Language Agents with Variational Subgoal-Conditioned Reinforcement Learning

NeurIPS 2025 Workshop on VLM4RWD, 2025

Darwin Mobile Agent: A Roadmap for Self-Evolution

arXiv preprint, 2026

K²-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control

ICLR 2026, 2026

ScenDroid: A Scenario-Level Benchmark for Long-Horizon, Time-Evolving GUI Agents

ICLR 2026 Workshop on Lifelong Agents, 2026

ViMo: A Generative Visual GUI World Model for App Agents

ICLR 2026, 2026

TAME: A Trustworthy Test-Time Evolution of Agent Memory with Systematic Benchmarking

arXiv preprint arXiv:2602.03224, 2026

ResMAS: Resilience Optimization in LLM-based Multi-agent Systems

AAAI 2026, 2026

AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search

AAAI 2026, 2026

A Robot Operating System Framework for Using Large Language Models in Embodied AI

Nature Machine Intelligence, 2026

Are GUI Agents Focused Enough? Automated Distraction via Semantic-level UI Element Injection

arXiv preprint arXiv:2604.07831, 2026

Memory Intelligence Agent

arXiv preprint arXiv:2604.04503, 2026

InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking

arXiv preprint arXiv:2604.02971, 2026

AFE-Master: Enhancing LLM-Driven Autonomous Feature Engineering with Domain-Specific Language Parsing and Guided Local Search

Proceedings of the ACM Web Conference 2026 (WWW 2026), 2026

Beyond Syntax: Action Semantics Learning for App Agents

CVPR 2026, 2026

Talks

Talk 1 on Relevant Topic in Your Field

Published: March 01, 2012

This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!

Tutorial 1 on Relevant Topic in Your Field

Published: March 01, 2013

More information here

Talk 2 on Relevant Topic in Your Field

Published: February 01, 2014

More information here

Conference Proceeding talk 3 on Relevant Topic in Your Field

Published: March 01, 2014

This is a description of your conference proceedings talk. Note the different field in type; you can put anything in this field.

Teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.