Beyond Syntax: Action Semantics Learning for App Agents
B Tang, D Luo, J Liu, J Chen, S Gong, J Hao, J Wang, K Shao. (2026). "Beyond Syntax: Action Semantics Learning for App Agents." CVPR 2026.
B Tang, D Luo, J Liu, J Chen, S Gong, J Hao, J Wang, K Shao. (2026). "Beyond Syntax: Action Semantics Learning for App Agents." CVPR 2026.
H Liang, J Hao, J Liu, Y Ma, Z Cao, J Liang, K Shao, Z Du, F Ni, Y Yuan. (2026). "AFE-Master: Enhancing LLM-Driven Autonomous Feature Engineering with Domain-Specific Language Parsing and Guided Local Search." Proceedings of the ACM Web Conference 2026 (WWW 2026).
KY Lee, Y Huang, Z He, H Zhou, W Luo, K Shao, M Fang, J Wang. (2026). "InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking." arXiv preprint arXiv:2604.02971.
J Qiao, W Meng, Y Cheng, Z Lin, Z Zhang, X Tan, J Gong, K Shao, Y Xie. (2026). "Memory Intelligence Agent." arXiv preprint arXiv:2604.04503.
W Yang, C Jin, H Zhu, W Luo, D Yuen, K Shao, H Huang, J Duan, J Cao. (2026). "Are GUI Agents Focused Enough? Automated Distraction via Semantic-level UI Element Injection." arXiv preprint arXiv:2604.07831.
CE Mower, Y Wan, H Yu, A Grosnit, J Gonzalez-Billandon, M Zimmer, K Shao, J Wang. (2026). "A Robot Operating System Framework for Using Large Language Models in Embodied AI." Nature Machine Intelligence.
Y Li, L Li, Z Wu, Q Liao, J Hao, K Shao, F Xu, Y Li. (2026). "AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search." AAAI 2026.
Z Zhou, Z Liu, J Liu, Q Shao, Y Wang, K Shao, D Jin, F Xu. (2026). "ResMAS: Resilience Optimization in LLM-based Multi-agent Systems." AAAI 2026.
Y Cheng, J Zhou, Y Hu, Y Chen, H Zhou, M Chen, Z Zhang, K Shao, Y Xie. (2026). "TAME: A Trustworthy Test-Time Evolution of Agent Memory with Systematic Benchmarking." arXiv preprint arXiv:2602.03224.
D Luo, B Tang, K Li, G Papoudakis, J Song, S Gong, J Hao, J Wang, K Shao. (2026). "ViMo: A Generative Visual GUI World Model for App Agents." ICLR 2026.
Z Wu, Y Kang, D Sheng, J Xing, G Wu, D Yuen, D Mo, Y Jing, K Li, W Luo, K Shao. (2026). "ScenDroid: A Scenario-Level Benchmark for Long-Horizon, Time-Evolving GUI Agents." ICLR 2026 Workshop on Lifelong Agents.
Z Wu, D Mo, H Lu, J Xing, J Liu, Y Jing, K Li, K Shao, J Hao, Y Shi. (2026). "K²-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control." ICLR 2026.
D Beechey, D Yuen, J Liu, D Luo, T He, W Luo, J Wang, K Shao. (2026). "Darwin Mobile Agent: A Roadmap for Self-Evolution." arXiv preprint.
Q Wu, J Liu, J Hao, J Wang, K Shao. (2025). "VSC-RL: Advancing Autonomous Vision-Language Agents with Variational Subgoal-Conditioned Reinforcement Learning." NeurIPS 2025 Workshop on VLM4RWD.
S Huang, L Yang, Y Song, S Chen, L Cui, Z Wan, Q Zeng, Y Wen, K Shao. (2025). "ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning." NeurIPS 2025.
Z Zhi, Q Wu, W Li, Y Li, K Shao, K Zhou. (2025). "VideoAgent2: Enhancing the LLM-Based Agent System for Long-Form Video Understanding by Uncertainty-Aware CoT." NeurIPS 2025 Workshop on Scaling Environments for Agents.
J Hu, Z Cheng, S Gong, I Guan, J Hao, J Wang, K Shao. (2025). "Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Temporal Grounding." NeurIPS 2025.
G Papoudakis, T Coste, J Hao, J Wang, K Shao. (2025). "Succeed or Learn Slowly: Sample Efficient Off-Policy Reinforcement Learning for Mobile App Control." NeurIPS 2025.
H Zhao, W Ding, Y Yang, Z Tian, L Yang, K Shao, J Wang. (2025). "See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm." arXiv preprint arXiv:2512.08629.
Y Li, Z He, Y Huang, Z Xiao, C Yu, M Fang, K Shao, J Wang. (2025). "Adapting Like Humans: A Metacognitive Agent with Test-Time Reasoning." arXiv preprint arXiv:2511.23262.
G Li, N Tsagkas, J Song, R Mon-Williams, S Vijayakumar, K Shao. (2025). "Learning Precise Affordances from Egocentric Videos for Robotic Manipulation." ICCV 2025.
Z Wu, H Lu, J Xing, C Zhang, Y Li, Y Zhu, Y Yang, Y Jing, K Li, K Shao. (2025). "Hi-Agent: Hierarchical Vision-Language Agents for Mobile Device Control." arXiv preprint arXiv:2510.14388.
H Bou-Ammar, A Grosnit, A Maraval, R SN, Z Zhao, J Doran, G Paolo, K Shao, J Wang. (2025). "Kolb-Based Experiential Learning for Generalist Agents With Human-Level Kaggle Data Science Performance." arXiv preprint.
H Zhou, Y Chen, S Guo, X Yan, KH Lee, Z Wang, KY Lee, G Zhang, K Shao. (2025). "Memento: Fine-tuning LLM Agents Without Fine-tuning LLMs." arXiv preprint arXiv:2508.16153.
X Wang, Z Cui, H Li, Y Zeng, C Wang, R Song, Y Chen, K Shao, Q Zhang. (2025). "PerPilot: Personalizing VLM-based Mobile Agents via Memory and Exploration." arXiv preprint arXiv:2508.18040.
Y Huang, Y Chen, H Zhang, K Li, H Zhou, M Fang, L Yang, X Li, L Shang, K Shao. (2025). "Deep Research Agents: A Systematic Examination and Roadmap." arXiv preprint arXiv:2506.18096.
T Jafferjee, J Ziomek, T Yang, Z Dai, J Wang, M Taylor, K Shao, J Wang. (2025). "Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction." AAMAS 2025.
G Papoudakis, T Coste, Z Wu, J Hao, J Wang, K Shao. (2025). "AppVLM: A Lightweight Vision Language Model for Online App Control." ICLR 2025 Workshop on Foundation Models in the Wild.
F Christianos, G Papoudakis, T Coste, J Hao, J Wang, K Shao. (2025). "Lightweight Neural App Control." ICLR 2025.
J Chen, D Yuen, B Xie, Y Yang, G Chen, Z Wu, L Yixing, X Zhou, W Liu, K Shao. (2025). "SPA-Bench: A Comprehensive Benchmark for Smartphone Agent Evaluation." ICLR 2025.
T Wang, Z Wu, J Liu, J Hao, J Wang, K Shao. (2025). "DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents." ICLR 2025.
F Christianos, G Papoudakis, M Zimmer, T Coste, Z Wu, J Chen, K Shao, J Wang. (2024). "Pangu-Agent: A Fine-tunable Generalist Agent with Structured Reasoning." arXiv preprint arXiv:2312.14878.
S Wang, W Liu, J Chen, Y Zhou, W Gan, X Zeng, Y Che, S Yu, X Hao, K Shao. (2024). "GUI Agents with Foundation Models: A Comprehensive Survey." arXiv preprint arXiv:2411.04890.
A Grosnit, A Maraval, J Doran, G Paolo, A Thomas, RSHN Beevi, K Shao, J Wang. (2024). "Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level." arXiv preprint arXiv:2411.03562.
Z Xiong, R Vuorio, J Beck, M Zimmer, K Shao, S Whiteson. (2024). "Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control." ICML 2024.
CE Mower, Y Wan, H Yu, A Grosnit, J Gonzalez-Billandon, M Zimmer, K Shao, J Wang. (2024). "ROS-LLM: A ROS Framework for Embodied AI with Task Feedback and Structured Reasoning." arXiv preprint arXiv:2406.19741.
H Li, W Huang, Z Duan, DH Mguni, K Shao, J Wang, X Deng. (2024). "A Survey on Algorithms for Nash Equilibria in Finite Normal-Form Games." Computer Science Review, vol. 51.
X Feng, Y Luo, Z Wang, H Tang, M Yang, K Shao, D Mguni, Y Du, J Wang. (2023). "ChessGPT: Bridging Policy Learning and Language Modeling." NeurIPS 2023.
H Chen, J Wang, K Shao, F Liu, J Hao, C Guan, G Chen, PA Heng. (2023). "Traj-MAE: Masked Autoencoders for Trajectory Prediction." ICCV 2023.
J Hao, K Shao, K Li, D Li, H Mao, S Hu, Z Wang. (2023). "Research and Applications of Game Intelligence." SCIENTIA SINICA Informationis, vol. 53, no. 10.
O Slumbers, DH Mguni, K Shao, J Wang. (2023). "Leveraging Large Language Models for Optimised Coordination in Textual Multi-Agent Reinforcement Learning." arXiv preprint.
D Mguni, A Sootla, J Ziomek, O Slumbers, Z Dai, K Shao, J Wang. (2023). "Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints." ICLR 2023.
T Zhou, F Zhang, K Shao, Z Dai, K Li, W Huang, W Wang, B Wang, D Li. (2023). "Cooperative Multiagent Transfer Learning with Coalition Pattern Decomposition." IEEE Transactions on Games, vol. 16, no. 2.
D Mguni, T Jafferjee, J Wang, N Perez-Nieves, W Song, F Tong, M Taylor, K Shao. (2023). "Learning to Shape Rewards Using a Game of Two Partners." AAAI 2023.
Z Dai, T Zhou, K Shao, DH Mguni, B Wang, J Hao. (2022). "Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System." Conference on Robot Learning (CoRL 2022).
W Huang, K Li, K Shao, T Zhou, M Taylor, J Luo, D Wang, H Mao, J Hao. (2022). "Multiagent Q-Learning with Sub-Team Coordination." NeurIPS 2022.
X Zhou, J Ye, CW Pui, K Shao, G Zhang, B Wang, J Hao, G Chen. (2022). "Heterogeneous Graph Neural Network-based Imitation Learning for Gate Sizing Acceleration." ICCAD 2022.
Y Hu, K Shao, D Li, J Hao, W Liu, Y Yang, J Wang, Z Zhu. (2022). "Robust Multi-Agent Reinforcement Learning Driven by Correlated Equilibrium." arXiv preprint.
J Miao, T Zhou, K Shao, M Zhou, W Zhang, J Hao, Y Yu, J Wang. (2022). "Promoting Quality and Diversity in Population-Based Reinforcement Learning via Hierarchical Trajectory Space Exploration." ICRA 2022.
Y Gao, K Shao, Z Duan, Z Wei, D Li, B Wang, M Zhao, J Hao. (2022). "Efficient Dual-Process Cognitive Recommender Balancing Accuracy and Diversity." International Conference on Database Systems for Advanced Applications (DASFAA 2022).
K Shao, Z Dai, T Zhou, Y Zhu, D Li, H Mao, J Hao. (2021). "Self-Adaptive Multi-Agent Systems for StarCraft: Brood War." arXiv preprint.
Conference proceedings talk at Testing Institute of America 2014 Annual Conference, Los Angeles, CA, USA
Talk at London School of Testing, London, UK
Tutorial at UC-Berkeley Institute for Testing Science, Berkeley, CA, USA
Talk at UC San Francisco, Department of Testing, San Francisco, CA, USA