Yongji Fu

Yongji Fu 符永骥

I am Yongji Fu (符永骥), an MSc student in Robotics Engineering at the University of Bristol (2025.09 – 2026.10), advised by Nathan F. Lepora and Guanqun Cao. Before Bristol I received my BSc in Information Management and Information Systems from Chongqing University of Posts and Telecommunications.

Goal   To build robotic and agentic systems that continuously learn and iteratively self-improve through interaction with the physical world.

Research Interests   large-scale machine learning · world model for robot learning · continuous self-evolving agent · general-purpose loco-manipulation

Learning to Search, Searching to Learn: A Closed-Loop Framework for Large-Scale Vehicle Routing Problems

Learning to Search, Searching to Learn: A Closed-Loop Framework for Large-Scale Vehicle Routing Problems

Yongji Fu, Yong Wang, Jun Deng, et al.

NeurIPS 2026 (submission), Under review, 2025

TouchSteer: Grounding Natural Language in Tactile Perception via Steering Vectors

Guanqun Cao, Yongji Fu, Yi Zhou, Gaojie Jin, Zhenyu Lu, Shan Luo

IEEE Transactions on Robot Learning (TRL) (submission), Under review, 2025

Learning Realistic Expressions for Humanoid Face Robots

Yongji Fu, et al.

ICRA 2026 (submission), Under review, 2025

Visuo-Tactile Latent World Models

Yongji Fu, et al.

ICRA 2026 (submission), Under review, 2025

Constructing Dynamic S-boxes Based on Chaos and Irreducible Polynomials for Image Encryption

Constructing Dynamic S-boxes Based on Chaos and Irreducible Polynomials for Image Encryption

Chunlei Luo, Yong Wang, Yongji Fu, et al.

Nonlinear Dynamics(Springer,JCR Q1,IF 6.0), 2024

Continually Learning Interactive Robot

An embodied agent that keeps expanding its behavior repertoire through ongoing human–robot interaction — new skills, new object concepts, and new language grounding are acquired online rather than baked in at training time.

  • HRI
  • Continual Learning
  • Multimodal
  • Agent

AURA: Autoresearch via Reflective Adaptation for Compound AI Systems

Inspired by Karpathy's *autoresearch* direction, AURA is a sample-efficient prompt optimizer for compound AI systems: after every rollout it hands the full trace back to the LLM and asks for one named edit to its own prompt. Across multi-hop QA, instruction following, and AIME-style math, AURA matches GRPO with up to 35× fewer rollouts and beats MIPROv2 by ~10 points on aggregate.

  • LLM
  • Prompt Optimization
  • Compound AI
  • Reflection
  • Autoresearch

Real-time Packaging QA on a Hazardous-Explosive Production Line

Industrial vision QA system for a live hazardous-powder-explosive packaging line. ≥ 99% accuracy over a 30-day production run on the customer's RTX 4060; operator-level alarms via a fixed-protocol online-monitoring API.

  • Multi-scale Feat
  • Boundary Loss
  • Cython
  • TensorRT
  • Reparam