Duration
36 months
Lifelong Multimodal Language Learning by Explaining and Exploiting Compositional Knowledge
Duration
36 months
Host Institution
University of Hamburg
Funding
DFG Research Grant
Focus
Robust multimodal lifelong learning
LUMO studies how multimodal AI systems can remain reliable when tasks change. The project focuses on lifelong learning across vision-language understanding and language-conditioned robotic manipulation.
We combine controlled benchmarks, concept-based explainability, neuro-symbolic learning, and sim2real experiments to understand how compositional knowledge is formed, preserved, and reused.
Build diagnostic datasets and simulation environments for compositional vision-language learning and language-conditioned robotic manipulation.
Explain how concepts and relations emerge during continual learning using concept-based XAI and training-dynamics analysis.
Inject symbolic constraints into embedding spaces to improve retention, compositionality, and robust transfer across tasks.
Transfer insights from simulation to the NICO and NICOL robots and compare concept representations across simulated and real settings.