Current interests

My long-term goal is to develop systems capable of open-ended scientific knowledge discovery. My current interests revolve around (in decreasing order of active involvement):

  • Autonomous information seeking: uncertainty self-awareness, action-taking attending to information gain maximization, goal-directed exploration.
  • Reinforcement learning for large models: efficient exploratinon capabilities in large search-spaces, open-ended & intrinsic objectives.
  • Predictive world models: providing scalable streams of experiential data, grounding long reasoning trajectories and alleviating hallucinations.
  • Knowledge representations: formal languages (e.g., math or code), hierarchical, and compositional representations.

Research output

Type
Title
Authors
Year
Publisher
Recognitions
Links
conferenceCo-Adaptation of Embodiment and Control with Self-Imitation LearningSergio Hernández-Gutiérrez, Ville Kyrki, Kevin S. Luck2025IROS
workshopRecursive Decomposition with Dependencies for Generic Divide and Conquer ReasoningSergio Hernández-Gutiérrez, Minttu Alakuijala, Alexander V. Nikitin, Pekka Marttinen2024NeurIPS Sys2 Reasoning
workshopFollowing Ancestral Footsteps: Co-Designing Morphology and Behaviour with Self-Imitation LearningSergio Hernández-Gutiérrez, Ville Kyrki, Kevin S. Luck2024EARL RSS (oral presentation) and EWRLBest Workshop Paper Award (EARL RSS)
thesisSolving Reasoning Problems with Large Language Models via Recursive DecompositionSergio Hernández-Gutiérrez, Pekka Marttinen, Alexander Nikitin, Minttu Alakuijala2024Aalto University
seminarA Comprehensive Overview of Goal-Conditioned Hierarchical Reinforcement Learning: Algorithms, Challenges, and Future DirectionsSergio Hernández-Gutiérrez, Vivienne Wang2023Aalto University