Kaveh Hassani — AI Research Scientist
About Me
I am an AI Research Scientist at Meta Superintelligence Labs where I work on post-training and evaluating LLMs for continual improvement based on large-scale online user interactions. My current research focus is on self-improving LLMs and multi-agent reinforcement learning for LLMs. Before joining Meta Superintelligence Labs, I was an AI Research Scientist at Meta's Ranking and Foundational AI Research group, working on large-scale generative learning for recommender systems. I was also a Machine Learning Lecturer at the University of Toronto teaching Fundamentals of Deep Learning. Prior to that, I held the position of Principal AI Research Scientist and Research Manager at the Autodesk AI Lab. I earned my PhD from the University of Ottawa, with a focus on deep learning for common-sense reasoning in virtual environments. My research has been published in top-tier AI venues including NeurIPS, ICLR, ICML, and TMLR.
Research Interests
Self-Improving LLMs · LLM Reasoning · Multi-Agent RL · Post-Training · Generative Retrieval
News
- [Jun 2026] One paper accepted at ICML 2026: Structure Enables Effective Self-Localization of Errors in LLMs
- [Mar 2026] Two papers accepted at EACL 2026: Imbalanced Gradients in RL Post-Training and How to Make LLMs Strong Node Classifiers?
- [2025] One paper accepted at KDD 2025: Generating Long Semantic IDs in Parallel for Recommendation
- [2025] One paper accepted at ICLR 2025: Learning Graph Quantized Tokenizers
- [2025] Joined Meta Superintelligence Labs
