Rafael Rafailov is a researcher in artificial intelligence working primarily on decision making and reinforcement learning. He is currently at Thinking Machines, where he worked on the company's first public release - Tinker. Before that he completed a Ph.D. in Computer Science at Stanford University and was a student researcher at Google DeepMind, where he co-authored influential works on embodied AI - the RT-X and OpenVLA series and RLHF post-training - Direct Preference Optimization and Generative Reward Models which have been widely adopted in industry. He thinks a lot about Meta-Learning these days.

TEDAI 2025 - Home Page - AI Conference at San Francisco

TEDAI Talks - Featured Speakers and Presentations

TEDAI Panels - Expert Discussions and Industry Insights

TEDAI Hackathon - Innovation Competition

Rafael Rafailov

Reinforcement Learning at Thinking Machines Lab

Previously at Stanford, Google Deepmind and UC Berkeley

About TEDAI San Francisco - Our Mission and Vision

Contact TEDAI San Francisco - Get in Touch

Privacy Policy - TEDAI San Francisco

AI Glossary - TEDAI San Francisco Terms and Definitions

Follow TEDAI San Francisco on Instagram

Connect with TEDAI San Francisco on Facebook

Join TEDAI San Francisco on LinkedIn

Watch TEDAI San Francisco on YouTube

Follow TEDAI San Francisco on TikTok

Follow TEDAI San Francisco on X (Twitter)