Tri Dao is an Assistant Professor in the Computer Science Department at Princeton University and Chief Scientist at Together AI. He obtained his PhD in Computer Science from Stanford University in 2023. He works at the intersection of machine learning and systems, and his research interests include sequence models with long-range memory and structured matrices for compact deep learning models. His work on FlashAttention and Mamba has been widely adopted by organizations and research labs to speed up training and inference of large language models.