Mohammad Taufeeque
I am a research engineer at FAR.AI. My current research interests are scalable interpretability, post-training interventions that robustly preserve values like honesty across contexts and personas, and improving the introspective awareness of LLMs.
At FAR, my prior work has included scalable interpretability via sparse codebook features, mechanistic analysis of goals and planning in recurrent agents, red-teaming frontier LLMs, and preserving honesty during RL training with deception probes.
I graduated from IIT Bombay with a B.Tech in Computer Science, where my bachelor’s thesis with Prof. Shivaram Kalyanakrishnan won the NeurIPS 2021 Reconnaissance Blind Chess competition. Previously, I interned at Microsoft Research with Prof. Sunita Sarawagi and Dr. Sriram Rajamani, and at TU Braunschweig with Prof. Thomas Deserno.
Publications
- Fianchetto: Speed, Belief, Guile, Caution to Win at Reconnaissance Blind ChessBachelor’s Thesis, 2022
-