Hi, I am a senior at the University of Wisconsin–Madison, studying mathematics and computer science.
I am researching large language models in Professor Kangwook Lee’s lab.
I also work on reinforcement learning for robotics at Ohalo.
I serve as the Deputy Director of the Wisconsin AI Safety Initiative.
For all of my work experience, see my resume.
Here, I test how the mathematical theory of generalization bounds works in practice.
Here, I implement a decoder-only language model in PyTorch and build a trainer for it, using a Shakespeare dataset.
Here, I implement PPO and show it working on Breakout and CartPole.
Here, I implement RLHF using my own transformer implementation and a simple policy gradient algorithm.
For details on my projects, see the projects page.