OS Mixture of Depths
Implemented Mixture-of-Depths: Dynamically allocating compute in transformer-based language models by Raposo et al.
OS ShortGPT
Unofficial implementations of block/layer-wise pruning methods for LLMs.
OS Stealing Part of a Production Language Model
Implemented Stealing Part of a Production Language Model by Carlini et al.
General-GPT
Initial exploration of fine-tuning GPT-2 for interleaved CLIP embedding input and output. The goal of this project is to showcase that GPT is able to directly reason across multiple modalities.
shivaen.org
My Personal Website
ZEST
Zoom Education Suite, an add-on to Zoom calls that my team and I built as part of HooHacks 2020
TreasureAI
An AI that attempts to find the treasure in an OpenAI gym envionment