Open Source

I build stuff because it makes me better and it's really fun.

I tend to build things that are useful for me, and hopefully, they will be useful for others.

If you find something interesting or want to build something together, Email Me.

ML paper reconstructions

I reconstruct papers and popular models from scratch in either PyTorch or JAX.

MlxCLI

Python image

CLI functions to run LLMs from terminal.

Run large ML models optimized for Apple Silicon directly from your terminal.

Links:

GitHub

MlxServer

Python image
NextJS image

Create local server to run LLMs.

Run inference on large ML models locally optimized for Apple Silicon.


Anthropic Bedrock

NodeJS Image
NextJS Image
Python Image

SDKs to interact with Anthropic's models on AWS Bedrock.

Typescript and python SDKs that perform all of AWS Bedrock auth and provide clean functions to interact with Anthropic's models on Bedrock.


Scrape Wikipedia

GoLang Image

Concurrently scrape wikipedia and tokenize the outputs.

An open-source Wikipedia scraper using GoLang (for concurrency), and then tokenize those outputs using openai's tiktoken tokenizer.

Links:

GitHub

Monosemanticity

NodeJS Image
NextJS Logo
Python

A 80x faster visualization of feature activations.

I take the data from Anthropic's dictionary learning and index using redis to make retrieval 80x+ faster.


Augment ML

NodeJS Image
NextJS Logo
Python

Multi-modal labeling and RLHF tagging.

An open-source infrastructure for labeling multimodal data while enabling RLHF tagging and augmenting your existing training data at no cost.

Links:

Website

Tinygrad Docs

NodeJS Image
NextJS Logo

Wrote docs and examples for Tinygrad.

I wrote and tested an example of every function in the Tensor and NN libraries for Tinygrad. I wrote all of the docs and examples on how to use the library on MNIST and more. Tinygrad is a ML framework that is focused on making it really easy to build a model. It's also there to provide optimizations for inference but those come with some tradeoffs.


Little book of DL

Summarizing all of high level Deep Learning.

I wrote a summary of all of the high level concepts of deep learning. I think this is really important because it gives a fundamental, first-principle understanding of everything that is going on in Deep Learning.

Links:

GitHub

NPM Library AptosJS

NodeJS Image
Aptos Logo

NodeJS library to interact with the Aptos Blockchain.

Created a NodeJS library that provides react webhooks to interact with the Aptos blockchain. I created this because I want to remove all complexity when interacting with blockchains. I believe the only reason they are not widely adopted is because of the barrier to entry in the form of complexity.


CambrianML

NextJS Logo
Pinecone Logo
Redis Logo
LangChain Logo

Web app to interact with arXiv better.

Cambrian is a web application that allows you to get the most out of any arXiv paper. It allows you to chat with papers, search for papers, send them to friends, and share your papers publically.

Links:

Website

Fine-tuning LLaMA with LoRA

PyTorch Logo
Python Logo
Huggingface Logo

I fine-tuned LLaMA with LoRA on a sentiment task.

I took the contents of a HuggingFace Twitter sentiment dataset and fine-tuned LLaMA with LoRA on it. I use LoRA to fine-tune the model for training efficiency.

Links:

GitHub

Deep NN in NumPy

Python Logo

I train a deep neural network in NumPy.

Training a neural network using PyTorch is easy, all of the primitives are given to you, but what if you had to implement it from scratch? I did just that, I implemented a deep neural network in NumPy.

Links:

GitHub

Open-Source Chat UI

NodeJS Logo
NextJS Logo
Postgres Logo

I created the ChatGPT UI and made it open-source.

I thought it was cool if I made the ChatGPT UI open-source. Moreover, it would give me more experience with streaming words as they come from the OpenAI API, and optimization in storing/sharing chats.


Essay Embedding Search

Python Logo
LangChain Logo
Chroma Logo

Search through Paul Graham Essays using embeddings.

I created a Python script that allows you to search for any phrase in Paul Graham's Essays. It uses embeddings to find the most similar essay to your query. I use ChromaDB as the embedding database.

Links:

GitHub

Collaborative Text Editor

NodeJS Logo
NextJS Logo
Redis Logo
SocketIO Logo

Work with other people in a single text editor.

I created a collaborative text editor using NextJS and SocketIO. It allows you to work with other people in a single text editor. I built this to further my understanding of sockets in a real-time application. Worked many optimizations as well.

Links:

GitHub

Cybersyn Data Visualization

NodeJS Logo
Snowflake Logo
NextJS Logo

I created a BI tool for data visualization.

I built this BI tool to show the power of the Snowflake Data Marketplace. It allows you to go grab any dataset and manipulate it such that you can build your app on top of it. It is similar to the AppStore but for data. Also, Cybersyn is cool.


AI Blog Generator

NextJS Image
NodeJS Image

Web application to generate SEO blog articles

I created this because I wanted to see if LLM can create SEO-optimized blog articles. The theory behind it is if someone is starting a new startup, they can rank for keywords with low KD on Google and basically get cheap traffic.


Reverse Video Playback

NodeJS Logo
NextJS Logo
FFMpeg Logo

Render a video in reverse.

I created this web application to render a video in reverse. I used FFMPEG & NextJS to do this. Moreover, this is done in serverless functions on Vercel.

Links:

GitHub

Notion to Blog

NodeJS Image

Notion as a CMS

I wanted to see if I can use Notion as a CMS. This way all changes to a potential blog for a web application would update in minutes on the site.

Links:

GitHub

Field Trip Records Front-End

NodeJS Logo
NextJS Logo
Framer Logo
Tailwind Logo

Front-end site for FTR record label.

I created the front-end site for the FTR record label. I created it using NextJS and Framer Motion. I also used TailwindCSS for styling. I used Vercel for hosting.


Randomization Experiment Interface

Python Logo

Control trial randomization

A script to perform randomization for experiments. It randomly assigns different testing groups and calculates regression p-values as an interface.

Links:

GitHub