On generative LLMs

I’m going to detox from generative LLMs for a while. To be clear, I think generative LLMs (erroneously referred to as “AI”) are incre...

Oct 01, 2024

Accelerating Network Training

This post is a (non-exhaustive) summary of the techniques I’ve used to accelerate neural network training. I’m going to focus on the ...

Read More

Other Posts

On generative LLMs

I’m going to detox from generative LLMs for a while. To be clear, I think generative LLMs (erroneously referred to as “AI”) are incredible tools, and I do believe there is merit in kn...

Oct 01, 2024

An Undergraduate Understanding of System Security

Before I get too far removed from the classroom, I’m going to reflect on the fundamentals of information security that I learned in undergrad and applied in a system security course I...

Apr 20, 2024

It works on my machine

“But it worked on my machine!” is usually the fault of the programmer or some quirk of configuration working behind the scenes to foil what would otherwise be smoothly executing softw...

Apr 01, 2024

Making Models Smaller

When we think about solving problems using machine learning, usually our first concern is building a model that is good at whatever it is we want it to do. For object detection system...

Dec 27, 2023

Accelerating Network Training

This post is a (non-exhaustive) summary of the techniques I’ve used to accelerate neural network training. I’m going to focus on the most fundamental adaptations to improve the traini...

Dec 25, 2023

Unconventional Training

The “conventional” approach to supervised machine learning usually follows this recipe: Some fixed dataset \(\mathcal{D}\) of labeled data \((x, y)\) is split into train, validation,...

Dec 23, 2023

ML Without Neural Networks

Neural networks have taken over ML, and I don’t see that changing in the near-term. With everyone focused on transformers and generative AI, it’s worthwhile to take a step back and ex...

Dec 15, 2023