The third chapter teaches how to code attention mechanisms, the breakthrough behind LLMs. We start with a simple version with non-trainable weights and refine it step by step until we arrive at multi-head attention as used in GPT-2.
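To make the starting point concrete, here is a minimal sketch (not the chapter's exact code) of simplified self-attention with non-trainable weights: attention scores are plain dot products between token embeddings, normalized with softmax and then used to mix the embeddings into context vectors. The input values below are dummy data.

```python
import torch

torch.manual_seed(0)
inputs = torch.rand(4, 3)  # 4 tokens with 3-dimensional embeddings (dummy values)

attn_scores = inputs @ inputs.T                    # pairwise dot-product similarities
attn_weights = torch.softmax(attn_scores, dim=-1)  # normalize so each row sums to 1
context_vecs = attn_weights @ inputs               # weighted sum over all token embeddings

print(context_vecs.shape)  # torch.Size([4, 3])
```

The trainable version replaces the raw embeddings with learned query, key, and value projections, and multi-head attention runs several of these in parallel and concatenates the results.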
I wanted to run a Docker image of a little app I made on my Synology NAS without the extra effort of publishing the image to a registry. Here's how I did it.
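Before the details, here is a rough sketch of the general flow, assuming the image is exported with `docker save`, copied over SSH, and imported with `docker load`; the image name, user, host, paths, and port are placeholders, not necessarily the exact commands used in this post.

```bash
# Build the image locally and export it to a tarball.
docker build -t my-app:latest .
docker save my-app:latest -o my-app.tar

# Copy the tarball to the NAS (user, hostname, and path are placeholders).
scp my-app.tar admin@synology-nas:/volume1/docker/

# On the NAS, load the image and start a container from it.
ssh admin@synology-nas "docker load -i /volume1/docker/my-app.tar"
ssh admin@synology-nas "docker run -d --name my-app -p 8080:8080 my-app:latest"
```

Depending on the DSM setup, the `docker` commands on the NAS may require `sudo`, and once loaded the image can also be started from the Container Manager UI instead of the command line.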