Building an LLM from scratch in 60 minutes

Room 13 • Tue 27 Oct • 11:15–12:15 • AI & Agents • Advanced
Armed with nothing but C# and the .NET libraries, we'll embark on a whirlwind tour to build a toy GPT-like LLM from scratch. Expect a hardcore, deep-dive coding session that demystifies the essential building blocks of LLMs, including tokenization, embeddings, layers, self-attention, transformers, automatic differentiation, gradient descent, and more. By the end of the session, we'll train our LLM on a corpus of text and attempt to have a meaningful (or gibberish) conversation with it.

About the speaker

Bart De Smet

Originally from Belgium, Bart relocated in 2007 to work at the Microsoft headquarters after graduating from Ghent University, by which time he was already a C# MVP. After a few years working on WPF and .NET, Bart spent quite some time on reactive programming, helping to build Reactive Extensions (Rx) and large-scale event processing systems powering various cloud services. After a brief stint in AutoML focused on Python performance tuning, Bart now builds next-generation distributed systems in the office of the CTO.