End-to-End AI Research Journey
Models
Fields
Mechanics
The Objective
This journal documents my end-to-end learning journey across Artificial Intelligence. The goal is to build a profound understanding of neural network architectures, from fundamental Multi-Layer Perceptrons to state-of-the-art Large Language Models.
Machine learning is evolving from academic theory into the foundational infrastructure of software. This website serves as a public index of my research, mathematical derivations, and code implementations.
Rather than treating models as black boxes, my approach
focuses on understanding the underlying mechanics. This
includes studying the optimization functions in
By implementing these systems, leveraging techniques like
My latest findings, implementation notes, and insights are chronologically documented below:
-
The First Equation of Machine Learning: Linear Regression in Vanilla Rust
A deep dive into the fundamental equation of Machine Learning, exploring the math behind Linear Regression and implementing it in Rust from scratch.