A Deep Dive into On-Policy TD Control: The SARSA Algorithm · 18 September 2025 · 9 mins · Deep dive into SARSA, a foundational on-policy TD control algorithm for reinforcement learning.
Temporal Difference: Bootstrapping in Reinforcement Learning · 17 September 2025 · 2 mins · Understanding the TD learning update rule and bootstrapping in reinforcement learning.
Monte Carlo Learning in RL · 15 September 2025 · 13 mins · Guide to Monte Carlo methods in RL: learning from complete episodes and full returns.
Model Free RL: Prediction, Control, and the MRP-MDP Duality · 14 September 2025 · Originally published on NeuraForge
My Three Months at Relativity: Building AI for Legal Tech · 7 September 2025 · 6 mins · Reflections on building AI for legal tech during my Applied Science internship at Relativity.
Reinforcement Learning Essentials: MDPs & Optimal Control · 9 August 2025 · Originally published on NeuraForge
Beyond Supervised Learning: Unlocking AI's Potential with Reinforcement Learning · 14 July 2025 · Originally published on NeuraForge
Implementing GPT-Style Attention: A Step-by-Step Guide with PyTorch · 21 January 2025 · Originally published on NeuraForge
The Ultimate Guide to Preparing Text Data for Language Modeling with PyTorch · 6 January 2025 · Originally published on NeuraForge
PyTorch in Practice: Essential Building Blocks for Modern Deep Learning · 2 January 2025 · Originally published on Medium