Posts

2025

A Deep Dive into On-Policy TD Control: The SARSA Algorithm

18 September 2025·9 mins

Deep dive into SARSA, a foundational on-policy TD control algorithm for reinforcement learning.

Temporal Difference: Bootstrapping in Reinforcement Learning

17 September 2025·2 mins

Understanding the TD learning update rule and bootstrapping in reinforcement learning.

Monte Carlo Learning in RL

15 September 2025·13 mins

Guide to Monte Carlo methods in RL: learning from complete episodes and full returns.

Model Free RL: Prediction, Control, and the MRP-MDP Duality

14 September 2025

Originally published on NeuraForge

My Three Months at Relativity: Building AI for Legal Tech

7 September 2025·6 mins

Reflections on building AI for legal tech during my Applied Science internship at Relativity.

Reinforcement Learning Essentials: MDPs & Optimal Control

9 August 2025

Originally published on NeuraForge

Beyond Supervised Learning: Unlocking AI's Potential with Reinforcement Learning

14 July 2025

Originally published on NeuraForge

Implementing GPT-Style Attention: A Step-by-Step Guide with PyTorch

21 January 2025

Originally published on NeuraForge

The Ultimate Guide to Preparing Text Data for Language Modeling with PyTorch

6 January 2025

Originally published on NeuraForge

PyTorch in Practice: Essential Building Blocks for Modern Deep Learning

2 January 2025

Originally published on Medium
