Articles

Letters·January 25, 2026·7 min read

Stack Money for the Wrong Reasons.

The uncomfortable truth about wealth, revenge, and why dark fuel burns just as bright.

Letters·January 20, 2026·9 min read

Let Your Calm Be the Threat.

How to negotiate from strength: knowing when to speak, when to stay silent, and how to leave room for both sides to win.

letters negotiation business

Letters·January 19, 2026·16 min read

Become Undefinable.

A letter from my future self on becoming undefinable, building a personal monopoly, and why the paradox is the edge.

letters discipline self-improvement

Tutorials·January 19, 2026·12 min read

CS336 Notes: Lecture 17 - Alignment, RL 2

RL foundations for LLMs: policy gradients, baselines for variance reduction, GRPO implementation details, and practical training considerations for reasoning models.

machine-learning alignment stanford-cs336

Tutorials·January 18, 2026·16 min read

CS336 Notes: Lecture 16 - Alignment, RL 1

Advanced RL for alignment: PPO implementation details, GRPO as a simpler alternative, overoptimization risks, and case studies from DeepSeek R1, Kimi K1.5, and Qwen 3.

machine-learning alignment stanford-cs336
