Adam Sohn

A small corner of the web. Projects below, links to find me elsewhere at the bottom.

Writing and data visualization

GRPO, step by step
An interactive explainer of Group Relative Policy Optimization — the RL objective behind modern reasoning models, walked through piece by piece with click-to-pronounce math.
Reliably Incorrect
Notes and visualizations on the agent capability threshold — when LLM agents stop being reliably wrong and start being occasionally right.
λ-bench variance
Five Sonnet 4.6 runs on the same LamBench task, classified by Opus 4.6, rendered as flame charts — what variance looks like when you stop averaging it away.

Projects

intercept
Turn any website into a typed JSON API using self-improving agents.
agent-tuning
Using recursion to achieve predictable agent output.
doomberg-terminal
A Chrome extension that performs algorithmic trading using Robinhood's web interface and market data.
alphadidactic
An iterative research agent: searches academic research, applies it to time-series data, and probes it to find novel discoveries.

Elsewhere


adamsohn.com