Adam Sohn
A small corner of the web. Projects below, links to find me elsewhere at the bottom.
Writing and data visualization
- GRPO, step by step
- An interactive explainer of Group Relative Policy Optimization — the RL objective behind modern reasoning models, walked through piece by piece with click-to-pronounce math.
- Reliably Incorrect
- Notes and visualizations on the agent capability threshold — when LLM agents stop being reliably wrong and start being occasionally right.
- λ-bench variance
- Five Sonnet 4.6 runs on the same LamBench task, classified by Opus 4.6, rendered as flame charts — what variance looks like when you stop averaging it away.
Projects
- intercept
- Turn any website into a typed JSON API using self-improving agents.
- agent-tuning
- Using recursion to achieve predictable agent output.
- doomberg-terminal
- A Chrome extension that performs algorithmic trading using Robinhood's web interface and market data.
- alphadidactic
- An iterative research agent: searches academic research, applies it to time-series data, and probes it to find novel discoveries.
Elsewhere
adamsohn.com