Pinned Posts

Deep Dive into Claude Code Harness - Chapter 1: Memory Management

What makes the Claude Code harness best-in-class? Let's dive into its leaked source code to learn about the rigorous engineering within. Turns out, its memory management is remarkably similar to the wiki system described in my previous post.

Introducting Wiki-Skills: Agent Skill for Managing Markdown Vaults

Kaparthy shared a viral note about a building an LLM-managed 'personal wiki' - which I interpret as a specific instance of a locally stored markdown vault. Here, I present Wiki Skills - a set of agentic Skills that captures the process of creating, querying, and updating this knowledge vault.

Autoresearch: Autonomous Hill-Climbing for Any Optimizable System

Run optimization experiments while you sleep

Coffee Water Composition

TL;DR: Boston’s tap water is terrible for making high quality coffee (wrt aroma and taste). No need to start at zero with deionized water – spike your water to get better results. In these notes, I aim to derive a recipe for optimizing my flat whites using Boston’s (filtered) tap water.

A Review on Agentic Memory

Managing context for long-running agents, across sessions, or swarms of agents, remains an active area of research and development. This is a survey of the current landscaspe in the world of Agentic memory.

Python Tooling

An index of relevant Python tooling references in this blog

Tower Defense

A Javascript version of the original Flash-based Tower Defense, circa 2008

Testing your code in multiple environments with nox and uv

Often in production, you might want to test your code in different python versions or environment variables. You could use the `matrix` command in Github CI, but it's much more tedious to do the same thing locally. Instead of manually setting up each environment and variable, Nox helps you automate this kind of testing. Let's see what it takes to set up Nox and test my Poetry-managed package in three different Python environments.
Read More

uv & Python Virtual Environment Tools

uv is a super fast Python package and project manager, written in Rust. It purports to be a drop-in replacement for many of the familiar tools we use, such as Pip, Conda, Poetry, Virtualenv, Pipx, etc… But how realistic is this? What is the developer experience if we ditch our existing tools for uv? Kevin Renskers wrote a nice comparison between Poetry and uv, and came to the conclusion that the number one strength of uv is in how it manages virtual environments. So in this brain dump, I’ll focus on virtual environment management aspect of uv, and compare it to other tools like poetry, conda, and virtualenv.

Read More

Pytest Fixtures

Fixtures in pytest are functions that provide a fixed baseline for tests, allowing resources (like database connections, input data, etc.) to be accessed and reused across multiple test cases. They can also handle setup and teardown for each test. Upon test initialization, fixtures are executed, and their results cached to be used and reused. The persistence of this cache depends on a specified “scope”. In this note, my intention is to jot down important concepts about fixtures and show some common ways in which they’re used.

Read More

Pytest Markers & Parametrization

Pytest markers are like tags. Tag your tests to keep them organized. Use markers to indicate test priority, or group them by performance, integration, or acceptance. Use them to skip tests under certain conditions, or mark them as expected failures. Use parametrization to run your test under different parameters.

Read More