Sitemap - 2023 - Matt Rickard

Pseudonyms in American History

Fairchildren

ChatGPT After One Year

McNamara Fallacy

Data Quality in LLMs

Discord and AI GTM

Standard Causes of Human Misjudgment (Munger)

The Unreasonable Effectiveness of Monte Carlo

Razor and Blades Model

Drawbacks of Moving to the Edge

Are Things Getting Worse?

How AI Changes Workflows

Duties of the Board of Directors

Strategies for the GPU-Poor

Take Your Time Making Decisions

The Encyclopedia of Integer Sequences

The Catilinarian Conspiracy

The Model is Not The Product

The AI-neid

Model Merge (Frankenmerge)

The Cost of Index Everything

What if Google Wasn’t The Default?

Copilot is an Incumbent Business Model

Eroom's Law

The Lucretius Problem

The Call to Adventure

AI Agents, Today

Norvig's Agent Definition

The Context Length Observation

To be, or not to be; ay, there’s the point.

Improving RAG: Strategies

Static Sites Aren't Simple Anymore

Lessons from llama.cpp

Why Model Evaluation is Difficult

Mechanical Turks

Regulatory Capture in the Railroad Industry

What if OpenAI Builds This?

On Mixing Client and Server

Infrastructure as Code Will be Written by AI

When A/B Testing Doesn’t Work

Between Images and Text: CLIP

Positioning Yourself Near The Opportunity

Tech Invariants

Horizontal Tuning: Instruction, Chat, and What Else?

Retrieval Augmented Generation

The Silent Todo List

Pitfalls of File-based Routing

Benefits of Small LLMs

Can OpenAI Win Consumer and Enterprise?

The Inner-Platform Effect

Revision: Generative Text-to-UI

Why Is the Frontend Stack So Complicated?

An Intelligent Wikipedia

The Half-Life of the AI Stack

Nagle's Algorithm

Black Swan by Shakespeare

Anticipate the Cheap

Moravec's Paradox

The American Experiment

Incentives Behind Programming Languages

3D as the New Modality

Keep Going

The Benefit of a Personal Notation

Pitfalls of Database Migrations

Generative Interfaces

Lessons From Debugging

The Compression / Learning Duality

Is AI a Platform Shift?

Passkeys, Crypto, and Signing AI Content

Is Data Still a Moat?

Multi-Modal AI is a UX Problem

Observer-Expectancy at Scale

LLMs as System 1 Thinkers

The Fundamental Attribution Error

Good Enough Abstractions

Automate (But Automate Last)

AI Biographers

Customized End-User Software (with AI)

Five Principles from Renaissance Technologies

Moving Upmarket

The Age-old Resistance to Generated Code

Cincinnatus

Undetectable AI

The New Economics of Generating Code

The Inevitability of Scope Creep

Fine-tuning Stable Diffusion XL with Personal Photos

The Low-Hanging Fruit

Beyond Prompt Engineering

The Hardware / Software Cycle

Raft: The Distributed Systems Algorithm

The Widgetification of Apple

Type Constraints for LLM Output

Why Fast?

Technical Metrics to Track in Engineering Orgs

TypeScript Type System Hacks

Capital Intense AI Bets

Imputavi, Computavi, Putavi.

Llama 2 in the Browser

The Contrarian Strategy of OpenAI

GitHub Inbox as GTM

AI and Text-First Interfaces

The Business Source License

The Free Lunch of Model Distillation

Adding Constraints

Copy and Paste Libraries

Beating the Average

A Model API Gateway for 20+ LLMs

De Facto Ports

The Truth About GPU Utilization

Directly Responsible Individuals

Akin's Laws of Spacecraft Design

What is a Prompt Engineer

Learning Via Negativa

Why Tailwind CSS Won

My Everyday LLM Uses

Dark Mode and Marginal Benefit

Llama/Unix

Deterministic, Structured LLM Completions

Bayes' Rule

A Fine-Tuning Marketplace

Majority Rule Developers

NVIDIA's CUDA Monopoly

Prescriptive vs. Descriptive Feedback

Focus On What Could Go Right (Not Wrong)

Apple's Hidden Social Network

Data Versioning

Automatic and Universal Adversarial Prompts

Minimum Effective Dose and Discrete Outcomes

On Compounding

Git Merge Strategies and Algorithms

Modern Samizdat Libraries

Amara's Law

Black Swan Theory

Optimizing $Model.cpp

Commonplace Books

Virtual Machines and Path Dependence

On Getting Married

The RLHF Advantage

Robots.txt For LLMs

Why Did Meta Open-Source Llama 2?

Diffusion of Innovation

Literate Programming with LLMs

Schopenhauer's 36 Stratagems for Winning Arguments

Scale to Zero for AI Workloads

Stop Overengineering

Hard Launches vs. Soft Launches

Virtualized Linux on macOS Internals

Why Frameworks Fail

Categorization and Classification with LLMs

Towards Technical CEOs

ActivityPub, Decentralization - Do Users Care?

The Anti-AI Movement

Customer Obsession is the Only Winning Strategy

Avoiding Footguns

Where AI Fits in Engineering Organizations

Myths About ORMs

Google Search's Death by a Thousand Cuts

The Circular Startup Economy

Personal Lessons From LLMs

Overcoming LLM Hallucinations

Monopolizing Useless Resources

Dark Patterns

Platform Engineering vs. DevOps

Model Evaluation is (Still) An Art

Clear, Simple, and Wrong

What Comes After Kubernetes?

No Feature is Free (Especially AI Ones)

Mixture of Experts: Is GPT-4 Just Eight Smaller Models?

The LLaMA Ecosystem

Merge Queues

Unforced Errors

The Low-Background Steel of AI

Why Does Every AI Cycle Start With Chat?

A Token Efficient Language for LLMs

Keep The Main Thing The Main Thing

ControlNet QR Codes

It’s Too Early To Call Winners in AI

The Last Innings of Web 2.0

Biography of an Idea

The Myth of The AI Infrastructure Phase

LLMs for Software Portability

ChatGPT Plugins Don't Have PMF

Levels of Autonomy in AI Agents

The Problems with Tokenization in LLMs

What Diffusion Models Can Teach Us About LLMs

Sequence and Version Control Models

Practical Dependency Management for Developers

The End of Public APIs

Faster Horses: AI Products That Companies Think They Want

Templating Magic Isn't Free

AI Means More Developers

Two Years of Daily Blogging

Prompt Engineering is Configuration Engineering

SEO Inside AI

A List of Things I Was Wrong About

The ChatGPT Plugin Specification

The Local Development Story

Twitter and the Idea Maze

The New Tax on Engineering Against the Grain

Piecewise

On Regulating AI

On Device AI?

A List of Leaked System Prompts

Intercloud Brokers

React LLM: Easily Run Models in the Browser with Headless Components

Context-Free Grammar Parsing With LLMs

Taylor Swift and Launch Cadence

StackOverflow/ChatGPT

Self-hosted Compilers and Bootstrapped AI

Unix Philosophy for AI

Why Declarative Build Systems Aren’t Popular

The New AI Moats

On-Demand Package Manager

Second-level Thinking

ReLLM: Exact Structure for Large Language Model Completions

llm.ts

The Python Family of Languages

WebGPU and the New OS

Applications of Category Theory

Implementing LLMs in the Browser

React Components as the API

Why Multi-Cloud Failed

Principle of Least Astonishment

Probabilistic Data Structures and LLMs

16 Lessons from the Tanenbaum–Torvalds Debates

Tanenbaum–Torvalds Debates, Part 1

The ptrace syscall

Building A Kernel From Scratch

Autonomous LLM Agents Are At Least 10 Years Out

The Peanut Butter Manifesto (2006)

Sandbox Your Prompts

Stochastic/Deterministic

Jevons Paradox and LLMs

Everyday Fourier Transforms

On The Shortness Of Life

Synthetic Data From Compilers

Foundational Models Are Not Enough

Imperative, Declarative, Interrogative, and Exclamative Interfaces

But / Therefore

A List of 1 Billion+ Parameter LLMs

No GPUs Before Product Market Fit

The Path Dependence of YAML Templates

Buyers in the Foundational Model Stack

Internet Magic Numbers

A High-level LLMOps Architecture

Reasons To Do a Startup

The Automation Frontier

Why Open-Source a Model?

Consumption Pricing Units in AI

Reflections on 10,000 Hours of DevOps

The AI Partnership Race

A Hacker's Guide to LLM Optimization

Kubernetes as a Dev Tool

Code, not Chat, in Generative AI

Distributed Systems and AI

Necessary Conditions for an App Store Monopoly

Remix Culture

Are Incumbents Accruing All The AI Value?

Model Arbitrage

Modeling Context Length vs. Information Retrieval Cost in LLMs

Framework-Defined Infrastructure

The Missing Semester of CS

The Value of Software Generalists

Foundational Models Are Commodities

On OpenAI's Kubernetes Cluster

Choosing the Right Model

Gell-Mann Amnesia

On Prompt Injection

Local LLaMA on a Mac M1

Automatic1111 and AI Aggregators

All the -Ops

Chain of Thought Paradigms in LLMs

Data Local Machine Learning

On React.js

Applications, Services, and Workflows Overload

Will The AI Stack Be Open Or Closed?

The Stoic Arrow

Product Windows

ChatML and the ChatGPT API

The Bundling of the Software Developer

Shells and Distribution

More Art than Science

Wrong Sequences For Startups

Commoditization of Large Language Models: Part 3

Taxonomy of Startup Risk

The Connectors/Loaders Trap

Erdős Vocabulary

Why DSLs Fail

Amdahl's Law and Optimization

Why Python Won't Be the Language of LLMs

Nix Is Fighting The Last War

Interfaces for Uncertainty

Why ChatGPT Needs AuthZ

Ask Yourself Dumb Questions

Carrying a Dozen Problems

Blurring Language Boundaries

LLM Ops, Part 1

What Won't Change

Multi-Model vs. Multi-Cloud

Should OSS Projects Have Telemetry?

Reactive Strategy

Docker Without Docker

Applications of Gall's Law

Composable Models

Stack Shift

If It Doesn’t Ship, It Doesn’t Exist

Optimal Stopping Theory

Future is Not Evenly Distributed

Zero Interest Rate Policy Phenomena

The Frontend Bundling Cycle

Type-Safe API Calls: tRPC vs. gRPC

Overview of GPT-as-a-Backend

Eisenhower Priority Matrix

What Doesn't Fit in Git

Cloud Services Ranked: Build vs. Buy

Workflow Engine Paradigms

Prompt Engineering Shouldn't Exist

GPT Lineage

How to Version an API

Cheap UIs

Cloud Spend Breakdown

Rebuilding in Rust

All Models Are Wrong (But Some Are Useful)

The Problem with Drag-n-Drop Infrastructure

Output vs. Outcome

Garbage In, Garbage out?

Winning Ugly and The Metagame

Minix and nanoGPT

Fully Managed Infrastructure

Lessons from the Last AI Cycle

White Space

Fine-Tuning an OCR Model

Semi-Organic Growth

What's Next After NextJS

RPA: The Human Interface

The Implications of More Developers

Counterfactuals

A New ML Stack

Top Posts of 2022