Frank Thomas

Frank ThomasNotes on building software, running engineering teams, and the parts that overlap.https://frankthomas.dev/Rate Limits, Retries, and the Hidden Accuracy Killer in LLM Pipelineshttps://frankthomas.dev/blog/rate-limits-hidden-accuracy-killer/https://frankthomas.dev/blog/rate-limits-hidden-accuracy-killer/We spent weeks investigating a 6% accuracy variance in our document extraction benchmarks. The root cause wasn't the model, the prompts, or the routing.Sat, 16 May 2026 00:00:00 GMTWhy Heuristic Routing Fails on Long Documentshttps://frankthomas.dev/blog/why-heuristic-routing-fails-long-documents/https://frankthomas.dev/blog/why-heuristic-routing-fails-long-documents/When a 120-page insurance policy goes through document extraction, the AI only sees fragments. How those fragments are selected determines everything.Thu, 14 May 2026 00:00:00 GMTBenchmarking Document Extraction: How We Measure Accuracy Across 653 Documentshttps://frankthomas.dev/blog/benchmarking-document-extraction/https://frankthomas.dev/blog/benchmarking-document-extraction/Every document extraction vendor claims 95%+ accuracy. None of them publish how they measure it.Sun, 10 May 2026 00:00:00 GMTSchema-Driven Extraction: Configuration Over Code for Document AIhttps://frankthomas.dev/blog/schema-driven-extraction/https://frankthomas.dev/blog/schema-driven-extraction/Most document extraction relies on prompt engineering. Schema-driven extraction replaces hope with a contract.Wed, 06 May 2026 00:00:00 GMTDocuments are a data source. Treat them like one.https://frankthomas.dev/blog/documents-as-a-data-source/https://frankthomas.dev/blog/documents-as-a-data-source/Most extraction projects fail because teams treat the document as the artifact instead of the schema.Wed, 22 Apr 2026 00:00:00 GMTDocs-first development with AI agentshttps://frankthomas.dev/blog/docs-first-development-with-ai-agents/https://frankthomas.dev/blog/docs-first-development-with-ai-agents/Most teams using AI agents do code-first, doc-later. We do the opposite, and it's working better than expected.Sat, 18 Apr 2026 00:00:00 GMTWe built for flexibility. It caused chaos.https://frankthomas.dev/blog/built-for-flexibility-caused-chaos/https://frankthomas.dev/blog/built-for-flexibility-caused-chaos/Notes from a v1 to v2 rewrite — what happens when you build a system that can do anything and nobody can explain what it does.Wed, 08 Apr 2026 00:00:00 GMTSmall teams and the monolithhttps://frankthomas.dev/blog/small-teams-and-the-monolith/https://frankthomas.dev/blog/small-teams-and-the-monolith/An apology to my younger self for splitting an eight-person team across nine services.Thu, 12 Mar 2026 00:00:00 GMTThe document extraction pipeline: we tried everythinghttps://frankthomas.dev/blog/the-document-extraction-pipeline-we-tried-everything/https://frankthomas.dev/blog/the-document-extraction-pipeline-we-tried-everything/AWS Textract, Google Document AI, RAG pipelines, LlamaIndex — a tour of everything we used before deciding to build our own.Fri, 20 Feb 2026 00:00:00 GMTWriting more in 2026https://frankthomas.dev/blog/writing-more-in-2026/https://frankthomas.dev/blog/writing-more-in-2026/New year, new excuse. I'm going to try journaling in public about the work.Tue, 06 Jan 2026 00:00:00 GMT