Insights

Thinking in Public

Perspectives on fractional leadership, AI infrastructure, and operational execution from operators who build.

01May 29, 20266 min read

vLLM Adds Native HIP W4A16 Kernel for AMD ROCm, Boosting Local LLM Inference

The recent merge of a native HIP W4A16 kernel into vLLM marks a tangible step forward for AMD‑based AI workloads, delivering measurable throughput gains that narrow the performance gap with proprietary alternatives. This contribution, high…

local inferenceopen modelsself-hostingAI hardwareoff the thumb

02May 28, 20265 min read

Five Frontier LLMs Disagree on Two‑Thirds of Real‑World Fact‑Check Claims

A recent study evaluating five leading frontier language models on a set of 1,000 real‑world fact‑check statements found that the models disagree on 67 % of the claims [[3]](https://lenz.io/research/llm-disagreement). The disagreement rate…

local inferenceopen modelsself-hostingAI hardwareoff the thumb

03May 27, 20265 min read

260K-parameter LLM Runs on an Emulated 90s CPU Inside an 18‑Year‑Old RTOS

A recent experiment shows that a language model with only 260 000 parameters can generate text inside a JavaScript emulator of a Freescale ColdFire MCF5307 CPU, all while operating under an RTOS that was written in 2008【12】. The achievemen…

local inferenceopen modelsself-hostingAI hardwareoff the thumb

04May 26, 20263 min read

Going Off the Thumb: Why Local Inference and Deterministic Tools Beat Cloud AI

The recent exposure of Microsoft Copilot Cowork’s ability to exfiltrate files through uncontrolled email agents shows how cloud‑hosted AI can become a liability rather than an asset【1】. When an agent can send messages to a user’s own inbox…

local inferenceopen modelsself-hostingAI hardwareoff the thumb

05April 14, 20269 min read

On-Premise LLM Deployments for Small Businesses

Every week, founders paste sensitive client data into cloud AI tools without thinking about where it goes. Local AI deployment has crossed a practical threshold, and the window to build this infrastructure proactively is open now.

AI InfrastructureData PrivacyOperational StrategySmall Business TechLLM Deployment

06April 13, 20268 min read

Fractional COO Engagements: How to Structure Them

Most founders buy fractional COO engagements like consulting retainers and get consulting retainer results. Here is what the engagement actually is, what conditions make it work, and what failure modes to name before you sign anything.

fractional leadershipoperationsgrowth-stageCOOorganizational design

07April 13, 202611 min read

Open Source GTM: What Happens After GitHub Virality

A repo going viral on GitHub is not a success event. It is a stress test of infrastructure most founders have not built yet. Here is what the operational reality of open source GTM looks like in 2025.

open sourcego-to-marketdeveloper toolsstartup strategyAIGitHubthinking in public

08April 12, 20262 min read

Fractional Leadership in the Age of AI

The economics of full-time C-suite hiring no longer make sense for most growth-stage companies. AI tooling is accelerating this shift.

fractional-leadershipaiadvisory