01/AI / ML SOLUTIONS·BUILD. SHIP. OWN.

I ship production AI. And I've run the enterprise that lives with it.

Most people with opinions on AI have never shipped a model. Most executives who ran technology at scale stopped writing code years ago. I do both: 18+ years owning enterprise P&L, ERP modernizations, and multi-site cloud migrations, and now building production AI, including local LLMs, agentic orchestration, RAG, and the pipelines that feed real systems. That intersection is the whole point.

Start a build → · See the Stampede build →

OPERATOR TELEMETRY / LIVE

ENTERPRISE IT: 18+ YR · VP / Director of Technology
AI SHIPPED: 3 MIT · Vision · Team-X · Agent-X
TEST COVERAGE: 2,300+ PASS · Team-X passing tests
PIPELINE LIVE: 308 SKU · Dual-model · Cloudflare


02/CASE STUDY·STAMPEDE DISTRIBUTION

A two-model AI pipeline, shipped to production.

Stampede Distribution sells industrial safety gear. Their product data arrived raw from an ERP and from 26 different vendor sites, no two formatted alike, every description duplicated across the web. I built a Cloudflare-native enrichment pipeline that reads the source, extracts structured data, and rewrites every listing for search uniqueness, with hard rules against inventing a single compliance claim. It runs on owned infrastructure. No third-party AI bill, no data leaving the account.

PIPELINE / 2-STAGE ENRICHMENT

IN: ERP + 26 vendor adapters (Shopify · WooCommerce · BigCommerce)
GEMMA 3 · QWEN 2.5
DB: D1 PIM + R2 assets (sharp WebP · 1200w + 400w)
OUT: Live products page (Fuse.js faceted search)

Stage 1 · Extraction

Gemma 3 12B, with a Qwen 2.5 Coder 32B fallback. Reads up to 25KB of cleaned vendor HTML, emits structured JSON: title, specs, features, classified documents (MSDS, spec sheets, compliance PDFs), part numbers, ANSI standards. A quality gate refuses to write anything if the model did not return real content.
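A minimal sketch of the Stage 1 flow described above, written in Python for readability (the text does not say what language the pipeline uses). The model identifiers, the `call_model` callable, and the exact gate criteria in `has_real_content` are assumptions made for illustration; only the 25KB cap, the primary/fallback order, and the refuse-to-write behavior come from the text.

```python
import json
from typing import Callable, Optional

MAX_HTML_BYTES = 25 * 1024  # "up to 25KB of cleaned vendor HTML"

# Hypothetical model identifiers; the real IDs are not given in the text.
PRIMARY_MODEL = "gemma-3-12b"
FALLBACK_MODEL = "qwen-2.5-coder-32b"


def truncate_html(html: str, limit: int = MAX_HTML_BYTES) -> str:
    """Cap the input at `limit` bytes without splitting a UTF-8 character."""
    return html.encode("utf-8")[:limit].decode("utf-8", errors="ignore")


def has_real_content(record: dict) -> bool:
    """Quality gate (assumed criteria): a non-empty title plus specs or features."""
    title = record.get("title")
    if not isinstance(title, str) or not title.strip():
        return False
    return bool(record.get("specs")) or bool(record.get("features"))


def extract(html: str, call_model: Callable[[str, str], str]) -> Optional[dict]:
    """Try the primary model, then the fallback. Return None if neither passes the gate."""
    payload = truncate_html(html)
    for model in (PRIMARY_MODEL, FALLBACK_MODEL):
        try:
            record = json.loads(call_model(model, payload))
        except (json.JSONDecodeError, TimeoutError):
            continue  # unparseable output or timeout: try the next model
        if isinstance(record, dict) and has_real_content(record):
            return record
    return None  # the caller writes nothing in this case
```

Returning `None` instead of a partial record keeps the "refuse to write" decision in one place: the caller only persists a result when one exists.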

Stage 2 · SEO rewrite

The same dual-model pair rewrites vendor-verbatim copy for uniqueness at temperature 0.1 while preserving every technical fact. Hard rule: never invent an ANSI, OSHA, EPA, or CE claim. On a parse failure it degrades to the vendor copy rather than guessing.
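The degrade-on-parse-failure behavior can be sketched roughly as follows. This is an illustration under assumptions, not the pipeline's code: the `call_model` callable, the `description` response field, and the `source` marker are hypothetical names. Only the temperature of 0.1 and the fallback to vendor copy come from the text.

```python
import json
from typing import Callable

REWRITE_TEMPERATURE = 0.1  # value stated in the text


def rewrite_description(vendor_copy: str, call_model: Callable[[str, float], str]) -> dict:
    """Return rewritten copy, or the vendor copy unchanged if the output cannot be parsed."""
    try:
        parsed = json.loads(call_model(vendor_copy, REWRITE_TEMPERATURE))
        text = parsed["description"]  # hypothetical response field
        if not isinstance(text, str) or not text.strip():
            raise ValueError("empty rewrite")
    except (ValueError, KeyError, TypeError):
        # json.JSONDecodeError is a ValueError, so malformed JSON lands here too.
        return {"description": vendor_copy, "source": "vendor"}
    return {"description": text.strip(), "source": "rewrite"}
```

Recording which path produced the text (the assumed `source` key) is one simple way to feed an audit trail.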

SKUs: 308 · Pilot catalog
Vendor adapters: 26 · 4 platform types
LLM calls / run: 616 · 2 per SKU
AI vendor bill: $0 · Owned Workers AI

What made it production, not a demo

Dual-model fallback with a documented swap history: dropped Llama 3.3 70B for timeouts and Llama 4 Scout for parse failures.
Idempotent re-runs guarded by an enriched_at timestamp. Safe to re-run, safe to force.
No-hallucinated-compliance rule enforced in the prompt. Regulatory claims are never fabricated.
Graceful degradation to vendor copy on failure, plus a full enrichment audit trail.
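The idempotent re-run guard from the list above might look something like this sketch. The `enriched_at` field name comes from the text; the in-memory product dicts, the `enrich` callable, and the audit tuple format are assumptions standing in for the real database and audit trail.

```python
from datetime import datetime, timezone
from typing import Callable, Iterable


def should_enrich(product: dict, force: bool = False) -> bool:
    """Skip products that already carry an enriched_at timestamp unless forced."""
    return force or product.get("enriched_at") is None


def enrich_catalog(
    products: Iterable[dict],
    enrich: Callable[[dict], dict],
    force: bool = False,
) -> list:
    """Enrich each eligible product in place and return a simple audit log."""
    audit = []
    for product in products:
        if not should_enrich(product, force):
            audit.append((product["sku"], "skipped"))
            continue
        product.update(enrich(product))
        # Stamp only after enrichment succeeds, so a failed run is retried next time.
        product["enriched_at"] = datetime.now(timezone.utc).isoformat()
        audit.append((product["sku"], "enriched"))
    return audit
```

Running it twice over the same catalog enriches each product once; passing `force=True` re-enriches everything.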

03/WHAT I BUILD·FOUR ENGAGEMENTS

Four ways to put AI to work.

01 / Integration

Production AI integration

RAG, agentic orchestration, and data-enrichment pipelines wired into systems you already run. Built to ship, measured with eval harnesses, not slideware.

Proof: Stampede pipeline + Team-X

02 / Sovereignty

Local-first, sovereign AI

Own the model and the data. On-prem or in your own cloud account, no third-party AI bill, no data leaving your perimeter, no vendor that can change the terms on you.

Proof: Vision Studio + Agent-X

03 / Strategy

AI strategy for operators

Where AI actually pays versus where it is theater. Honest scoping from someone who has owned the P&L and the SLA, plus the benchmark harnesses that keep a system honest after launch.

Proof: 18+ years operating at scale

04 / Cost

Cloud repatriation and cost

The math on what you actually spend renting compute, and a path back to owned infrastructure where it pays. Real break-even analysis, not a migration for its own sake.

Proof: The Cloud Repatriation Reckoning

04/RUN THE CODE·MIT · OPEN SOURCE

Don't take my word. Run it.

Three production AI systems, all MIT-licensed and public. Fork them, read them, run them locally.

Vision Studio

Python / PyTorch backend for local image and video generation, CUDA-accelerated. No API keys, no cloud bill.

Site → · GitHub →

Team-X

Multi-agent LLM orchestration with RAG and MCP tool-calling, backed by 2,300+ passing tests. Run a company, not a prompt.

Site → · GitHub →

Agent-X

A .NET, local-first RAG document-intelligence app that keeps your data on your machine.

Site → · GitHub →

05/POINT OF VIEW·OWN, DON'T RENT

AI is infrastructure to own, not rent.

When you rent your AI from a closed provider, you rent the terms too. They decide what counts as a violation, they change the pricing, and your data lives in someone else's stack. The systems I build run on infrastructure you own. I write about why, with the numbers behind it.

POSITION PAPERS / POV

POV-01 · 2026.05.31 · Technology Trends · 10 MIN READ
Your AI Didn't Get Smarter. It Got Disciplined.
The headlines say AI got smarter. From the inside, that is the wrong word: not raw intelligence, but precision, and the discipline to catch its own mistakes before you ever see them.

POV-02 · 2026.05.24 · IT Strategy · 10 MIN READ
The Cloud Repatriation Reckoning
37signals cut roughly $2M a year, Dropbox saved $75M, and a16z pegged the trapped value at $100B. At scale the cloud unit economics invert, and the operators moving back to owned hardware are the signal everyone else is ignoring.

POV-03 · 2026.05.17 · Technology Trends · 10 MIN READ
Run an AI Company. Don't Rent One.
Microsoft cancelled Claude Code licenses, Anthropic killed volume discounts, Salesforce committed $300M to tokens. Sovereign, open-source, local-first AI is the only stack with a credible future.

06/FAQ·STRAIGHT ANSWERS

Can you build AI that runs on our own infrastructure?

Yes. That is the default. The Stampede pipeline runs entirely on the client's own Cloudflare account: their models, their database, their assets. On-prem and your-own-cloud deployments are both standard. No data leaves your perimeter and there is no per-call bill to a third-party AI vendor.

How do you keep a model from hallucinating in a regulated workflow?

Hard rules in the prompt plus a quality gate in code. In the Stampede build the model is forbidden from inventing any ANSI, OSHA, EPA, or CE claim, and the pipeline refuses to write a result that lacks real extracted content. On a parse failure it degrades to the verified source rather than guessing.
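The answer above describes a prompt rule plus a content gate. As one possible complement, a code-level check could flag any regulatory body named in the generated copy that never appears in the source. This check is hypothetical and is not described as part of the Stampede build; the function names are illustrative.

```python
import re

# Case-sensitive on purpose, so "CE" does not match ordinary lowercase words.
REGULATED_TERMS = re.compile(r"\b(ANSI|OSHA|EPA|CE)\b")


def invented_claims(source_text: str, generated_text: str) -> set:
    """Regulatory bodies named in the generated text but absent from the source."""
    in_output = set(REGULATED_TERMS.findall(generated_text))
    in_source = set(REGULATED_TERMS.findall(source_text))
    return in_output - in_source


def safe_copy(source_text: str, generated_text: str) -> str:
    """Fall back to the source text if the rewrite names a body the source never did."""
    if invented_claims(source_text, generated_text):
        return source_text
    return generated_text
```

This is deliberately coarse: it catches a newly introduced "OSHA" but not a changed standard number after a legitimate "ANSI", so it would sit alongside the prompt rule rather than replace it.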

What does production AI actually cost versus a SaaS subscription?

It depends on volume, but owned inference removes the per-seat and per-call meter entirely. The Stampede pilot enriched 308 products across 616 model calls at no third-party AI cost. For steady workloads, owning the compute usually beats renting it past a break-even point I will calculate for your case.
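The arithmetic behind that answer can be sketched as below. The call count (308 SKUs at 2 calls each) comes from the pilot figures; the cost model itself, a fixed monthly cost for owned compute against a per-call price for rented inference, is a simplifying assumption, and every dollar figure passed in would be a placeholder for a client's real numbers.

```python
import math


def calls_per_run(skus: int, calls_per_sku: int) -> int:
    """Total model calls for one full enrichment run."""
    return skus * calls_per_sku


def break_even_calls(
    owned_fixed_monthly: float,
    owned_cost_per_call: float,
    rented_cost_per_call: float,
) -> float:
    """Monthly call volume above which owned compute is cheaper than rented.

    Assumed model: owned = fixed + owned_per_call * n, rented = rented_per_call * n.
    Returns infinity when renting is never more expensive per call.
    """
    margin = rented_cost_per_call - owned_cost_per_call
    if margin <= 0:
        return math.inf
    return owned_fixed_monthly / margin


pilot_calls = calls_per_run(308, 2)  # 616, matching the pilot figure
```

Comparing `pilot_calls` (scaled to a monthly run cadence) against `break_even_calls(...)` for a given workload shows which side of the line it falls on.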

Do you only do greenfield AI, or integration with existing systems?

Mostly integration. The hard part of enterprise AI is rarely the model; it is the data and the systems around it. I have spent 18+ years inside ERPs, migrations, and infrastructure, so the work plugs into what you already run instead of replacing it.

Who actually does the work?

I do. The person who scopes the engagement is the person who writes the code and ships it. No handoff to junior staff, no account managers in between.

07/ENGAGE

Let's build something you own.

No theater. No rented stack you cannot inspect. A direct conversation about what to build, what it costs, and how to ship it.

Start a build · Email directly