CONSENT CONSOLE/MK-V
DEFAULT

Telemetry consent. Operator-grade.

We capture only the signals we need to keep the site running, understand which content earns reads, and credit referral partners. You decide what stays on. Default is strict opt-in.

Privacy Policy →Terms →
JURISDICTIONOutside regulated jurisdictionsFRAMEWORKNo regional opt-in framework applied

COMPLIANCE FRAMEWORKS RECOGNIZED

GDPREU / EEA
CCPACalifornia
LGPDBrazil
PIPEDACanada
ePrivacyEU Directive
Strategia-X
L
-6dB
C
-1dB
R
-3dB
01/AI / ML SOLUTIONS·BUILD. SHIP. OWN.

I ship production AI.And I've run the enterprise that lives with it.

Most opinions on AI have never shipped a model. Most executives who ran technology at scale stopped writing code years ago. I do both. 18+ years owning enterprise P&L, ERP modernizations, and multi-site cloud migrations, now building production AI: local LLMs, agentic orchestration, RAG, and the pipelines that feed real systems. That intersection is the whole point.

Enterprise IT18YRS+VP / Director of Technology
AI systems shipped3MITVision Studio · Team-X · Agent-X
Test coverage2,300+Team-X passing tests
Pipeline live308SKUDual-model · Cloudflare
02 / CASE STUDY

A two-model AI pipeline, shipped to production.

Stampede Distribution sells industrial safety gear. Their product data arrived raw from an ERP and from 26 different vendor sites, no two formatted alike, every description duplicated across the web. I built a Cloudflare-native enrichment pipeline that reads the source, extracts structured data, and rewrites every listing for search uniqueness, with hard rules against inventing a single compliance claim. It runs on owned infrastructure. No third-party AI bill, no data leaving the account.

PIPELINE / 2-STAGE ENRICHMENT
INERP + 26 vendor adaptersShopify · WooCommerce · BigCommerce
01
GEMMA 3
02
QWEN 2.5
DBD1 PIM + R2 assetssharp WebP · 1200w + 400w
OUTLive products pageFuse.js faceted search

Stage 1 · Extraction

Gemma 3 12B, with a Qwen 2.5 Coder 32B fallback. Reads up to 25KB of cleaned vendor HTML, emits structured JSON: title, specs, features, classified documents (MSDS, spec sheets, compliance PDFs), part numbers, ANSI standards. A quality gate refuses to write anything if the model did not return real content.

Stage 2 · SEO rewrite

The same dual model rewrites vendor-verbatim copy for uniqueness at temperature 0.1, while preserving every technical fact. Hard rule: never invent an ANSI, OSHA, EPA, or CE claim. On a parse failure it degrades to the vendor copy rather than guessing.

SKUs308Pilot catalog
Vendor adapters264 platform types
LLM calls / run6162 per SKU
AI vendor bill$0Owned Workers AI
What made it production, not a demo
  • Dual-model fallback with a documented swap history: dropped Llama 3.3 70B for timeouts and Llama 4 Scout for parse failures.
  • Idempotent re-runs guarded by an enriched_at timestamp. Safe to re-run, safe to force.
  • No-hallucinated-compliance rule enforced in the prompt. Regulatory claims are never fabricated.
  • Graceful degradation to vendor copy on failure, plus a full enrichment audit trail.
03 / WHAT I BUILD

Four ways to put AI to work.

01 / Integration

Production AI integration

RAG, agentic orchestration, and data-enrichment pipelines wired into systems you already run. Built to ship, measured with eval harnesses, not slideware.

Proof: Stampede pipeline + Team-X
02 / Sovereignty

Local-first, sovereign AI

Own the model and the data. On-prem or in your own cloud account, no third-party AI bill, no data leaving your perimeter, no vendor that can change the terms on you.

Proof: Vision Studio + Agent-X
03 / Strategy

AI strategy for operators

Where AI actually pays versus where it is theater. Honest scoping from someone who has owned the P&L and the SLA, plus the benchmark harnesses that keep a system honest after launch.

Proof: 18+ years operating at scale
04 / Cost

Cloud repatriation and cost

The math on what you actually spend renting compute, and a path back to owned infrastructure where it pays. Real break-even analysis, not a migration for its own sake.

Proof: The Cloud Repatriation Reckoning
04 / RUN THE CODE

Don't take my word. Run it.

Three production AI systems, all MIT-licensed and public. Fork them, read them, run them locally.

Vision Studio

Python / PyTorch backend for local image and video generation, CUDA-accelerated. No API keys, no cloud bill.

Team-X

Multi-agent LLM orchestration with RAG and MCP tool-calling, backed by 2,300+ passing tests. Run a company, not a prompt.

Agent-X

A .NET, local-first RAG document-intelligence app that keeps your data on your machine.

05 / POINT OF VIEW

AI is infrastructure to own, not rent.

When you rent your AI from a closed provider, you rent the terms too. They decide what counts as a violation, they change the pricing, and your data lives in someone else's stack. The systems I build run on infrastructure you own. I write about why, with the numbers behind it.

06 / FAQ

Can you build AI that runs on our own infrastructure?

Yes. That is the default. The Stampede pipeline runs entirely on the client's own Cloudflare account: their models, their database, their assets. On-prem and your-own-cloud deployments are both standard. No data leaves your perimeter and there is no per-call bill to a third-party AI vendor.

How do you keep a model from hallucinating in a regulated workflow?

Hard rules in the prompt plus a quality gate in code. In the Stampede build the model is forbidden from inventing any ANSI, OSHA, EPA, or CE claim, and the pipeline refuses to write a result that lacks real extracted content. On a parse failure it degrades to the verified source rather than guessing.

What does production AI actually cost versus a SaaS subscription?

It depends on volume, but owned inference removes the per-seat and per-call meter entirely. The Stampede pilot enriched 308 products across 616 model calls at no third-party AI cost. For steady workloads, owning the compute usually beats renting it past a break-even point I will calculate for your case.

Do you only do greenfield AI, or integration with existing systems?

Mostly integration. The hard part of enterprise AI is rarely the model, it is the data and the systems around it. I have spent 18+ years inside ERPs, migrations, and infrastructure, so the work plugs into what you already run instead of replacing it.

Who actually does the work?

I do. The person who scopes the engagement is the person who writes the code and ships it. No handoff to junior staff, no account managers in between.

06/ENGAGE

Let's build somethingyou own.

No theater. No rented stack you cannot inspect. A direct conversation about what to build, what it costs, and how to ship it.