Principal AI Engineer · Enterprise Advisory · 30 Years

Strategy consulting meets
hands-on AI engineering.

I help enterprise organizations navigate complex technology problems — from cloud migration strategy and AI governance to full-stack agentic systems — and I don't stop at the recommendation. I build the thing, deploy it, and operate it in production.

Thirty years of infrastructure experience spans on-premises datacenters, virtualization, networking, storage, enterprise software, and cloud platforms — long before AI entered the picture. That foundation is what makes the AI work grounded: every architecture decision is evaluated against operational reality, not just capability demos.

30 years of infrastructure — now applied to AI

The 30 years isn't a marketing number — it covers the full arc of enterprise infrastructure: on-premises datacenters, storage area networks, virtualization with VMware and Hyper-V, Windows and Linux server environments, enterprise networking, and the first wave of cloud migration starting in the mid-2010s. AI and cloud-native architectures are the most recent chapter, not the whole story.

Most of that career has been in enterprise technology consulting — helping large organizations answer the hardest infrastructure question: what do we actually do with what we have? Cloud migration strategy, application rationalization, vendor selection, build-vs-buy analysis, and the organizational change that makes any of it stick. That consulting background is the lens through which I approach AI: not "what's the most technically interesting system" but "what solves the actual problem, at what cost, with what governance, and who runs it after I leave."

Since 2024 I've gone deep on applied AI — as a builder, not an observer. The Park Whisperer is my personal production AI platform: real-time theme park intelligence running 24/7 across Azure, GCP, and AWS, processing satellite weather data, training ML models, running a multi-agent knowledge RAG, and publishing daily AI-generated content to three social platforms — all operated solo, under $100/month. It's the environment where I stress-test every architectural pattern before recommending it to clients.

How I think about AI systems

⚙️
Production first, prototype second
I design for deployment from day one — not for demos. Every architecture decision is evaluated against operational questions: Who runs it when it breaks? How is it monitored? What does it cost at steady state? What happens when a model call fails?
📊
Quantified decisions
I don't migrate to a new technology without first measuring the cost of the current one. The Data Pipeline Evolution case study documents exactly how I concluded Cosmos DB was adding $394/month with zero benefit — and what the migration back to PostgreSQL cost. Numbers drive decisions.
🔗
Full-stack, not just the model
AI systems are icebergs. The model or agent is the visible 10%. The other 90% is the data pipeline feeding it, the infrastructure running it, the monitoring catching its failures, and the output schema that downstream consumers depend on. I build the whole iceberg.
🛡️
Governance as architecture
For enterprise AI — especially in regulated industries — governance isn't a checkbox after the fact. It's a constraint that shapes the architecture: provenance-by-construction, auditable reasoning chains, human approval gates at the right decision points. I've shipped systems that a compliance officer can review.
💰
Cost discipline
Token economics is an architecture decision, not a runtime concern. I design prompts, context windows, caching layers, and model selection to reflect an explicit cost-vs-value function: what capability does this token spend actually buy, and is that capability worth the margin it consumes? The same discipline applies to infrastructure — every cloud resource has a utilization curve and a retirement threshold.
🔄
Iterate to the right answer
Requirements documents don't survive contact with real users. I build tight feedback loops directly with the business stakeholders who will operate the system — not just review it. Each iteration is evaluated against what they actually need to decide, not what the spec said. The operational matrix gets finalized in production, with real data, with the people whose workflow it changes.

What I build

Agentic AI Systems
Multi-agent · Tool-use · RAG
Multi-agent pipelines with tool libraries, retrieval-augmented generation over structured and unstructured data, SLM-based intent routing, and multi-model collaboration patterns (cheap model for synthesis, capable model for output). I've built agentic systems on AWS Bedrock, Azure AI Foundry, Azure Functions, and n8n.
Cloud migration intelligence Park Agent Chat Seller Intelligence Knowledge RAG
ML Pipelines & Forecasting
scikit-learn · Prophet · LightGBM · BigQuery
End-to-end ML systems: feature engineering, training pipelines on BigQuery data warehouses, model serialization to GCS, and inference services that serve predictions every 20 minutes. Time-series forecasting for operational systems where latency and cost matter more than marginal accuracy gains.
Weather ML (6 models, 44K obs) Ride Whisperer forecasting Venue impact prediction
AI-Generated IaC & Deployment
Kiro · Bicep · CloudFormation · Terraform
Using AI-powered development tools (Kiro, GitHub Copilot) to generate production-grade infrastructure from specification — Bicep modules, CloudFormation stacks, Cloud Run job definitions, Container App configurations. I've deployed the same application logic to AWS and Azure via AI-generated IaC with zero forked business code.
EARE AWS + Azure Advisory AI AWS + Azure Gov-Process SaaS
Data Engineering & ETL
PostgreSQL · BigQuery · Azure Cosmos DB · pgvector
Real-time data pipelines that run on tight cadences (5–10 minutes), upsert semantics with deduplication, event-driven processing, and warehouse-scale historical data for ML training. Experience across three cloud data platforms and a full cycle of architecture evolution — including the hard lesson of when not to use a NoSQL document store.
Park Data Ingest ETL GOES-18 Lightning Ingestion METAR/Radiosonde pipeline Data Pipeline Evolution
AI Content Operations
Automated content factory · Video · Multi-platform
Scheduled AI pipelines that produce and publish social media content, blog posts, and short-form video with no human touch. ElevenLabs TTS with per-character timestamp alignment, FFmpeg video encoding, word-sync caption generation, and direct API publishing to Instagram, TikTok, and YouTube Shorts.
SLM Content Pipeline AI Video Production Team Multi-Model Content
AI Governance & Compliance
ISO-42001 · NIST AI RMF · SDLC gates
Enterprise AI governance platforms that scan codebases, generate compliance artifacts across 10 regulatory frameworks, and enforce gate-based approval workflows. For clients in regulated industries (healthcare, financial services) where AI deployments require documented provenance and auditable decision chains.
Advisory AI Governance Platform Provenance-by-construction AI Gov-Process SaaS

Stack breadth across the work

Cloud Platforms
Microsoft AzureFunctions · Container Apps · AI Foundry · Cosmos DB
Google Cloud (GCP)BigQuery · Cloud Run · GCS · Cloud Scheduler
Amazon Web ServicesBedrock · Lambda · DynamoDB · CloudFormation
AI / ML
Claude (Anthropic)Sonnet / Haiku · via Bedrock + Azure
OpenAI / GPT-4oEmbeddings · Assistants API · Azure OpenAI
Phi-3 / Phi-4 MiniSLM intent routing · fine-tuning
scikit-learnRandomForest · GradientBoosting · 6 models
LightGBM / ProphetTime-series forecasting
LangChainAgentic loops · tool orchestration
ElevenLabsTTS · timestamp-aligned audio
Data & Storage
PostgreSQL + pgvectorHNSW index · hybrid BM25+vector search
Google BigQueryData warehouse · ML training · ST_DISTANCE
Azure Cosmos DBNoSQL · vector search · Gremlin graph
RedisCaching · pub/sub · session state
Azure Blob / GCSML models · JSON API · video assets
Infrastructure & Deployment
Bicep / ARMAzure IaC — AI-generated via Kiro
CloudFormationAWS IaC — AI-generated via Kiro
TerraformGCP infrastructure · BigQuery · Cloud Run
Docker / Container AppsContainerized jobs · multi-stage builds
GitHub ActionsCI/CD · container build/push · deploy
n8n / CaddyWorkflow orchestration · reverse proxy
Languages & Frameworks
PythonPrimary — ML · pipelines · agents · ETL
TypeScript / Node.jsGovernance platform · SaaS API · Fastify
SQL (BigQuery + PG)Complex views · window functions · spatial
FFmpeg / PillowVideo encoding · image compositing
FastAPICustom microservices · agent backends
Data Sources & APIs
NOAA GOES-18 GLMSatellite lightning · NetCDF · L2 Flash
FAA METAR / aviationweather.govSurface weather obs · 17 stations
NWS / SPC APIsActive alerts · categorical outlook
UWyo Radiosonde ArchiveUpper-air soundings · CAPE/LI/K-Index
ThemeParks.wiki API286 WDW entities · live wait data

Scale across the body of work

24+
Case studies documented
3
Cloud platforms deployed to
44K+
ML training observations
1.6M+
RAG knowledge records
286
Park entities tracked
1,053
Servers analyzed (migration)
Token economics as a design constraint

Before a model is called, I've already decided: what context is strictly necessary, what can be cached or pre-computed, which model tier matches the actual reasoning demand, and what value the output delivers per dollar spent. That trade-off is explicit in the architecture — not discovered after the bill arrives.

Model selection — match reasoning tier to task complexityClaude Haiku vs. Sonnet vs. Opus is a cost decision
Context window discipline — minimal sufficient context, not maximum availableSmaller prompts = lower latency + lower cost
Caching & pre-computation — static reasoning done once, not per-requestRAG + Cosmos reduces repeated inference
Infrastructure right-sizing — measure utilization before scaling, retire before it accumulates$394/mo Cosmos → $0 after architecture review

What an engagement looks like