Fractional AI CTO / Chief AI Architect

I'm the fractional AI CTO who still writes the code.

I build the harnesses, code-intelligence graphs, voice pipelines, and hybrid retrieval that turn AI from a demo into a system you can run in production.

30+ Startups founded
35+ Years shipping code
4 Production AI systems below
See the code
Writing / New essay
The Manager as Compiler

The Manager as Compiler

Running a mixed human–AI team without drowning in slop

Six months into serious AI adoption, output is up and the manager is more tired than ever. The problem is control systems, not tooling: cheap, unverified output is flooding a review pipeline that was never designed to filter it.

Read the article

Code harnesses & agentic coding

Steve builds the harnesses that turn AI coding agents from impressive demos into systems you can trust in a real codebase — quality gates, sandboxes, and orchestration he writes himself.

An edit-compile-run-debug loop traces across a live code graph: a node changes, the build fans out, tests fire, a failure lights up, and the agent patches and re-runs — the inner loop of agentic coding, made legible.
See how it works

AI Cybersecurity: Code Property Graphs

Steve builds code property graphs that let AI reason about software the way a senior engineer does — following control, data, and call flow across a whole codebase instead of one file at a time.

A code property graph assembles node by node — functions, variables, and calls — then a taint path lights up and threads from an untrusted source through the graph to a sensitive sink, the way a CPG exposes an attack path.
See how it works

Voice AI

Steve builds low-latency voice AI end to end — streaming audio in, transcription and understanding out — fast enough to feel like conversation rather than a form you talk at.

A live sales call, annotated in real time: the pipeline transcribes the client–salesperson conversation and categorizes each exchange against four different sales strategies as the call unfolds. The annotator also lets the user mark key parts of the call by hand — those annotations feed back in to train the AI.
See how it works

Hybrid search at scale

Steve builds hybrid retrieval that fuses semantic vector search with structured filtering, so AI answers stay both relevant and precise on large, messy document sets — not one or the other.

Standard · single model
Advanced · open-weights SOTA frontier
One query forks two ways: a vector path drifts through an embedding cloud to nearest neighbors while a structured-filter path snaps a metadata grid down to qualifying rows; the two streams converge and rerank into a single ordered result.
See how it works

What else
I've shipped

Thirty-five years of building under real consequences — banks, AWS, an automated hedge fund, pioneering DevOps in 2006. The proof, not the pitch.

AI Scrummaster An agent that runs the connective tissue of delivery so engineering teams spend their time writing code, not status updates. GCoder A graph-native coding agent that models code, runtime, infra, and security as one queryable SystemGraph — localized subgraph retrieval over flat whole-repo context. HyperCoder An autonomous AI coding agent that plans, edits, runs, and self-corrects across a real project rather than answering one prompt at a time. IronBox A hardened sandbox that lets AI agents run, compile, and test untrusted code without putting the host or the codebase at risk. Heimdall Security Suite A security suite that watches a system end to end, surfacing the signal that matters instead of another wall of alerts. Cartographer Renders multi-level-of-detail maps of cloud and code structure straight from source and infrastructure, so large systems stay legible. McWiki Turns a repository into a navigable, queryable knowledge graph an agent can actually traverse instead of grepping blind. ShipGen An AI ship designer that turns intent into shippable, tested output instead of a draft someone still has to finish. Task Runner A workflow engine that sequences and runs multi-step jobs reliably, with the orchestration and retries handled in code, not by hand. AI Economics Advisor An AI advisor that reasons over economic data to turn raw numbers into decisions a non-specialist can act on. Crypto Arbitrage Chainer Detects and chains crypto arbitrage opportunities across markets, reasoning about multi-hop paths in real time. SLAM-Based Crypto Trading A particle-filter crypto trading system making sub-millisecond decisions, accelerated on Apple Silicon. Azara.ai Platform Founded and launched an agentic AI workflow platform from scratch in 15 months as Founder & CTO, hiring and leading the team to launch. 10XLabs Container Platform Invented one of the earliest container platforms and service meshes (2011-2012), with sub-second provisioning years before Docker went mainstream. DevOps Foundation Principles Early pioneer in founding DevOps: wrote a DevOps vision paper in 2006, before the term existed — practices now used across the industry.
Apollo AI hedge fund Built a fully automated hedge fund on neural networks and genetic algorithms in the 1990s — algorithmic trading decades before it was normal.
Program trading at UBS Led development and architecture of equities program-trading systems where latency and correctness were the whole job.
DevOps at Morgan Stanley Early pioneer in founding DevOps: wrote a DevOps vision paper in 2006, before the term existed — practices now used across the industry.
Principal Cloud Architect, AWS Principal Cloud Architect for ASEAN, leading DevOps and microservices delivery for major regional banks and enterprises.
Global Chief Architect, banking Global Chief Architect roles building world-class architecture functions and global standardization programs across international banking.

Still shipping the code.

I write code, and I have for thirty-five years — usually in places where being wrong was expensive. I started doing AI for money in the 1990s, building a fully automated hedge fund on neural networks and genetic algorithms, and I have been close to the hard edge of trading, banking, and infrastructure ever since. Along the way I was an early pioneer in founding DevOps — writing a DevOps vision paper at Morgan Stanley in 2006 — invented one of the earliest container platforms and service meshes, served as a Principal Cloud Architect at AWS, and held Global Chief Architect roles in international banking. I have founded more than thirty startups, including Azara, an agentic AI workflow platform I built and launched as Founder and CTO. Today I build AI directly: the code harnesses that make agentic coding trustworthy, the code-intelligence graphs that let machines reason about software, voice pipelines, and hybrid retrieval at scale. I am the person you hire when AI has to actually work — as your fractional AI CTO or Chief AI Architect, and as someone who still ships the code.

Hire an AI leader who still ships the code.

Whether you need a fractional AI CTO to own the roadmap or a Chief AI Architect who writes the hard parts himself, the question is the same: can this person build AI you can trust in production? The harnesses, graphs, voice pipelines, and retrieval systems above are the answer. Let's talk about what you're building and where it has to hold up.