ML Platform Architect & Technical Lead · Cape Town, available globally
I define the infrastructure and delivery strategy for production ML systems — the layer that determines whether enterprise AI creates commercial value or stays in notebooks.
First-year ROI from platform optimisation I led.
Models trained daily, up from ~1 per week, via Prediction Factory I conceptualised.
Reduction in P1/P2 incidents through MLOps standardisation.
Faster experiment turnaround through champion model architectures.
Third-party perspective
"Andrea operates with the strategic mindset of a Lead Architect. He possesses the rare ability to bridge the gap between 'Scientific Rigour' and 'Operational Speed'."
"Andrea is a standout technical leader within our practice. His work consistently sets the standard for technical excellence, and he operates as a force multiplier; his presence raises the technical bar for the entire team."
"Andrea did not simply teach me technical skills; he taught me how to think like an architect. Because of his mentorship, I was promoted to AI Adoption Lead — a transition that typically takes two to four years, achieved in just 12 months."
What I do
I focus on the parts of AI delivery that determine whether a system is actually useful after launch: trustworthy retrieval, observability, evaluation, release discipline, clear interfaces, and privacy-conscious architecture.
That means GenAI and RAG systems, MLOps and AI platform engineering, and delivery practices that survive contact with real users, real constraints, and real governance requirements.
Retrieval pipelines, grounded responses, tool orchestration, evaluation loops, and secure prompt workflows for production use.
Automated training, deployment, monitoring, reproducibility, and scalable delivery patterns for model-heavy organisations.
Practical, security-aware cloud systems across container platforms, CI/CD, infrastructure-as-code, and controlled data boundaries.
Systems tied to throughput gains, reduced manual work, better governance, and commercial outcomes rather than novelty theatre.
Career history, domain background, delivery style, and why production-grade AI is my lane.
View profileCase studies spanning GenAI, RAG, model monitoring, cloud architecture, and AI platform engineering.
Explore projectsWriting on MLOps, production AI systems, applied mathematics, ethics, and engineering judgement.
Read writingNeed the concise version? Download my CV or get in touch directly about consulting or roles.
Featured work
These cover AI platform engineering, model lifecycle automation, retrieval-led experiences, and applied systems thinking.
Led architecture and delivery of a production-grade RAG chatbot for John Lewis Partnership's internal workforce — from first line of code to 90% adoption in one week. Drummie answers HR, …
Python, Google Cloud Run, Gemini API, RAG, LangChain, Vector Store, Google Chat API, GCP
Read case studyDefined and delivered JUMO's internal ML platform from first principles — a config-driven orchestration layer that scaled model training from ~1 model per week to 50+ models per day, automated …
Python, Kubernetes, FastAPI, MLflow, Airflow, AWS S3, DynamoDB, EMR on EKS, Great Expectations, Datadog, PagerDuty, Docker
Read case studyDesigned and built a real-time model monitoring system at JUMO that reduced data anomaly detection time from weeks to minutes. The system automated batch scoring with Airflow-orchestrated Kubernetes jobs, computed …
Python, Airflow, Kubernetes, Great Expectations, Datadog, PagerDuty, AWS EKS, S3
Read case studyYou can review my profile, browse technical writing, inspect project work, or connect via LinkedIn, GitHub, and Medium.