Cloud & AI Architect
Building with AI,
writing about what works.
15+ years architecting cloud and AI systems — from Doctolib's infrastructure to award-winning AI products. Here I benchmark local LLMs, build agents, and document the techniques that actually hold up.
Currently
AI & DevOps Architect
Stealth startup
Previously
Cloud Architect
Doctolib
Recognition
3rd / 9,500+
MS Azure AI Hackathon
01 — Latest writing
All posts →
Gemma 4 26B Won't Fit on My 24GB MacBook — Until I Did This
Ollama gives 2 tok/s with broken tool calling. I got 49 tok/s with perfect tool calling using Unsloth Q3_K_XL + llama.cpp. Then I built a Claude Code clone on top of it.
I Tested Every Gemma 4 Model Locally on My MacBook — What Actually Works
Audio ASR in 3 languages, image understanding, full-stack app generation, coding, and agentic behavior -- all on a MacBook M4 Pro 24GB.
The Ralph Wiggum Technique: Autonomous AI Development with Claude Code
Learn how to use the Ralph Wiggum technique for autonomous AI-powered coding. Install the Ralph plugin for Claude Code and let your AI write code while you sleep.
02 — Recognition
3rd place among 9,500+ developers — Microsoft Azure AI Hackathon
Foodi Copilot, an AI nutrition assistant built on Azure AI and Cosmos DB, placed in the top 0.2% of one of Devpost's most popular hackathons and was featured on the official Microsoft Developer Blog.
FEATURED INTERVIEW — MICROSOFT HACKATHON WINNERS
03 — Selected projects
All projects →Foodi Copilot
AI-powered nutrition assistant. 3rd place out of 9,500+ developers at the Microsoft Azure AI Hackathon — featured on the official Microsoft Developer Blog.
Browser Agent
Chrome extension agent using RAG to automate multi-step browsing workflows autonomously.
iOS Intelligent Keyboard
Custom Apple-Intelligence-style iOS keyboard with neural predictions and context-aware suggestions.