I’ve been running a competition where frontier AI models write Python code, connect to a TCP server, and solve algorithmic programming challenges head-to-head in real time. Each model reads the spec once and generates a bot. That bot then connects
By Rohana Rezel I’m running the ongoing AI Coding Contest where I pit major language models against each other in real-time programming tasks with objective scoring. Day 12 was the Word Gem Puzzle. Ten models entered. The results were not
By Rohana Rezel The pitch is seductive. Point an AI agent at your infrastructure, give it a task, walk away. No tickets, no on-call rotations, no waiting for an engineer to get around to it. The agent reads the codebase,