Tag Archives: AI benchmark

An open-weights Chinese model just beat Claude, GPT-5.5, and Gemini in a programming challenge

A colourful 7x7 Word Gem Puzzle board with KIMI highlighted in green and CLAUDE in purple

By Rohana Rezel I’m running the ongoing AI Coding Contest where I pit major language models against each other in real-time programming tasks with objective scoring. Day 12 was the Word Gem Puzzle. Ten models entered. The results were not

Discuss on boreal.social