I put Moonshot AI's new K2 Thinking model to the test on the Shuttle codebase to see whether the hype around its reasoning capabilities holds up in practice.