“Qwen 2.5-Max outperforms ... almost across the board GPT-4o, DeepSeek-V3 and Llama-3.1-405B,” Alibaba's cloud division said in a statement posted on its official WeChat account.
DeepSeek's success has also set off a race among its domestic rivals to upgrade their own AI models. Two days after DeepSeek-R1 was released, TikTok owner ByteDance updated its flagship AI model, claiming that it outperformed OpenAI's o1 on AIME, a benchmark based on the American Invitational Mathematics Examination that tests how well AI models solve competition-level math problems.