Ai2 launches Tulu3-405B AI model claiming top performance over DeepSeek and OpenAI

techcrunch.com — January 30, 2025 at 03:01 PM UTC

Ai2, a Seattle-based nonprofit, has launched a new AI model called Tulu3-405B, which it claims outperforms DeepSeek V3 and OpenAI's GPT-4o on several benchmarks. The model is open source, so anyone can replicate it. Tulu3-405B has 405 billion parameters and was trained on 256 GPUs. It uses a technique called reinforcement learning with verifiable rewards (RLVR) to improve its performance on tasks whose answers can be checked, such as math problem solving. The model excelled on tests such as PopQA and GSM8K, surpassing competitors including Meta's Llama 3.1. Tulu3-405B is available for testing through Ai2's chatbot and on platforms such as GitHub and Hugging Face.

With a significance score of 4.7, this news ranks in the top 2.5% of today's 30,325 analyzed articles.
