DeepSeek, a Chinese AI firm, has launched DeepSeek V3, an advanced open-source AI model. This model boasts 671 billion parameters and outperforms both existing open-source models and closed models like OpenAI's GPT-4o across various benchmarks. Utilizing a mixture of experts architecture, it activates only the relevant neural networks for each task, significantly reducing hardware costs.
The training of DeepSeek V3 required approximately 2788K GPU hours, costing around $5.57 million, which is notably less than the expenses incurred by major U.S. tech companies for training large language models. The model has surpassed competitors like Llama-3.1-405B and Qwen 2.5-72B, and while it was outperformed by Anthropic's Claude 3.5 Sonnet in some benchmarks, it remains a significant advancement in the AI landscape.
• DeepSeek V3 outperforms leading AI models in multiple benchmarks.
• The model's training cost is significantly lower than U.S. tech companies.
An AI model whose source code is made publicly available for use and modification.
A neural network architecture that activates only relevant sub-networks for specific tasks, optimizing resource use.
Variables in a model that are adjusted during training to improve performance on tasks.
DeepSeek is a Chinese AI firm that has developed the DeepSeek V3 model, showcasing significant advancements in open-source AI.
OpenAI is a leading AI research organization known for developing advanced models like GPT-4o, which DeepSeek V3 has outperformed.
5 Sonnet, which outperformed DeepSeek V3 in certain benchmarks.
Phys.org on MSN.com 7month
Isomorphic Labs, the AI drug discovery platform that was spun out of Google's DeepMind in 2021, has raised external capital for the first time. The $600
How to level up your teaching with AI. Discover how to use clones and GPTs in your classroom—personalized AI teaching is the future.
Trump's Third Term? AI already knows how this can be done. A study shows how OpenAI, Grok, DeepSeek & Google outline ways to dismantle U.S. democracy.
Sam Altman today revealed that OpenAI will release an open weight artificial intelligence model in the coming months. "We are excited to release a powerful new open-weight language model with reasoning in the coming months," Altman wrote on X.