Chinese AI firm releases DeepSeek V3, a new leader in open-source AI models

Full Article
Chinese AI firm releases DeepSeek V3, a new leader in open-source AI models

DeepSeek, a Chinese AI firm, has launched DeepSeek V3, an advanced open-source AI model. This model boasts 671 billion parameters and outperforms both existing open-source models and closed models like OpenAI's GPT-4o across various benchmarks. Utilizing a mixture of experts architecture, it activates only the relevant neural networks for each task, significantly reducing hardware costs.

The training of DeepSeek V3 required approximately 2788K GPU hours, costing around $5.57 million, which is notably less than the expenses incurred by major U.S. tech companies for training large language models. The model has surpassed competitors like Llama-3.1-405B and Qwen 2.5-72B, and while it was outperformed by Anthropic's Claude 3.5 Sonnet in some benchmarks, it remains a significant advancement in the AI landscape.

• DeepSeek V3 outperforms leading AI models in multiple benchmarks.

• The model's training cost is significantly lower than U.S. tech companies.

Key AI Terms Mentioned in this Article

Open-source AI model

An AI model whose source code is made publicly available for use and modification.

Mixture of Experts (MoE)

A neural network architecture that activates only relevant sub-networks for specific tasks, optimizing resource use.

Parameters

Variables in a model that are adjusted during training to improve performance on tasks.

Companies Mentioned in this Article

DeepSeek

DeepSeek is a Chinese AI firm that has developed the DeepSeek V3 model, showcasing significant advancements in open-source AI.

OpenAI

OpenAI is a leading AI research organization known for developing advanced models like GPT-4o, which DeepSeek V3 has outperformed.

Anthropic

5 Sonnet, which outperformed DeepSeek V3 in certain benchmarks.

Get Email Alerts for AI News

By creating an email alert, you agree to AIleap's Terms of Service and Privacy Policy. You can pause or unsubscribe from email alerts at any time.

Latest Articles

Alphabet's AI drug discovery platform Isomorphic Labs raises $600M from Thrive
TechCrunch 6month

Isomorphic Labs, the AI drug discovery platform that was spun out of Google's DeepMind in 2021, has raised external capital for the first time. The $600

AI In Education - Up-level Your Teaching With AI By Cloning Yourself
Forbes 6month

How to level up your teaching with AI. Discover how to use clones and GPTs in your classroom—personalized AI teaching is the future.

Trump's Third Term - How AI Can Help To Overthrow The US Government
Forbes 6month

Trump's Third Term? AI already knows how this can be done. A study shows how OpenAI, Grok, DeepSeek & Google outline ways to dismantle U.S. democracy.

Sam Altman Says OpenAI Will Release an 'Open Weight' AI Model This Summer
Wired 6month

Sam Altman today revealed that OpenAI will release an open weight artificial intelligence model in the coming months. "We are excited to release a powerful new open-weight language model with reasoning in the coming months," Altman wrote on X.

Popular Topics