Chinese AI startup DeepSeek launches DeepSeek-V3, an enormous 671-billion parameter mannequin, shattering benchmarks and rivaling top proprietary systems.
This smaller model approached the mathematical reasoning capabilities of GPT-4 and outperformed another Chinese model, Qwen-72B. https://sites.google.com/view/what-is-deepseek/