Chinese AI startup DeepSeek launches DeepSeek-V3, an enormous 671-billion parameter model, shattering benchmarks and rivaling prime proprietary techniques.
This smaller mannequin approached the mathematical reasoning capabilities of GPT-4 and outperformed another Chinese model, Qwen-72B. https://writexo.com/share/u02f7sch