Chinese AI startup DeepSeek launches DeepSeek-V3, a large 671-billion parameter model, shattering benchmarks and rivaling top proprietary techniques.
This smaller mannequin approached the mathematical reasoning capabilities of GPT-4 and outperformed another Chinese mannequin, Qwen-72B. https://postgresconf.org/users/deepseek-1