AI Models – Large Language Models

DeepSeek-V3.2 Open Source 685B Models Released (Credit - Midjourney, The AI Track)

DeepSeek-V3.2 Open Source 685B Models Released as Free GPT-5 Rivals

Chinese startup DeepSeek has released two 685B-parameter models, V3.2 and V3.2-Speciale, which match or surpass GPT-5 and Gemini-3.0-Pro on major math and coding benchmarks. A 2025 technical report published on Hugging Face describes the DeepSeek Sparse Attention architecture, which cuts inference costs at 128K-token context lengths by about 70% compared with the previous V3.1-Terminus model.
