Loading
Loading
A chronological list of notable foundation‑model releases, newest first.
| Date | Lab | Model / Release | Notes | |
|---|---|---|---|---|
| JUN 1, 2026 | MiniMax | MiniMax Releases M3 SWE-Bench Pro:59.0% (vendor)Context:1M TokensSpeed vs M2:9x prefill / 15x decodeCost:5-10% of proprietary | MiniMax ships M3, an open-weight frontier model built on its new MSA (MiniMax Sparse Attention) architecture — a KV-block selection scheme that cuts per-token compute at long context — carrying a 1M-token context window, native multimodality, and built-in agentic coding. JD Cloud was first to host it via an OpenAI-compatible API. | |
| JUN 1, 2026 | Alibaba (Qwen) | Alibaba Launches Qwen 3.7 Plus Modalities:Text + VisionVision Arena:#16 (LM Arena)Context:1M TokensAccess:Proprietary API | Alibaba Cloud ships Qwen 3.7 Plus, the multimodal sibling to the text-only Qwen 3.7 Max. It adds image and video understanding and is tuned to balance power and speed for everyday agentic tasks. Closed-weights and API-only, the preview pushed Alibaba to a top-5 lab on LM Arena's Vision leaderboard. | |
| MAY 28, 2026 | Anthropic | Anthropic Ships Claude 4.8 Opus Focus:Agentic AutonomySWE-bench:97.4%Availability:Claude Console | Anthropic officially launches Claude 4.8 Opus, its ultimate frontier model optimized for long-horizon autonomous system execution, repository-level software engineering, and multi-step complex logic. | |
| MAY 20, 2026 | Alibaba (Qwen) | Alibaba Launches Qwen 3.7 Max Intelligence Index:56.6 (5th global)SWE-Verified:80.4%Agent Stamina:35h / 1,158 tool callsInput Price:$2.50 / 1M tokens | Alibaba Cloud unveils Qwen 3.7 Max, a proprietary, API-only 'reasoning agent' model built for long-horizon autonomous work rather than chat. With 1T+ parameters and a 1M-token context window, it ranks 5th globally on Artificial Analysis's Intelligence Index (56.6) — the highest-ranked Chinese model — and is served only through Qwen Studio and Alibaba Cloud Model Studio. | |
| MAY 19, 2026 | Google Launches Gemini 3.5 Flash Context Window:2M TokensFormat:API & Vertex AI | Google releases Gemini 3.5 Flash in general availability, showcasing significant speedups in lightweight tool-calling capabilities and ultra-low-latency workflows. | ||
| MAY 15, 2026 | xAI | xAI Releases Grok 4 Flagship Training Scale:200k GPU ClusterFocus:Real-World Alignment | xAI launches Grok 4, a major foundation upgrade incorporating advanced real-world visual logic and deep real-time tool execution, trained on the expanded Memphis 200k B200/H100 cluster. | |
| APR 25, 2026 | DeepSeek | DeepSeek V4 Released Architecture:MoECost:1/10th Frontier API | DeepSeek launches V4, a native multi-modal reasoning Mixture-of-Experts (MoE) model matching frontier commercial performance at an unprecedented low cost. | |
| APR 24, 2026 | OpenAI | OpenAI Releases GPT-5.5 Focus:Enterprise AgentsBenchmark Gain:+12% Code GenList Price:2x prior modelEffective Cost:~20-30% cheaper / token | OpenAI ships GPT-5.5, introducing key updates targeting multi-agent orchestration, complex coding pipelines, and long-horizon tool execution. | |
| APR 21, 2026 | Moonshot AI | Moonshot AI Ships Kimi K2.6 Swarm Capacity:300 sub-agentsContext Length:4M Tokens | Moonshot AI launches Kimi K2.6, featuring advanced long-context coding and an elevated 'Agent Swarm' system coordinating up to 300 sub-agents. | |
| APR 17, 2026 | Anthropic | Anthropic Ships Claude Opus 4.7 Coding Benchmark:94.2% SWE-benchAvailability:Claude Console | Anthropic releases Claude Opus 4.7 as its flagship model, delivering advanced multi-step tool use, elite coding, and top-tier logical reasoning. | |
| APR 9, 2026 | Meta | Meta Superintelligence Labs Releases Muse Spark Context:4D Spatial VideoLicensing:Proprietary API | Meta launches Muse Spark, a proprietary 4D world model trained on first-person video and spatial data for advanced situational reasoning. | |
| APR 8, 2026 | Anthropic | Anthropic Previews Claude Mythos Access Tier:Project GlasswingFocus:Defensive Cyber & Safety | Anthropic announces Claude Mythos under a gated private preview, focusing on defensive cybersecurity, automated exploit analysis, and mechanistic interpretability. | |
| APR 3, 2026 | Alibaba (Qwen) | Alibaba Releases Qwen 3.6 Series Models:Plus / Max-PreviewFocus:Coding & Reasoning | Alibaba Cloud launches the Qwen 3.6 family, bringing deep structural optimizations for agentic programming and reasoning workloads. | |
| MAR 28, 2026 | Zhipu AI | Zhipu AI Launches GLM-5.1 Architecture:MoE Refinement | Zhipu AI releases GLM-5.1, bringing logical refinement and expanded coding context for enterprise Chinese API users. | |
| MAR 17, 2026 | Zhipu AI | Zhipu AI Releases GLM-5-Turbo Target Use:OpenClaw AgentsLatency:Ultra-Low | Zhipu AI rolls out GLM-5-Turbo, an optimized, fast-inference model tailored for 'OpenClaw' autonomous agent platforms. | |
| MAR 16, 2026 | MiniMax | MiniMax M2.7 Released Context:1M Tokens | MiniMax releases M2.7, offering expanded context windows and native multi-modal image-to-text generation capabilities. | |
| MAR 6, 2026 | OpenAI | OpenAI Releases GPT-5.4 Type:Consolidated Flagship | OpenAI releases GPT-5.4, a major unified flagship update incorporating critical logical execution and context pipeline improvements from the 5.3 branch. | |
| MAR 4, 2026 | OpenAI | OpenAI Launches GPT-5.3 Instant Performance:Ultra-Low Latency | OpenAI rolls out GPT-5.3 Instant, a high-speed, cost-optimized conversational and logical execution model. | |
| FEB 27, 2026 | MiniMax | MiniMax Releases M2.6 Platform Rank:Top 3 OpenRouter | MiniMax launches M2.6, bridging agentic text capability and early multi-modal context features, seeing rapid pick-up on OpenRouter developer platforms. | |
| FEB 16, 2026 | Alibaba (Qwen) | Alibaba Launches Qwen 3.5 Family Architecture:Sparse MoEFlagship Size:397B parameters | Alibaba Cloud releases Qwen 3.5, engineered for high-performance agentic MoE configurations (397B-A17B flagship MoE). | |
| FEB 12, 2026 | ByteDance | ByteDance Releases Seedance 2.0 Resolution:4K CinematicAudio Mode:Fully Synchronized | ByteDance officially launches Seedance 2.0, a unified multi-modal video/audio generation model generating highly realistic, synchronized cinema clips. | |
| FEB 12, 2026 | MiniMax | MiniMax Releases M2.5 Rank:#1 on OpenRouter | MiniMax launches M2.5, which rapidly climbs OpenRouter popularity charts, demonstrating high adoption for agentic text scenarios. | |
| FEB 11, 2026 | Zhipu AI | Zhipu AI Launches GLM-5 MoE Parameters:744B MoEHardware:Huawei Ascend | Zhipu AI releases GLM-5, a flagship 744B-parameter Mixture-of-Experts (MoE) model trained entirely on sovereign Chinese hardware. | |
| FEB 5, 2026 | Google Launches Gemini 3.1 Pro Focus:Reinforcement Learning PlanningContext Length:2M Tokens | Google launches Gemini 3.1 Pro, a major agentic reasoning update to its flagship model series. Delivering native deep search execution, reinforcement-learning based planning, and substantial code-generation gains. | ||
| FEB 5, 2026 | OpenAI | OpenAI Ships GPT-5.3-Codex Focus:Software Engineering | OpenAI launches GPT-5.3-Codex, establishing elite new milestones in multi-agent logical coordination and deep code architecture mapping. | |
| FEB 5, 2026 | Anthropic | Anthropic Ships Claude Opus 4.6 Logic Score:SWE-bench +5% | Anthropic launches Claude Opus 4.6, delivering optimization bumps for agentic reasoning and complex coding workloads. | |
| JAN 27, 2026 | Moonshot AI | Moonshot AI Releases Kimi K2.5 Parameters:1T MoE | Moonshot AI launches Kimi K2.5, a 1-trillion parameter multimodal Mixture-of-Experts model powered by an advanced 'Agent Swarm' system. | |
| DEC 22, 2025 | Zhipu AI | Zhipu AI Ships GLM-4.7 Context Length:1M Tokens | Zhipu AI rolls out GLM-4.7, delivering major logic optimization, expanded context limits, and complex multi-agent orchestration tools. | |
| NOV 24, 2025 | Anthropic | Anthropic Ships Claude Opus 4.5 Context Window:500k Tokens | Anthropic rolls out Claude Opus 4.5, a mid-generation bump focused on complex long-context reasoning and reduced hallucination rates. | |
| NOV 18, 2025 | Google Launches Gemini 3 Pro Focus:Long-Horizon Planning | Google introduces Gemini 3 Pro, focusing on advanced multi-step planning, long-horizon tool execution, and deeper Google Workspace integrations. | ||
| OCT 31, 2025 | MiniMax | MiniMax Releases M2 Type:Base Model | MiniMax shifts to its next-generation foundation architecture with the release of the M2 base model. | |
| SEP 30, 2025 | Zhipu AI | Zhipu AI Releases GLM-4.6 & 4.6V Format:API & Open Weights | Zhipu AI launches GLM-4.6 alongside the multimodal 4.6V, establishing massive upgrades in multi-modal understanding, coding context, and reasoning capabilities. | |
| AUG 11, 2025 | Zhipu AI | Zhipu AI Ships GLM-4.5V Modalities:Vision-Language | Zhipu AI launches GLM-4.5V, a highly competitive open-weight vision-language model. | |
| AUG 7, 2025 | OpenAI | OpenAI Releases GPT-5 Modalities:Speech, Text, Video | OpenAI launches GPT-5, a next-generation frontier multimodal model establishing new milestones in complex logic, native video processing, and abstract reasoning. | |
| JUL 28, 2025 | Zhipu AI | Zhipu AI Releases GLM-4.5 & GLM-4.5-Air Speed:2x GLM-4 | Zhipu AI releases GLM-4.5 alongside the high-speed GLM-4.5-Air model, optimizing API cost and latency. | |
| JUL 11, 2025 | Moonshot AI | Moonshot AI Releases Kimi K2 Context Length:2M Tokens | Moonshot AI launches the foundational Kimi K2 model, establishing a massive baseline context size for long-form document processing. | |
| JUN 17, 2025 | Google Releases Gemini 2.5 Flash Focus:Low-Latency ReasoningContext Length:1M Tokens | Google ships Gemini 2.5 Flash, bringing major architectural speed and multi-lingual reasoning improvements to its widely used efficiency model. | ||
| JUN 1, 2025 | ByteDance | ByteDance Launches Seedance 1.0 Type:Video Gen | ByteDance introduces Seedance, its first-generation unified video foundation model, focusing on realistic character physics. | |
| MAY 22, 2025 | Anthropic | Anthropic Releases Claude Opus 4 & Sonnet 4 Focus:Frontier Logic & SWESWE-bench:SOTA | Anthropic unveils the Claude 4 generation, launching both the flagship Claude Opus 4 and the balanced Claude Sonnet 4. | |
| APR 29, 2025 | Alibaba (Qwen) | Alibaba Releases Qwen 3 Family Benchmark:+15% Math/Logic | Alibaba Cloud releases the Qwen 3 model family, showing dramatic logical gains and native multi-lingual reasoning benchmarks. | |
| APR 16, 2025 | OpenAI | OpenAI Launches Flagship o3 Coding:AIME 97.2% | OpenAI releases the full flagship o3 reasoning model, breaking previous records in complex physics, chemistry, coding, and mathematical benchmarks. | |
| APR 5, 2025 | Meta | Meta Releases Llama 4 Scout & Maverick Scout Size:109B (17B Active)Maverick Size:400B (17B Active) | Meta officially launches Llama 4 Scout (109B parameter MoE) and Llama 4 Maverick (400B parameter MoE), featuring advanced open-weight architectures. | |
| MAR 25, 2025 | Google Releases Gemini 2.5 Pro (Experimental) Release Type:Experimental Preview | Google launches Gemini 2.5 Pro Experimental, focusing on advanced coding reasoning and complex multi-modal analysis. | ||
| FEB 24, 2025 | Anthropic | Anthropic Launches Claude 3.7 Sonnet Thinking Mode:User-Toggled | Anthropic releases Claude 3.7 Sonnet, introducing native hybrid-reasoning capabilities, allowing users to toggle between fast-inference and deep-thinking modes. | |
| FEB 18, 2025 | xAI | xAI Releases Grok 3 Reasoning Model Cluster:100k ColossusMath Score:Frontier Parity | xAI officially launches Grok 3, demonstrating breakthrough mathematical logic and long-horizon planning performance, trained on the 100k liquid-cooled Colossus cluster in Memphis. | |
| JAN 31, 2025 | OpenAI | OpenAI Releases o3-mini Latency:Low-Latency | OpenAI releases o3-mini, providing low-latency reasoning speeds and competitive coding capabilities for developers. | |
| JAN 20, 2025 | Moonshot AI | Moonshot AI Releases Kimi K1.5 Logic Score:o1-Parity | Moonshot AI releases Kimi K1.5, claiming parity with OpenAI o1 across mathematical, logical, and coding benchmarks. | |
| JAN 20, 2025 | DeepSeek | DeepSeek Releases R1 Cost:$0.55 / M Tokens | DeepSeek launches R1, an open-source reasoning model that matches top Western proprietary APIs at an extreme fraction of the cost, disrupting global tech markets. | |
| DEC 26, 2024 | DeepSeek | DeepSeek Releases V3 Parameters:671B Total (37B Active)Pricing:$0.14 / M input tokens | DeepSeek launches DeepSeek-V3, a massive 671B parameter Mixture-of-Experts (MoE) flagship model. Delivering state-of-the-art open-source logic at commodity pricing with Multi-head Latent Attention (MLA) and FP8 training. | |
| DEC 6, 2024 | Meta | Meta Launches Llama 3.3 Size:70B | Meta releases Llama 3.3, upgrading the 70B model size to Llama 3.1 405B intelligence levels at significantly reduced inference cost. | |
| DEC 5, 2024 | OpenAI | OpenAI Ships Full o1 Full Release:Dec 5, 2024 | OpenAI releases the complete o1 model family, integrating deep reasoning capabilities directly into its flagship API and ChatGPT platforms. | |
| NOV 4, 2024 | Anthropic | Anthropic Ships Claude 3.5 Haiku Coding Speed:+20% | Anthropic releases Claude 3.5 Haiku globally, replacing the older Claude 3 Haiku model with significantly sharper coding skills. | |
| OCT 22, 2024 | Anthropic | Anthropic Upgrades Claude 3.5 Sonnet (v2) OSWorld Benchmark:14.9% (SOTA) | Anthropic releases an upgraded version of Claude 3.5 Sonnet ('v2') alongside a public beta of 'Computer Use' agentic functionality. | |
| SEP 25, 2024 | Meta | Meta Ships Llama 3.2 Vision & Lightweight Models Edge Sizes:1B / 3B | Meta releases Llama 3.2, introducing multimodal vision capabilities (11B/90B) and ultra-lightweight models (1B/3B) for edge devices. | |
| SEP 19, 2024 | Alibaba (Qwen) | Alibaba Launches Qwen 2.5 Family Coding Rank:Top Open-Source | Alibaba Cloud releases Qwen 2.5, establishing a highly popular open-source coding, mathematics, and multilingual benchmark family. | |
| SEP 12, 2024 | OpenAI | OpenAI Previews o1-preview & o1-mini AIME 2024 Score:83.3% | OpenAI unveils its first reinforcement-learning reasoning models, releasing o1-preview and o1-mini to ChatGPT Plus and API tiers. | |
| JUL 23, 2024 | Meta | Meta Releases Llama 3.1 405B Flagship Context window:128k TokensFlagship size:405B Parameters | Meta releases Llama 3.1, introducing the massive 405B parameter flagship alongside 8B/70B upgrades, expanding the context window to 128k. | |
| JUL 18, 2024 | OpenAI | OpenAI Ships GPT-4o-mini Pricing:$0.15 / M input tokens | OpenAI launches GPT-4o-mini as a high-speed, cost-effective multimodal model to completely replace GPT-3.5 Turbo. | |
| JUN 20, 2024 | Anthropic | Anthropic Launches Claude 3.5 Sonnet SWE-bench Verified:49.0% (SOTA) | Anthropic releases Claude 3.5 Sonnet, establishing new benchmarks in software engineering, complex logical reasoning, and chart parsing. | |
| JUN 6, 2024 | Alibaba (Qwen) | Alibaba Launches Qwen 2 Open weights Language Support:30+ Languages | Alibaba Cloud releases Qwen 2, providing multilingual open-weights models in five size variations. | |
| MAY 15, 2024 | DeepSeek | DeepSeek Releases V2 API pricing:$0.14 / M input tokens | DeepSeek releases DeepSeek-V2, introducing Multi-head Latent Attention (MLA) and slashing API pricing down to commodity levels. | |
| MAY 14, 2024 | Google Launches Gemini 1.5 Flash Context Window:1M Tokens | Google rolls out Gemini 1.5 Flash at Google I/O, optimized for speed and cost while retaining the signature 1-million token context window. | ||
| MAY 13, 2024 | OpenAI | OpenAI Ships GPT-4o Omnimodal model Audio Latency:232ms average | OpenAI launches GPT-4o, natively integrating real-time audio, vision, and text processing in a single fast-inference model. | |
| APR 18, 2024 | Meta | Meta Releases Llama 3 8B & 70B Training Data:15T Tokens | Meta launches Llama 3 in 8B and 70B sizes, showing huge logical gains over Llama 2 via high-quality dataset curation. | |
| APR 17, 2024 | MiniMax | MiniMax Releases abab 6.5 MoE Type:Proprietary MoE | MiniMax releases abab 6.5, bringing reasoning improvements to its proprietary MoE API line. | |
| APR 9, 2024 | OpenAI | OpenAI GPT-4 Turbo Reaches GA Context Length:128k Tokens | OpenAI releases the GA version of GPT-4 Turbo with vision (`gpt-4-turbo-2024-04-09`), increasing context logic and performance. | |
| MAR 13, 2024 | Anthropic | Anthropic Claude 3 Haiku Reaches GA Speed:Low-LatencyFocus:Document Analysis | Anthropic makes Claude 3 Haiku generally available, completing its initial Claude 3 roll out with a highly efficient model. | |
| MAR 4, 2024 | Anthropic | Anthropic Ships Claude 3 Opus & Sonnet Opus context:200k Tokens | Anthropic launches the Claude 3 family, introducing the flagship Claude 3 Opus alongside the balanced Claude 3 Sonnet. | |
| FEB 15, 2024 | Google Unveils Gemini 1.5 Pro Context Length:1M - 2M Tokens | Google launches Gemini 1.5 Pro in limited preview, introducing the first native 1-million token context window in LLM history. | ||
| FEB 4, 2024 | Alibaba (Qwen) | Alibaba Launches Qwen 1.5 Series Sizes:0.5B to 72B | Alibaba Cloud releases Qwen 1.5, offering highly competitive open models in diverse parameter sizes. | |
| JAN 16, 2024 | Zhipu AI | Zhipu AI Launches GLM-4 API Bench:Early GPT-4 Parity | Zhipu AI rolls out GLM-4, offering a competitive commercial API matching early GPT-4 benchmarks. | |
| JAN 15, 2024 | MiniMax | MiniMax Releases abab 6 MoE Arch:First Chinese MoE | MiniMax releases abab 6, establishing itself as the first Chinese lab to deploy a Mixture-of-Experts architecture. | |
| DEC 6, 2023 | Google Launches Gemini 1.0 Family Ultra Benchmark:90.0% MMLU | Google introduces Gemini 1.0 in Ultra, Pro, and Nano tiers, its first native multimodal architecture designed from scratch. | ||
| NOV 6, 2023 | OpenAI | OpenAI Announces GPT-4 Turbo at DevDay Pricing:1/3rd GPT-4 cost | OpenAI announces GPT-4 Turbo in preview (`gpt-4-1106-preview`), raising context to 128k, dropping prices, and adding JSON mode. | |
| JUL 18, 2023 | Meta | Meta Launches Llama 2 Commercial open-weights Training Data:2.0T Tokens | Meta releases Llama 2 in partnership with Microsoft, offering open weights for commercial and research use across 7B, 13B, and 70B sizes. | |
| JUL 11, 2023 | Anthropic | Anthropic Ships Claude 2 Context Length:100k Tokens | Anthropic launches Claude 2, featuring a public web interface and a massive 100k token context window. | |
| MAR 14, 2023 | OpenAI | OpenAI Launches GPT-4 MMLU Score:86.4% | OpenAI releases GPT-4, setting the ultimate gold standard for reasoning, coding, professional exams, and complex instructions. | |
| MAR 14, 2023 | Anthropic | Anthropic Releases Claude 1 Safety Framework:Constitutional AI | Anthropic officially launches its first commercial model, Claude 1, emphasizing safety and helpfulness via Constitutional AI. | |
| FEB 24, 2023 | Meta | Meta Unveils Llama 1 for Research Leak Date:Late February 2023 | Meta releases its first series of open models (7B to 65B) to researchers, whose weights are quickly leaked to the public. | |
| NOV 30, 2022 | OpenAI | OpenAI Launches ChatGPT Plus GPT-3.5 User Adoption:100M in 2 months | OpenAI launches ChatGPT, a free conversational interface built on the InstructGPT (GPT-3.5) series. | |
| MAR 15, 2022 | OpenAI | OpenAI Releases InstructGPT text-davinci-002 Alignment Mode:RLHF | OpenAI rolls out InstructGPT models, utilizing RLHF to align model outputs with human instructions. | |
| JUN 11, 2020 | OpenAI | OpenAI Announces GPT-3 API Private Beta Parameters:175B | OpenAI releases the commercial API private beta for GPT-3 (175B parameters), showcasing early in-context learning. |
Pacing of the frontier model releases — pooled across all labs and broken down per lab. Gaps represent actual days between consecutive frontier launches.
Chronological feed of the 10 most recent releases across all tracked labs