Ctrl+Alt+Future15 Sep 2025

Qwen3-Next: Free large language model from Alibaba that could revolutionize training costs?

Qwen3-Next is a new large-scale language model (LLM) from Alibaba that has 80 billion parameters but only activates 3 billion during inference through a hybrid attention mechanism and rare Mixture-of-Experts (MoE) design. It offers outstanding efficiency and speed of up to 10 times compared to previous models, while achieving higher accuracy in ultra-long context tasks and outperforming Gemini-2.5-Flash-Thinking model on complex reasoning tests.

Why is Qwen3-Next good and what makes it special?

Accessibility and open source:

Qwen3-Next models are available through Hugging Face, ModelScope, Alibaba Cloud Model Studio, and NVIDIA API Catalog. Its open source nature, released under the Apache 2.0 license, encourages innovation and democratizes access to cutting-edge AI technology.

Cost-effectiveness:

- Qwen3-Next not only shows higher accuracy, but also significant efficiency compared to other models

- It can be trained with less than 10% of the computational cost (9.3% to be exact) compared to the Qwen3-32B model. This reduced training cost has the potential to democratize AI development.

Faster inference:

- Only 3 billion (about 3.7%) of its 80 billion parameters are active during the inference phase. This dramatically reduces the FLOPs/token ratio while maintaining model performance

FLOPs is an abbreviation for Floating Point Operations Per Second, which is a unit of measurement for computer performance. In the case of AI models, FLOPs/token indicates how many computational operations are required to process a single text "token" (word or word fragment).

- For shorter contexts, it provides up to 7x speedup in the prefill (first token output) phase and 4x speedup in the decode (additional tokens output) phase.

Innovative architecture:

- Hybrid attention mechanism, which enables extremely efficient context modeling for ultra-long contexts.

- Rare Mixture-of-Experts (MoE) system: consists of 512 experts, where 10 experts and 1 shared expert are actively used at the same time.

Outstanding performance:

- Outperforms Qwen3-32B-Base in most benchmarks, while using less than 10% of its computational cost

- Very close in performance to Alibaba's flagship 235B parameter model.

- Performs particularly well in handling ultra-long context tasks, up to 256,000 tokens. Furthermore, the context length can be extended to 1 million tokens using the YaRN method.

- Qwen3-Next-80B-A3B-Thinking excels at complex reasoning tasks. It outperforms mid-range Qwen3 variants and even outperforms the closed-source Gemini-2.5-Flash-Thinking in several benchmarks

Multilingual capabilities:

The automatic speech recognition model, Qwen3-ASR-Flash, performs accurate transcription in 11 major languages and several Chinese dialects

Agent capabilities

Excellent for device invocation tasks and agent-based workflows

Links

Qwen3-Next: Towards Ultimate Training & Inference Efficiency: https://qwen.ai/blog?id=4074cca80393150c248e508aa62983f9cb7d27cd&from=research.latest-advancements-listHugging Face model: https://huggingface.co/collections/Qwen/qwen3-next-68c25fd6838e585db8eeea9dModelscope: https://modelscope.cn/models/Qwen/Qwen3-Next-80B-A3B-ThinkingOpenrouter: https://openrouter.ai/qwenQwen Chat: https://chat.qwen.ai/

Oppdag Premium

Prøv 14 dager gratis

Kjøp Premium

Episoder(15)

HunyuanImage 2.1 is an open source model that can generate high resolution (2K) images

HunyuanImage 2.1 is an open source text-to-image diffusion model capable of generating ultra-high resolution (2K) images. It stands out with its dual text encoder, two-stage architecture including a r...

12 Sep 202533min

Google Stitch: user interface (UI) design using artificial intelligence

Google Stitch is an AI-powered tool designed for app developers to generate user interfaces (UI) for mobile and web applications. It can turn ideas into UIs. By default, it uses Google DeepMind’s late...

12 Sep 202533min

Kimi K2 0905 is the latest update to Moonshot AI's large-scale Mixture-of-Experts language model

Kimi K2 0905 is the latest update to Moonshot AI’s large-scale Mixture-of-Experts (MoE) language model, which is well-suited for complex agent-like tasks. With its advanced coding and reasoning capabi...

7 Sep 202529min

Tencent HunyuanWorld-Voyager: Generating 3D-consistent video from a single photo

Tencent has unveiled its AI-powered tool called HunyuanWorld-Voyager, which can transform a single image into a directional, 3D-consistent video—providing the thrill of exploration without the need fo...

7 Sep 202546min

GLM-4.5: The Next Generation of Artificial Intelligence That Thinks and Acts

Z.ai introduces its latest flagship models, the GLM-4.5 and GLM-4.5-Air, which take the capabilities of intelligent assistants to a new level. These models uniquely combine deep analytics, master-leve...

7 Sep 202535min

Gemini 2.5 Flash Image: Advanced AI Generation and Editing

Gemini 2.5 Flash Image, also known as Nano Banana, is an advanced, multimodal image creation and editing model that can interpret both text and image commands, allowing users to create, edit, and iter...

4 Sep 202549min

Qwen-Image image generation model: complex text display and precise image editing

Qwen-Image is a basic image generation model developed by Alibaba's Qwen team. It has two outstanding capabilities: complex text rendering and precise image editing.Qwen-Image can render text, even lo...

3 Sep 202539min

Reklamefrie Premium-podkaster

Hør populære podkaster som Storefri med Mikkel og Herman, Ida med hjertet i hånden, Krimpodden og mye mye mer

Skap din egen podkastboble

I appen skaper du ditt eget bibliotek med favoritter, og vi gir deg også anbefalinger til podkaster du ikke kan gå glipp av.

Prøv 14 dager gratis

Dersom du er ny Podme-bruker får du 14 dager gratis prøveperiode når du oppretter abonnement

Premium

99 kr/ måned

Tilgang til alle våre Premium-podkaster
Alle podkaster fra VG, Aftenposten, BT og SA
Reklamefritt Premium-innhold
Ingen bindingstid. Avslutt når du ønsker

Prøv 14 dager gratis

Premium

129 kr/ måned

Tilgang til alle Premium-podkaster
Alle podkaster fra VG, Aftenposten, BT og SA
Reklamefritt Premium-innhold
Ingen bindingstid. Avslutt når du ønsker
En Ekstra bruker

Prøv 14 dager gratis

Populært innen Teknologi

Historiene og stemmene du vil høre

Ubegrenset tilgang til alle dine favorittpodkaster og lydbøker

Les mer