Kimi K2 0905 is the latest update to Moonshot AI's large-scale Mixture-of-Experts language model
Ctrl+Alt+Future, 7 Sep 2025

Kimi K2 0905 is the latest update to Moonshot AI’s large-scale Mixture-of-Experts (MoE) language model. It targets complex agentic tasks, combining stronger coding and reasoning capabilities with a longer context window.


- Agentic intelligence: It doesn’t just answer questions, it also performs actions. This includes advanced tool usage, reasoning, and code synthesis. Given a set of tools, it works out how to use them to complete a task, so the user does not have to write complex workflows by hand.


- Long-context inference: Supports a context length of up to 256k tokens, extended from the previous 128k.


- Coding: Agentic coding has improved, with higher accuracy and better generalization across frameworks. Front-end coding has also advanced, producing more aesthetic and functional output for web, 3D and related tasks. The model performs well on coding benchmarks such as LiveCodeBench and SWE-bench.

- Reasoning and Knowledge: Achieves state-of-the-art performance in frontier knowledge, mathematics and coding among non-thinking models. It performs well on reasoning benchmarks such as ZebraLogic and GPQA.

- Tool Usage: Performs well on tool usage benchmarks such as Tau2 and AceBench. Tool invocation has been strengthened so that the model decides independently when and how to invoke its tools.
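
To make the tool-usage point concrete, here is a minimal sketch of the application side of a tool call. It is an illustration under assumptions, not Moonshot AI's API: it assumes the model returns a tool call as a function name plus a JSON string of arguments (a shape common to OpenAI-style chat APIs), and the names `TOOLS`, `get_weather` and `dispatch_tool_call` are hypothetical.

```python
import json


def get_weather(city: str) -> str:
    """Hypothetical tool: returns a canned answer instead of calling a real service."""
    return f"Sunny in {city}"


# Registry of tools the application exposes to the model (hypothetical names).
TOOLS = {"get_weather": get_weather}


def dispatch_tool_call(tool_call: dict) -> str:
    """Run the tool the model asked for and return its result as text.

    Assumes `tool_call` looks like {"name": ..., "arguments": "<JSON string>"}.
    """
    name = tool_call["name"]
    if name not in TOOLS:
        return f"error: unknown tool {name!r}"
    try:
        args = json.loads(tool_call["arguments"])
    except json.JSONDecodeError as exc:
        return f"error: invalid arguments ({exc})"
    return TOOLS[name](**args)


# Example of a tool call as the model might emit it (assumed shape).
example_call = {"name": "get_weather", "arguments": '{"city": "Oslo"}'}
result = dispatch_tool_call(example_call)
```

The model chooses the tool and its arguments; the application only looks up the function and runs it. Returning errors as text, rather than raising, lets the result be passed back to the model so it can retry with a corrected call.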


Links

Twitter: https://x.com/Kimi_Moonshot/status/1963802687230947698
Kimi-K2: https://moonshotai.github.io/Kimi-K2/
Hugging Face: https://huggingface.co/moonshotai/Kimi-K2-Instruct-0905
Tech report: https://github.com/MoonshotAI/Kimi-K2/blob/main/tech_report.pdf
User Manual: https://platform.moonshot.ai/docs/introduction#text-generation-model
Kimi Chat: https://www.kimi.com/
OpenRouter (MoonshotAI: Kimi K2 0905): https://openrouter.ai/moonshotai/kimi-k2-0905
Groq: https://groq.com/blog/introducing-kimi-k2-0905-on-groqcloud

