Ctrl+Alt+Future · 3 Sep 2025

Deep Cogito - Cogito v2: a free model using a unique, iterative self-learning method (IDA)

According to developer Deep Cogito, Cogito v2 is one of the world’s most powerful open-source AI models, available in sizes ranging from 70B to 671B parameters. Thanks to its unique, iterative self-learning method (IDA), the model solves complex problems by developing its internal “intuition” rather than by searching longer, which results in shorter and more efficient reasoning.

• Market-leading performance: The company claims that its largest model, a 671B-parameter MoE (Mixture of Experts), competes with the latest DeepSeek models and approaches closed models such as o3 and Claude 4 Opus. The models have been trained on over 30 languages and are optimized for coding, STEM tasks, instruction following, and tool calling.

• Innovative training method (IDA): The company uses a method called Iterated Distillation & Amplification (IDA), which it describes as a scalable and efficient strategy for achieving superintelligence. In essence, the model internalizes the reasoning process and improves its own parameters through iterative self-improvement, rather than simply searching longer for the answer. According to Deep Cogito, this helps the models develop better “intuition.”

• Superior efficiency: The company emphasizes that, thanks to the IDA method, its models achieve superior results with shorter “reasoning chains.” For example, it reports that the 671B model uses 60% shorter reasoning chains than DeepSeek R1. The approach is also significantly more cost-effective: the company claims that training all of the Cogito models cost less than $3.5 million.

• Flexible, hybrid operation: According to the company, Cogito v2 models are hybrid models, which means that they can respond immediately (like a standard LLM), or respond after a self-reflective, “thinking” process. This thinking mode can be manually turned on.

• Size selection and local runnability: Deep Cogito has released the models in four different sizes (70B, 109B, 405B, 671B), so users can choose the model that suits their hardware. The company highlights that with the help of Unsloth, the models can also be run locally, even in quantized (reduced size) form, with minimal loss of accuracy.

• Emergent capabilities: As an interesting side effect, the company notes that although the models were trained only on text data, the multimodal base model lets them reason about visual content as well, through transfer learning.
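To make the size selection and quantization point above more concrete, here is a rough back-of-the-envelope sketch of how much memory the weights alone would need at different precisions. The four parameter counts come from the list above; the bit widths (16, 8, 4) and the helper names are assumptions chosen for illustration, not figures published by Deep Cogito or Unsloth. Real quantized files often mix bit widths, and the KV cache and runtime overhead add to these numbers.

```python
# Weights-only memory estimate: parameters * bits per weight / 8 bytes.
# Assumption: uniform bit width per model; real quantized builds may mix precisions.

# Parameter counts in billions, taken from the model sizes listed in the text.
MODEL_SIZES_BILLION = {"70B": 70, "109B": 109, "405B": 405, "671B": 671}


def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


def footprint_table(bit_widths=(16, 8, 4)) -> dict:
    """Map each model size to its estimated weight memory per bit width."""
    return {
        name: {bits: weight_memory_gb(size, bits) for bits in bit_widths}
        for name, size in MODEL_SIZES_BILLION.items()
    }


if __name__ == "__main__":
    for name, row in footprint_table().items():
        cells = ", ".join(f"{bits}-bit: {gb:.1f} GB" for bits, gb in row.items())
        print(f"{name}: {cells}")
```

Under these assumptions, going from 16-bit to 4-bit weights cuts the weight footprint to a quarter, which is why quantization matters most for the larger sizes.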

Links

Cogito V2 Preview: https://www.deepcogito.com/research/cogito-v2-preview
Hugging Face Cogito v2 preview - 671B MoE: https://huggingface.co/deepcogito/cogito-v2-preview-deepseek-671B-MoE
Unsloth: https://docs.unsloth.ai/basics/tutorials-how-to-fine-tune-and-run-llms/cogito-v2-how-to-run-locally
OpenRouter: https://openrouter.ai/deepcogito/cogito-v2-preview-deepseek-671b

Episodes (15)

Qwen3-Next: Free large language model from Alibaba that could revolutionize training costs?

Qwen3-Next is a new large-scale language model (LLM) from Alibaba that has 80 billion parameters but only activates 3 billion during inference through a hybrid attention mechanism and rare Mixture-of-...

15 Sep 2025 · 46 min

HunyuanImage 2.1 is an open source model that can generate high resolution (2K) images

HunyuanImage 2.1 is an open source text-to-image diffusion model capable of generating ultra-high resolution (2K) images. It stands out with its dual text encoder, two-stage architecture including a r...

12 Sep 2025 · 33 min

Google Stitch: user interface (UI) design using artificial intelligence

Google Stitch is an AI-powered tool designed for app developers to generate user interfaces (UI) for mobile and web applications. It can turn ideas into UIs. By default, it uses Google DeepMind’s late...

12 Sep 2025 · 33 min

Kimi K2 0905 is the latest update to Moonshot AI's large-scale Mixture-of-Experts language model

Kimi K2 0905 is the latest update to Moonshot AI’s large-scale Mixture-of-Experts (MoE) language model, which is well-suited for complex agent-like tasks. With its advanced coding and reasoning capabi...

7 Sep 2025 · 29 min

Tencent HunyuanWorld-Voyager: Generating 3D-consistent video from a single photo

Tencent has unveiled its AI-powered tool called HunyuanWorld-Voyager, which can transform a single image into a directional, 3D-consistent video—providing the thrill of exploration without the need fo...

7 Sep 2025 · 46 min

GLM-4.5: The Next Generation of Artificial Intelligence That Thinks and Acts

Z.ai introduces its latest flagship models, the GLM-4.5 and GLM-4.5-Air, which take the capabilities of intelligent assistants to a new level. These models uniquely combine deep analytics, master-leve...

7 Sep 2025 · 35 min

Gemini 2.5 Flash Image: Advanced AI Generation and Editing

Gemini 2.5 Flash Image, also known as Nano Banana, is an advanced, multimodal image creation and editing model that can interpret both text and image commands, allowing users to create, edit, and iter...

4 Sep 2025 · 49 min

Qwen-Image image generation model: complex text display and precise image editing

Qwen-Image is a basic image generation model developed by Alibaba's Qwen team. It has two outstanding capabilities: complex text rendering and precise image editing. Qwen-Image can render text, even lo...

3 Sep 2025 · 39 min
