Ctrl+Alt+Future3 Syys 2025

Qwen-Image-Edit: Image editing with artificial intelligence. No need for Photoshop anymore?

Today, we will look at an AI model that simplifies image editing: Qwen-Image-Edit. This model builds on the foundation of the original, high-performance Qwen-Image, and brings amazing capabilities in the areas of text rendering and precise image editing.

Qwen-Image-Edit’s capabilities and benefits in brief:

- This model stands out for its ability to precisely edit texts within images, both in a bilingual (Chinese and English) environment. This includes directly adding, deleting, and modifying text while preserving the original text size, font, and style. For example, it can make corrections in calligraphy or modify even the smallest text elements on posters.

- It allows you to modify the content of the image while maintaining the original visual semantics and consistency. This includes creating IP (intellectual property) content (e.g., modifying a mascot to have different personalities), rotating objects (even 90 or 180 degrees to see the back), and style transformation (e.g., transforming a portrait into a Studio Ghibli style).

- Precision Detail Editing: This feature focuses on leaving certain regions of the image completely unchanged while adding, removing, or modifying specific elements. Examples include adding a sign and generating an associated reflection, removing small objects or hair, changing the color of a specific font, or modifying a person's clothing and background.

- Step-by-step editing (chained approach): Qwen-Image-Edit allows users to progressively correct errors in images, such as calligraphy. This means that bounding boxes can be used to mark areas to be corrected and modifications can be made iteratively until the desired result is achieved.

What makes it better than others?

- It not only generates or edits images, but also understands them, making it a comprehensive base model for intelligent visual creation and manipulation, where language, layout and images converge.

- Open source ecosystem. The model is natively supported in ComfyUI and is also available on the HuggingFace and ModelScope platforms, making it widely accessible to developers and users. Optimizations such as low GPU memory requirements, FP8 quantization and acceleration methods further increase its accessibility and efficiency.

Links

Blog: https://qwenlm.github.io/blog/qwen-image-edit/GitHub: https://github.com/QwenLM/Qwen-ImageSystem prompt: https://huggingface.co/spaces/Qwen/Qwen-Image-Edit/blob/main/app.pyHugging Face: https://huggingface.co/Qwen/Qwen-Image-EditHF Demo: https://huggingface.co/spaces/Qwen/Qwen-Image-EditQwen Chat: https://chat.qwen.ai/Qwen-Image-Edit ComfyUI Native Support: https://blog.comfy.org/p/qwen-image-edit-comfyui-supportQwen-Image-Edit ComfyUI Native Workflow Example: https://docs.comfy.org/tutorials/image/qwen/qwen-image-editLenovo UltraReal: https://civitai.com/models/1662740/lenovo-ultrareal?modelVersionId=2106185Realism: https://huggingface.co/flymy-ai/qwen-image-realism-lora

Kokeile Premiumia

Nauti 14 päivää ilmaiseksi

Tilaa Premium

Jaksot(15)

OpenAI gpt-oss: OpenAI's latest development in open source AI models

We’d like to introduce OpenAI’s latest development in open source AI models: the gpt-oss series. These two open-weight language models, gpt-oss-120b and gpt-oss-20b, have been tested by OpenAI to deli...

3 Syys 202551min

ByteDance Seed-OSS-36B, a large language model specifically for long context understanding and reasoning

Seed-OSS is a set of open-source large-scale language models developed by ByteDance Seed Team, designed to provide powerful capabilities in long-context understanding, reasoning, and agentic tasks. It...

3 Syys 202539min

Microsoft VibeVoice is excellent for creating podcasts, even by cloning our own voice

VibeVoice is a novel framework designed to generate expressive, emotional, and lifelike long-form, multi-actor audio, such as podcasts, from text. The model aims to solve the significant challenges of...

3 Syys 202540min

Deep Cogito - Cogito v2: Free model. Using a unique, iterative self-learning method (IDA)

According to developer Deep Cogito, Cogito v2 is one of the world’s most powerful open-source AI models, available in sizes ranging from 70B to 671B parameters. Thanks to its unique, iterative self-le...

3 Syys 202547min

Mastering Prompt Tricks with Large Language Models

In this episode, we dive deep into the art of crafting effective prompts for large language models. Join our hosts as they explore essential techniques to optimize outputs, enhance creativity, and imp...

26 Syys 202410min

AI in Enterprise

The rapid development of AI has outpaced the ability of many organisations to adapt1. This discrepancy presents both challenges and opportunities. While there is growing pressure to utilize AI for its...

13 Syys 20244min

Kaikki yhdessä sovelluksessa

Kuuntele kaikki suosikkipodcastisi ja -äänikirjasi yhdessä paikassa.

Sinulle valikoitua sisältöä

Podme-sovelluksessa kokoat suosikkisi helposti omaan kirjastoosi. Saat meiltä myös kuuntelusuosituksia!

Jatka kuuntelua koska tahansa

Voit jatkaa siitä mihin jäit, myös offline-tilassa.

Premium

9,99 €/kk

Kaikki premium-podcastit
Ei mainoksia
Ei sitoutumista, peruuta koska tahansa

Aloita 14 päivän kokeilu

Premium

13,99 €/kk

Kaikki premium-podcastit
Ei mainoksia
Ei sitoutumista, peruuta koska tahansa
Yksi lisäkäyttäjä

Kokeile 14 päivää maksutta

Tarinat ja äänet, joita rakastat kuunnella

Kuuntele kaikki suosikkipodcastisi ja -äänikirjasi

Lue lisää