CRYPTOREPORTCLUB
  • Crypto news
  • AI
  • Technologies
Friday, June 20, 2025
No Result
View All Result
CRYPTOREPORTCLUB
  • Crypto news
  • AI
  • Technologies
No Result
View All Result
CRYPTOREPORTCLUB

AI image models gain creative edge by amplifying low-frequency features

June 20, 2025
157
0

June 20, 2025

The GIST AI image models gain creative edge by amplifying low-frequency features

Related Post

Bilinear sequence regression model shows why AI excels at learning from word sequences

Bilinear sequence regression model shows why AI excels at learning from word sequences

June 20, 2025
All-topographic neural networks more closely mimic the human visual system

All-topographic neural networks more closely mimic the human visual system

June 20, 2025
Gaby Clark

scientific editor

Robert Egan

associate editor

Editors' notes

This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:

fact-checked

preprint

trusted source

proofread

AI image models gain creative edge by amplifying low-frequency features
Original vs C3 (Ours). Compared to the original diffusion models, Our C3 consistently generates more creative images with no added computational cost. Credit: arXiv (2025). DOI: 10.48550/arxiv.2503.23538

Recently, text-based image generation models can automatically create high-resolution, high-quality images solely from natural language descriptions. However, when a typical example like the Stable Diffusion model is given the text "creative," its ability to generate truly creative images remains limited.

KAIST researchers have developed a technology that can enhance the creativity of text-based image generation models such as Stable Diffusion without additional training, allowing AI to draw creative chair designs that are far from ordinary.

Professor Jaesik Choi's research team at KAIST Kim Jaechul Graduate School of AI, in collaboration with NAVER AI Lab, developed this technology to enhance the creative generation of AI generative models without the need for additional training. The work is published on the arXiv preprint server the code is available on GitHub.

Professor Choi's research team developed a technology to enhance creative generation by amplifying the internal feature maps of text-based image generation models. They also discovered that shallow blocks within the model play a crucial role in creative generation. They confirmed that amplifying values in the high-frequency region after converting feature maps to the frequency domain can lead to noise or fragmented color patterns.

Accordingly, the research team demonstrated that amplifying the low-frequency region of shallow blocks can effectively enhance creative generation.

News at KAIST
Overview of the methodology researched by the development team. After converting the internal feature map of a pre-trained generative model into the frequency domain through Fast Fourier Transform, the low-frequency region of the feature map is amplified, then re-transformed into the feature space via Inverse Fast Fourier Transform to generate an image. Credit: The Korea Advanced Institute of Science and Technology (KAIST)

Considering originality and usefulness as two key elements defining creativity, the research team proposed an algorithm that automatically selects the optimal amplification value for each block within the generative model.

Through the developed algorithm, appropriate amplification of the internal feature maps of a pre-trained Stable Diffusion model was able to enhance creative generation without additional classification data or training.

The research team quantitatively proved, using various metrics, that their developed algorithm can generate images that are more novel than those from existing models, without significantly compromising utility.

In particular, they confirmed an increase in image diversity by mitigating the mode collapse problem that occurs in the SDXL-Turbo model, which was developed to significantly improve the image generation speed of the Stable Diffusion XL (SDXL) model. Furthermore, user studies showed that human evaluation also confirmed a significant improvement in novelty relative to utility compared to existing methods.

News at KAIST
Application examples of the methodology researched by the development team. Various Stable Diffusion models generate novel images compared to existing generations while maintaining the meaning of the generated object. Credit: The Korea Advanced Institute of Science and Technology (KAIST)

Jiyeon Han and Dahee Kwon, Ph.D. candidates at KAIST and co-first authors of the paper, stated, "This is the first methodology to enhance the creative generation of generative models without new training or fine-tuning. We have shown that the latent creativity within trained AI generative models can be enhanced through feature map manipulation."

They added, "This research makes it easy to generate creative images using only text from existing trained models. It is expected to provide new inspiration in various fields, such as creative product design, and contribute to the practical and useful application of AI models in the creative ecosystem."

This research, co-authored by Jiyeon Han and Dahee Kwon, Ph.D. candidates at KAIST Kim Jaechul Graduate School of AI, was presented on June 16 at the International Conference on Computer Vision and Pattern Recognition (CVPR), an international academic conference.

More information: Jiyeon Han et al, Enhancing Creative Generation on Stable Diffusion-based Models, arXiv (2025). DOI: 10.48550/arxiv.2503.23538

Journal information: arXiv Provided by The Korea Advanced Institute of Science and Technology (KAIST) Citation: AI image models gain creative edge by amplifying low-frequency features (2025, June 20) retrieved 20 June 2025 from https://techxplore.com/news/2025-06-ai-image-gain-creative-edge.html This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Amuse, a songwriting AI collaborator for music composers 0 shares

Feedback to editors

Share212Tweet133ShareShare27ShareSend

Related Posts

Bilinear sequence regression model shows why AI excels at learning from word sequences
AI

Bilinear sequence regression model shows why AI excels at learning from word sequences

June 20, 2025
0

June 20, 2025 The GIST Bilinear sequence regression model shows why AI excels at learning from word sequences Lisa Lock scientific editor Robert Egan associate editor Editors' notes This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the...

Read moreDetails
All-topographic neural networks more closely mimic the human visual system

All-topographic neural networks more closely mimic the human visual system

June 20, 2025
In an era where empathy feels unfamiliar, AI now translates emotions

In an era where empathy feels unfamiliar, AI now translates emotions

June 19, 2025
Jamming with AI: Jazz trio plays live with AI-generated sound

Jamming with AI: Jazz trio plays live with AI-generated sound

June 19, 2025
Hyper-realistic AI technology creates avatars from a single photo

Hyper-realistic AI technology creates avatars from a single photo

June 19, 2025
Researchers are teaching AI to see more like humans

Researchers are teaching AI to see more like humans

June 19, 2025
New test can help driverless cars make ‘moral’ decisions

New test can help driverless cars make ‘moral’ decisions

June 19, 2025

Recent News

Meta tells the Oversight Board it isn’t removing the word ‘transgenderism’ from its hate speech rules

Meta tells the Oversight Board it isn’t removing the word ‘transgenderism’ from its hate speech rules

June 20, 2025
Snap is acquiring Saturn, a calendar app used at thousands of high schools

Snap is acquiring Saturn, a calendar app used at thousands of high schools

June 20, 2025

Myriad Moves: Will Bitcoin Set a New All-Time High? Plus Strategy and PENGU Predictions

June 20, 2025
Bilinear sequence regression model shows why AI excels at learning from word sequences

Bilinear sequence regression model shows why AI excels at learning from word sequences

June 20, 2025

TOP News

  • Shiba Inu Price Prediction Today

    617 shares
    Share 247 Tweet 154
  • North Korean Hackers Pose as South Korean Government Officials to Steal Crypto

    596 shares
    Share 238 Tweet 149
  • The best Android phones for 2023

    567 shares
    Share 227 Tweet 142
  • Multilingual and open source: OpenGPT-X research project releases large language model

    573 shares
    Share 229 Tweet 143
  • The Biggest Data Breaches of 2023

    580 shares
    Share 232 Tweet 145
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms of Use
Advertising: digestmediaholding@gmail.com

Disclaimer: Information found on cryptoreportclub.com is those of writers quoted. It does not represent the opinions of cryptoreportclub.com on whether to sell, buy or hold any investments. You are advised to conduct your own research before making any investment decisions. Use provided information at your own risk.
cryptoreportclub.com covers fintech, blockchain and Bitcoin bringing you the latest crypto news and analyses on the future of money.

© 2023-2025 Cryptoreportclub. All Rights Reserved

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Crypto news
  • AI
  • Technologies

Disclaimer: Information found on cryptoreportclub.com is those of writers quoted. It does not represent the opinions of cryptoreportclub.com on whether to sell, buy or hold any investments. You are advised to conduct your own research before making any investment decisions. Use provided information at your own risk.
cryptoreportclub.com covers fintech, blockchain and Bitcoin bringing you the latest crypto news and analyses on the future of money.

© 2023-2025 Cryptoreportclub. All Rights Reserved