AI image generators get a new safety test for hidden toxic text in memes

Generative AI models can be prompted with just a few words to insert offensive or discriminatory text messages into images. Aditya Kumar from the SPRINT-ML Lab at the CISPA Helmholtz Center for Information Security is investigating how such outputs can be reliably prevented. To address this, he developed ToxicBench, a test dataset that evaluates how well image-generating AI systems handle offensive inputs. He also created a fine-tuning strategy to adapt the models accordingly.

AI image generators get a new safety test for hidden toxic text in memes

By cryptoadmin

You Missed

Cardano analytics platform TapTools to close within two weeks

The Star Fox remake is a test for the franchise’s future

Bitcoin’s slide to $67,000 is accelerating a shift into digital dollars

Trump signs scaled-back AI cybersecurity order

Categories

AI image generators get a new safety test for hidden toxic text in memes

By cryptoadmin

Related Post

AI brings object-level vision prosthetics closer to reality

Microsoft unveils AI models in push for independence from OpenAI

Anthropic expands access to powerful Mythos AI model

You Missed

Cardano analytics platform TapTools to close within two weeks

The Star Fox remake is a test for the franchise’s future

Bitcoin’s slide to $67,000 is accelerating a shift into digital dollars

Trump signs scaled-back AI cybersecurity order