Please use this identifier to cite or link to this item:
https://www.um.edu.mt/library/oar/handle/123456789/121431
Title: | MAP-elites with transverse assessment for multimodal problems in creative domains |
Authors: | Zammit, Marvin Liapis, Antonios Yannakakis, Georgios N. |
Keywords: | Robotics Evolutionary robotics Genetic programming (Computer science) Artificial intelligence Computational intelligence Algorithms Generative programming (Computer science) |
Issue Date: | 2024 |
Publisher: | Springer |
Citation: | Zammit, M., Liapis, A., & Yannakakis, G. N. (2024). MAP-elites with transverse assessment for multimodal problems in creative domains. International Conference on Computational Intelligence in Music, Sound, Art and Design (EvoMusArt). Aberystwyth, Wales, UK |
Abstract: | The recent advances in language-based generative models have paved the way for the orchestration of multiple generators of different artefact types (text, image, audio, etc.) into one system. Presently, many open-source pre-trained models combine text with other modalities, thus enabling shared vector embeddings to be compared across different generators. Within this context we propose a novel approach to handle multimodal creative tasks using Quality Diversity evolution. Our contribution is a variation of the MAP-Elites algorithm, MAP-Elites with Transverse Assessment (MEliTA), which is tailored for multimodal creative tasks and leverages deep learned models that assess coherence across modalities. MEliTA decouples the artefacts’ modalities and promotes cross-pollination between elites. As a test bed for this algorithm, we generate text descriptions and cover images for a hypothetical video game and assign each artefact a unique modality-specific behavioural characteristic. Results indicate that MEliTA can improve text-to-image mappings within the solution space, compared to a baseline MAP-Elites algorithm that strictly treats each image-text pair as one solution. Our approach represents a significant step forward in multimodal bottom-up orchestration and lays the groundwork for more complex systems coordinating multimodal creative agents in the future. |
URI: | https://www.um.edu.mt/library/oar/handle/123456789/121431 |
Appears in Collections: | Scholarly Works - InsDG |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
map_elites_with_transverse_assessment_for_multimodal_problems_in_creative_domains.pdf | 9 MB | Adobe PDF | View/Open |
Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.