Analysing the impact of images and text for predicting human creativity through encoders

No hay miniatura disponible
Fecha
2025
Título de la revista
ISSN de la revista
Título del volumen
Editor
Science and Technology Publications, Lda
google-scholar
Resumen
This study explores the application of multimodal machine learning techniques to evaluate the originality and complexity of drawings. Traditional approaches in creativity assessment have primarily focused on visual analysis, often neglecting the potential insights derived from accompanying textual descriptions. The research assesses four target features: drawings’ originality, flexibility and elaboration level, and titles’ creativity, all labelled by expert psychologists. The research compares different image encoding and text embeddings to examine the effectiveness and impact of individual and combined modalities. The results indicate that incorporating textual information enhances the predictive accuracy for all features, suggesting that text provides valuable contextual insights that images alone may overlook. This work demonstrates the importance of a multimodal approach in creativity assessment, paving the way for more comprehensive and nuanced evaluations of artistic expression.
Palabras clave
Artistic expression
Creativity assessment
EEG
Machine learning
Originality evaluation
Text and image analysis
Descripción
Ponencia presentada en la 11th International Conference on Information and Communication Technologies for Ageing Well and e-Health, celebrada en Porto, entre el 6 y el 8 de abril de 2025
Materias
Cita
Pikatza-Huerga, A., Matanzas de Luis, P., Uribe, M. F.-D.-R., Lasa, J. P., Zulaika, U., & Almeida, A. (2025). Analysing the Impact of Images and Text for Predicting Human Creativity Through Encoders. International Conference on Information and Communication Technologies for Ageing Well and e-Health, ICT4AWE - Proceedings, 15-24. https://doi.org/10.5220/0013203600003938
Colecciones