Forthcoming Articles
International Journal of Arts and Technology

Forthcoming articles have been peer-reviewed and accepted for publication but are pending final changes, are not yet published and may not appear here in their final order of publication until they are assigned to issues. Therefore, the content conforms to our standards but the presentation (e.g. typesetting and proof-reading) is not necessarily up to the Inderscience standard. Additionally, titles, authors, abstracts and keywords may change before publication. Articles will not be published until the final proofs are validated by their authors.
Forthcoming articles must be purchased for the purposes of research, teaching and private study only. These articles can be cited using the expression "in press". For example: Smith, J. (in press). Article Title. Journal Title.
Online First articles are also listed here. Online First articles are fully citeable, complete with a DOI. They can be cited, read, and downloaded. Online First articles are published as Open Access (OA) articles to make the latest research available as early as possible.
International Journal of Arts and Technology (58 papers in press) Regular Issues
Abstract: In response to the problems of insufficient personalised feedback and poor real-time performance in traditional music teaching, this paper proposes an interactive teaching mode based on wireless sensor networks and multimodal data fusion. Multiple sensors are deployed to collect real-time data on students' performance movements, audio, and physiological states; the data are fused and denoised with the Gaussian Bayesian algorithm before being uploaded to the cloud platform. Then, the weighted matrix factorisation algorithm is used to generate personalised error correction content and push it out. Experiments have shown that this mode reduces sensor data transmission by 78.8%, reduces energy consumption to 0.33 J, and achieves a push hit rate of 95.2%, forming an efficient interactive loop and providing precise solutions for the digitisation of music teaching. Keywords: Wireless Sensor Network; Multimodal; Data Fusion; Music Interactive Teaching; Gaussian-Bayesian; Personalized Push. DOI: 10.1504/IJART.2026.10077421
Abstract: To tackle the declining precision of sentiment analysis for online educational website reviews via the LDA topic model, I propose the innovative TWBEWC-TFWW-LDA algorithm, integrating emotion word co-occurrence-based theme word bags (TWBEWC) and topic feature word weighting (TFWW). It constructs emotional topic word bags, extracts sentiment-laden topic words via semantic similarity, weights them by significance and distribution, and performs LDA clustering. Experiments show that with 15 emotion topic feature words, its text clustering accuracy, recall and F1 reach 0.812, 0.802 and 0.810 respectively; it also achieves 88%, 96% and 90% accuracy in classifying aversion, surprise and neutrality. This enhanced accuracy refines online education sentiment analysis for college students, optimizing course design and teaching methods. Keywords: Online education; LDA topic model; sentiment classification; sentiment word co-occurrence; feature word weighting. DOI: 10.1504/IJART.2026.10077461
Abstract: This paper proposes a generative adversarial network (GAN) with a shared latent space (sLS-GAN) to improve controllability and cultural adaptability in art style transfer. By integrating a variational autoencoder (VAE) with adversarial learning, it constructs a shared latent space that enables high-quality bidirectional translation between greyscale and colour image domains. Latent-space alignment improves realism and semantic coherence, while cycle-consistency regularises forward-backward mappings. A dual up-sampling/dual down-sampling design enhances structural stability across domains. In addition, a residual saliency network strengthens salient-region modelling and improves efficiency, reducing reliance on explicit content-preservation constraints. Experiments on the WikiArt Paintings and SemArt datasets show that sLS-GAN achieves an FID of 106.45 on WikiArt and outperforms representative baselines in Inception Score and PSNR, indicating improved semantic consistency, diversity, and perceptual quality. In greyscale colourisation, sLS-GAN reduces parameters by 89.5% and FLOPs by 87.8% versus conventional models, delivering substantial computational savings. Keywords: Generative Adversarial Network; Grayscale–Color Image Translation; Bidirectional Variational Autoencoder; Cycle Consistency; Residual Saliency Network. DOI: 10.1504/IJART.2026.10077967
Abstract: With the continuous integration of information technology and education and teaching, as well as the gradual increase in demand for music experiences in cultural tourism, intelligent teaching has become an important way to enhance learning experiences and outcomes. To further enhance the personalization level of music appreciation teaching, this study proposes a teaching recommendation system that better aligns with user needs by combining the Kano model with the Collaborative Filtering (CF) algorithm. In the specific research methodology, the Kano questionnaire is first adopted to identify and quantify the must-be, one-dimensional, and attractive requirements in teaching. Then, an improved CF algorithm is proposed, which integrates the weights of the above-mentioned requirements into user similarity calculation. The study demonstrates that this method can effectively improve recommendation accuracy and user satisfaction and is also easy to use for beginners. Keywords: Cultural tourism; Kano model; Collaborative filtering; Music appreciation teaching; Personalized recommendation; User satisfaction. DOI: 10.1504/IJART.2026.10078284
Abstract: Vocal music is an important expressive form with auditory appeal, emotional transmission, and cultural interpretation; it plays an increasingly important role in tourism performance, the activation and dissemination of intangible cultural heritage, local festival exhibitions, and the construction of immersive cultural spaces. Focusing on the practical problems of college vocal music teaching (VMT) in terms of competency structure, training mode, and cognitive support mechanism, this work starts from the talent demand of cultural tourism; it sorts out the current situation of college VMT and the necessity of its diversified reform. From the perspective of Supply Chain Management, the operational logic of the college VMT system is analyzed. The demands of performing positions, festival events, and scenic spot narration in the cultural tourism market are integrated into the closed-loop talent cultivation system. Keywords: cultural tourism; diversified teaching; vocal music training; SCM; CLT; rational cognition of the brain; supply chain management. DOI: 10.1504/IJART.2026.10078285
Abstract: This study aims to design an algorithm for an intelligent generation system that integrates the visual style of Miao silver jewelry patterns into advertising images, enhancing the visual appeal and cultural expression of advertisements. The study targets general graphic advertising design as well as cultural tourism visual scenarios, such as the dissemination of ethnic culture, the display of tourist destination images, and the promotion of cultural and creative industries. The digital extraction and intelligent generation of Miao silver jewelry patterns can help enhance the cultural recognition and dissemination appeal of promotional images. The constructed method can provide technical support for the transformation of Miao cultural elements in cultural tourism posters, tourism promotional materials, and cultural and creative promotional visuals. Keywords: Miao silver jewelry pattern; Intelligent generation system; Deep learning model; Advertising image; Cultural tourism. DOI: 10.1504/IJART.2026.10078286
Abstract: Digital music education has shown great potential in the wave of digital transformation of the cultural tourism industry, providing an opportunity for the deep integration of traditional music teaching and cultural tourism experience. The current online platforms lack attention to individual differences, making it difficult to meet the fragmented and personalized learning needs of tourists or study groups in the cultural and tourism scene. This study aims to optimize the teaching process using Artificial Intelligence (AI) neural network technology. The study proposes the concept framework of Dynamic Maintenance of Optimal Learning Zone (DM-OLZ) for cultural and tourism research and learning scenarios. An intelligent teaching assistant system with perceptual ability is constructed by integrating Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) networks. Experimental data confirms that in the practical scenario of cultural tourism integration, the system has a significant effect on enhancing learners' interest and skill mastery. Keywords: Artificial intelligence; Neural network; CNN; LSTM; Digital music teaching; Cultural tourism. DOI: 10.1504/IJART.2026.10078287
Abstract: This paper aims to discuss the strategic research on the professional growth of broadcasting and hosting art based on the Seasonal Autoregressive Integrated Moving Average Model - Backpropagation neural network (SARIMA-BP) model. Firstly, the problems of employment pressure, lack of practical experience, and insufficient educational resources in the current broadcasting and hosting art profession are studied. The professional growth strategy of broadcasting and hosting art supported by the online security network platform is proposed. Secondly, through the analysis of SARIMA-BP model, a prediction method based on SARIMA-BP model is established to predict the growth trend of the profession. The values are small, indicating that the SARIMA-BP model has high prediction accuracy for the future development trend of broadcasting and hosting art. Keywords: SARIMA-BP model; broadcasting and hosting art profession; online secure web platform; development strategy; employment rate. DOI: 10.1504/IJART.2026.10078288
Abstract: To enhance the accuracy and effectiveness of international communication of Chinese Red Classics, this study first proposes an intelligent application framework comprising three layers: cultural semantic encoding, multimodal deep fusion, and intelligent application. A specific fusion model is then developed based on this framework. The model uses an optimized Visual Geometry Group-19 (VGG-19) network as a visual feature extractor and incorporates an attention mechanism to improve the extraction of cultural symbols. In addition, the model includes multimodal adaptive interaction and multi-task decoding functions. Overall, the results confirm that the proposed method improves machines' understanding of the content and structure of Red Classics. This study provides a practical tool for digital interpretation and intelligent international dissemination of Red Classics. Keywords: International communication of Red Classics; multimodal information fusion; VGG network; cross-cultural understanding; deep learning. DOI: 10.1504/IJART.2026.10078289
Abstract: The research results show that the online music teaching system assisted by AI can keep the response speed within 4 seconds between 200 and 800 users and achieve the expected goal through performance test and user feedback analysis. Students' feedback shows that more than 90% are satisfied with the improved learning effect and online teaching method. Therefore, the improved system has significantly improved students' learning effect and satisfaction, and it is suggested to continue to optimize the system experience and performance in the future to further improve the teaching quality and user satisfaction. This study provides empirical support for the development of online music education technology against the background of culture-tourism integration, and offers specific suggestions for optimizing the teaching model of music culture communication in cultural tourism scenarios. Keywords: Artificial intelligence; Online music teaching; Teaching system; Teaching system improvement; cultural tourism. DOI: 10.1504/IJART.2026.10078291
Abstract: This work investigates the transformative impact of integrating Virtual Reality (VR) technology into piano music appreciation instruction within cultural tourism modules. A trinity model comprising the flipped classroom, situational teaching, and VR experience is constructed for this work. Flipped classrooms facilitate pre-class knowledge acquisition, while situational teaching establishes practical tasks related to cultural tourism. Additionally, VR technology is employed to reconstruct historical scenic sites for an immersive learning experience. The experiment involves 150 undergraduates majoring in Musicology, who are divided into five groups of 30 participants each. This performance is observed in knowledge mastery, learning interest, and depth of understanding, significantly outperforming traditional modes (p < 0.001). This work demonstrates that this model significantly enhances students' musical aesthetics. Furthermore, it enhances the cultural interpretation potential of students as future promotion ambassadors for Culture and Tourism. These results provide new insights into the digital integration of art, culture tourism and education. Keywords: Piano Music Appreciation Teaching; Teaching Innovation; Virtual Reality Technology; Situational Teaching Method; Flipped Classroom; Cultural Tourism. DOI: 10.1504/IJART.2026.10078681
Abstract: This study explores the reconstruction and singing simulation of male roles in national opera using artificial neural networks (ANN). Against the practical background of the continuous integrated development of cultural tourism, national opera is no longer confined to the traditional theatrical communication space, and its presentation modes are gradually extending to tourism performances, digital exhibition halls, immersive cultural-tourism scenarios, and local cultural experience spaces. As an important carrier for the narrative progression, vocal styling, and stage temperament expression of national opera, male characters not only carry distinct artistic conventions but also constitute a crucial gateway for tourists to recognize the cultural images of local operas. Therefore, the proposed PiGAN-SwT model can improve the accuracy of acoustic feature reconstruction for male characters in national opera, and provide technical support for digital opera exhibitions, the reproduction of cultural-tourism performances, and the dissemination of local culture in cultural tourism scenarios. Keywords: National opera; Male roles; Cultural tourism; Virtual reconstruction; Timbre recognition; Generative adversarial network; Deep learning. DOI: 10.1504/IJART.2026.10078683
Abstract: This study aims to improve the emotion recognition accuracy in red classic literature and promote the effect of visual dissemination. To this end, an emotion recognition model is constructed based on Bidirectional Encoder Representations from Transformers (BERT), Bidirectional Long Short-Term Memory (BiLSTM), and SentiWordNet (SWN). This model combines BERT's deep semantic representation, BiLSTM's bidirectional time series processing capabilities, and SentiWordNet's emotion enhancement module. Comparative experiments show that the accuracy of the constructed BERT-BiLSTM-SWN model reaches 95.19%, about 5% higher than other model algorithms. Therefore, the proposed algorithm shows excellent performance in the emotion recognition of red classic literature, and can more accurately capture the emotional features of the text, providing a solid technical foundation for visual dissemination, and further promoting the development of literature digitization and sentiment analysis technology. Keywords: Long short-term memory network; Red classic literature; Visual dissemination; Deep learning; BiLSTM. DOI: 10.1504/IJART.2026.10078827
Abstract: This study systematically investigates the mechanisms through which patriotic-themed films shape audience emotion and cognition within the context of culture tourism integration, providing highly relevant scientific evidence for the field of patriotic education. On this basis, the study discusses the possibility of how internet of bodies (IoB) technology and social interaction can amplify these emotional effects on the network platform. This study reveals the neural mechanism of the influence of main melody films on the audience's patriotic feelings through neuroscience methods, and establishes the connection with IoB and generative artificial intelligence. The findings provide scientific support for film production and the development of culture tourism scenarios for patriotic education within the framework of culture tourism integration. Keywords: neuroscience; patriotic-themed films; film audiences; patriotic sentiment; culture tourism integration. DOI: 10.1504/IJART.2026.10078828
Abstract: The purpose is to improve the identification of ethnic minorities with Chinese traditional culture and promote the development of Chinese traditional culture inheritance. Educational strategies are optimized from deep learning (DL) to strengthen the traditional cultural identity education of ethnic minorities. Based on this, firstly, according to the needs and interests of ethnic minority college students, the DL algorithm is used in traditional cultural identity education to establish the relationship between various disciplines. Finally, identity education is conducted in daily life to change the traditional didactic education means of passive learning to stimulate college students to learn and acquire traditional cultural knowledge actively. The results show that integrating DL into the traditional cultural identity education of ethnic minority college students can significantly stimulate students' interest in learning and improve their autonomous learning ability. Keywords: new era; ethnic minority college students; traditional cultural identity education; intelligent multimedia technology; deep learning. DOI: 10.1504/IJART.2026.10078829
Abstract: In the context of artificial intelligence-driven cultural tourism and sports services, this study explores the intelligent recommendation methods for the integration of urban and rural cultural tourism and sports services. The purpose is to optimize the efficient matching of user demands and cultural tourism and sports resources between urban and rural areas. A recommendation model based on deep learning (DL) is proposed, integrating the Graph Attention Network, Bidirectional Long Short-Term Memory for dynamic interest modeling, and the Dual Duel Deep Q Network. This study confirms the remarkable advantages of DL technology in intelligent recommendation in the context of cultural tourism and sports services; it provides a scientific solution for optimizing the allocation of cultural tourism and sports services resources and improves service accuracy, thus offering important inspirations for the intelligent evolution of smart public services. Keywords: Urban and rural cultural tourism and sports service; Intelligent recommendation; Deep learning; Graph Attention Network; Dual duel deep Q network. DOI: 10.1504/IJART.2026.10078830
Abstract: This paper discusses the systematic reconstruction of cultural tourism music education in colleges and universities under the background of informatization. In view of the inherent subjectivity and aesthetic uncertainty of music art, this paper applies fuzzy theory to optimise teaching methods and constructs a teaching framework that accommodates multi-dimensional interpretation to meet the complex artistic communication needs in cultural tourism scenarios. Based on the actual circulation of teaching resources against the background of cultural tourism integration, this paper analyses the information asymmetry between textbooks and interdisciplinary practical equipment inventory, and proposes a supply chain management structure diagram based on information platforms to ensure the accurate supply of diversified teaching resources. Combined with the characteristics of the information age, this paper uses the questionnaire survey method to explore the penetration mechanism of multimedia Artificial Intelligence (AI) technology into the current situation of cultural tourism music education in colleges and universities. Keywords: fuzzy theory; artificial intelligence; college music education; supply chain management. DOI: 10.1504/IJART.2026.10078833
Abstract: Against the backdrop of ongoing cultural and tourism integration and the accelerated restructuring of creative content production, talent cultivation and innovation-entrepreneurship education in university animation programs face new structural demands. Traditional course evaluation methods can no longer accurately reflect the alignment among course content, practical components, and industry needs. In response, China has introduced the strategic initiative of "mass entrepreneurship and innovation" and implemented a series of policies to support student entrepreneurship. This study aims to analyze the structural relationship between entrepreneurship course content and teaching objectives, focusing on the creative content production demands driven by the cultural and tourism industry. Utilizing the analytic hierarchy process (AHP) and fuzzy comprehensive evaluation (FCE), this study proposes an innovation and entrepreneurship education (IEE) evaluation system. By constructing a multi-level evaluation model that integrates both quantitative and qualitative data, this system provides a comprehensive assessment of entrepreneurship education for college animation students. Keywords: Animation majors; Fuzzy comprehensive evaluation; Hierarchical analysis; Innovation and entrepreneurship; Evaluation system; Entrepreneurship courses. DOI: 10.1504/IJART.2026.10078872
Abstract: With the deepening of global cultural exchange, translation quality has become increasingly important in cultural communication. Traditional machine translation methods often struggle to meet expectations, especially when dealing with multimodal content such as video subtitles, audio narration, and text embedded in images related to Heitu culture. These conventional approaches frequently fail to handle long-range dependencies and complex semantic structures. This study leverages multimedia artificial intelligence to combine the sequence modeling capabilities of Long Short-Term Memory (LSTM) networks with the global information capture of Transformer models. This integration improves the handling of long-range dependencies and semantic consistency across heterogeneous data sources. When applied to complex multimedia materials related to Heitu culture, the proposed method demonstrates significantly higher translation quality. Through the introduction of a reinforcement learning mechanism, the proposed model can dynamically adjust translation strategies when processing culturally specific terminology and emotional nuances, thereby enhancing cultural adaptability in translations. Keywords: Machine Translation; LSTM; Sequence-to-Sequence (Seq2Seq) Model; Semantic Coherence; Long-Distance Dependencies; multimedia artificial intelligence. DOI: 10.1504/IJART.2026.10078974
Abstract: This work focuses on optimizing intelligent composition technology in the context of the cultural tourism Internet of Things (IoT); it takes the optimization of the music generation algorithm as the core goal and improves the quality and innovation of music generation. Utilizing Python's Mido library, the main melody of Musical Instrument Digital Interface (MIDI) music files is extracted. The Skip-Gram model of Word2vec is then used to convert the note sequences into feature vectors. The performance of various models is compared, including Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Bi-directional Long Short-Term Memory (Bi-LSTM), Bidirectional Gated Recurrent Unit (Bi-GRU), Bi-LSTM+Attention, and Bi-GRU+Attention. The metrics used for comparison include note accuracy, Bilingual Evaluation Understudy (BLEU) score, creativity, melodic beauty, rhythmic smoothness, and auditory experience, aiming to demonstrate the effectiveness of the proposed model. Keywords: Internet of Things; Deep Neural Network; Intelligent Composition; Bi-LSTM; Note Accuracy; cultural tourism. DOI: 10.1504/IJART.2026.10078975
Application of Digital Image Media Technology in Film Animation
by Yan Liu, Feng Tang
Abstract: The development of media technology in the digital age has enabled many traditional artists to achieve new developments in their creations. The emergence of this expression method has greatly enriched the forms of artistic expression. This paper studies the application of digital image media technology in film and television animation. In the experimental part, it is applied to teaching, and experiments are conducted on several processes of animation production. The experimental results show that its application effect in the post-production link is most obvious. The scores of the experimental class are mainly concentrated in the 80-100 point range.
There are 20 students in Class A who have reached 90-100 points, 18 students in Class B who have reached 90-100 points, only five students in Class C who have reached 90-100 points, and only six students in Class D who have reached 90-100 points. It can be seen that the application of digital image media technology has effectively improved the production effect of animation. At the end of the paper, a brief summary of specific technologies and applications is given. Keywords: Film and Television Animation; Digital Image Technology; Post Production; Image Fusion Algorithm. DOI: 10.1504/IJART.2026.10074102
Multimedia Art Data Optimisation by Integrating UO-CRUSH and Q-learning Algorithm VR Technology
by Hongying Song, Xiaohong Wang
Abstract: A resource management framework based on virtual reality (VR) technology is proposed to address the limitations and poor presentation effects of traditional multimedia art data flat creation. It integrates the UO-CRUSH algorithm driven by resource interest and the Q-learning algorithm driven by user interest to optimise the storage and scheduling of multimedia art data. The experimental results show that the proposed model has an average computation time of 426 seconds, completes an average of 752 tasks in the maximum cycle, and has a uniform distribution of resource placement groups (137-153). In the instance verification, the classification accuracy is not less than 90%, the interactive response speed is 172 ms, and the frame rate of the picture reaches 78 fps, which combines good immersion and economy. This model effectively improves the optimisation effect of multimedia art data. Keywords: Multimedia art data; Virtual reality; Scalable replica hashing algorithm; Reinforcement learning algorithm; Resource management.
DOI: 10.1504/IJART.2026.10074165
Developing a Community Participation Strategy for the Preservation of Historic Buildings in Shanghai
by Lu Chen
Abstract: This study introduces a novel approach that is specifically designed to align with China's unique national circumstances, allowing for the active involvement of the community. We propose a "community participation framework" that incorporates distinctive local features, formulated through an extensive review and synthesis of both domestic and international literature, as well as drawing insights from effective community governance practices in China. The framework seeks to harmonise the interactions between community members, historic structures, and municipal authorities. Within this structure, these stakeholders can create a cohesive and self-sustaining system, mutually benefiting from the preservation efforts of historical architecture. To guarantee ongoing advancements in this initiative, oversight and management practices were implemented, utilising performance evaluation metrics to measure the protective measures' effectiveness. Our findings offer valuable insights for furthering the engagement of social forces in the conservation of historical buildings in China. Keywords: Community Engagement; Conservation of Historic Buildings; Layered Analysis; Framework Development. DOI: 10.1504/IJART.2026.10074166
Quantitative Evaluation of the Effectiveness of Preservation of Modern Urban Residential Buildings in Shanghai (1910-1949)
by Lu Chen
Abstract: This study examines modern residential buildings in Shanghai (1910-1949), analysing the dynamic relationships among architecture, inhabitants, the environment, and society. Addressing the urgent need to conserve Shanghai's historical structures, it establishes key evaluation metrics for preservation effectiveness.
Employing a quantitative approach validated by a BP neural network model, the analysis demonstrates that current conservation strategies fail to fully harness their potential to drive economic growth, enhance resident quality of life, foster environmental sustainability, or enrich urban cultural heritage. Recognising the intrinsic link between historical preservation and collective well-being, encompassing human, environmental, and societal dimensions, the research develops optimisation strategies to maximise conservation outcomes. The goal is to significantly elevate the impact of preservation efforts on these buildings, establishing a benchmark for scholarly research, policy formulation, and heritage management in Shanghai and analogous contexts. Keywords: Quantitative Assessment of Conservation Effectiveness; AHP; FCE; BP. DOI: 10.1504/IJART.2026.10074167
Image based Digital Processing Technology and System for Music Signals
by Huan Li
Abstract: Traditional music signal digital processing only focuses on the time-frequency characteristics of audio signals, ignoring the image features related to music signals, which leads to limitations in the overall understanding and representation of music signals. By comprehensively utilizing the characteristics of audio and image signals, a more comprehensive and accurate music signal processing method was proposed to comprehensively and accurately understand music emotions. The music emotion data from MediaEval Emotion in Music, MagnaTagATune Dataset, EmoReact Dataset, and DEAM Dataset were selected, preprocessed, and mapped to represent music emotions using Valence and Arousal. The Long Short-Term Memory (LSTM) - Residual Network (ResNet) model was constructed, with the LSTM module used to extract audio features and the ResNet module used to extract image features. In the fusion layer, the features extracted by the LSTM module and ResNet module were fused 1:1.
A music recommendation system was constructed based on user historical preferences and recognition results of music emotions. The experimental results showed that the average accuracy of LSTM-ResNet model in music emotion classification on DEAM Dataset was as high as 98.5%. The combination of LSTM and ResNet can enhance the performance of music emotion classification and provide new methods for music recommendation tasks. Keywords: Music Signals; Emotional Classification; Music Recommendations; Music Images; Long Short-Term Memory; Residual Network. DOI: 10.1504/IJART.2026.10074476
Classification and Recognition of Visual Communication Elements using Multimodal Fusion Affective Computing
by Yihan Yang, Xuehang Wu
Abstract: This paper aims to study a method for classifying and recognising visual communication elements based on multimodal fusion affective computing technology, in order to improve the accuracy of information transmission and the ability to express emotions. First, this paper utilises Python's Scrapy library to automatically collect, filter, and preprocess image and text data using open-source computer vision libraries and regular expressions. Then, this paper uses the ResNet model to extract image features and the Transformer-based bidirectional encoder representation (BERT) model to extract text features, and fuses them through an attention mechanism. Finally, this paper uses the support vector machine (SVM) algorithm to classify the features, thus completing the classification and recognition of visual communication elements. Experimental results show that the proposed model performs well in emotion classification and recognition tasks, with high accuracy and stability. Keywords: Multimodal Fusion Affective Computing; Visual Communication Element; Residual Network; Bidirectional Encoder Representations from Transformers; Attention Mechanism.
DOI: 10.1504/IJART.2026.10075026 Exploration on High Dynamic Dance Video Keyframe Extraction Based on Clustering Algorithm ![]() by Zhuoying Qi Abstract: The number of dance videos is increasing, and how to watch dance videos rapidly and efficiently is an issue that needs to be solved. Research on an efficient way to view the key information of a dance video is of great help to dance learning and dance posture analysis. Based on this, this paper studied the keyframe extraction of high dynamic dance video, and proposed a keyframe extraction method based on K-means clustering algorithm (KMA). Firstly, this paper proposed a method of shot segmentation by using histogram features for edge extraction, and then compared the similarity between video frames. Finally, the KMA was used to match the video frames with the nearest cluster, and the dance video keyframes were determined by evaluating the sum of the similarity between the cluster centre and all sample frames of the cluster. After putting forward the keyframe extraction method, this paper analysed its extraction effect, and drew the following conclusions through experimental research: compared with the extraction results of the traditional keyframe extraction method, the precision of the extraction results of the improved KMA was 6.5% higher, and the recall rate was 8.8% higher. The keyframe extraction method based on the improved KMA has good results. Keywords: Keyframe Extraction; High Dynamic Dance Video; Clustering Algorithm; K-means Clustering Algorithm. DOI: 10.1504/IJART.2026.10075220 Aesthetic Relevance of Generative Artificial Intelligence ![]() by Umberto Roncoroni Abstract: This article examines the impact of generative artificial intelligence (GAI) on contemporary art, creative practice, and education. Evaluating the benefits and drawbacks of GAI is difficult due to the accelerated development of technology and weak academic relationships among aesthetics and computer science. 
To elucidate GAI, we propose setting aside metaphysical dilemmas and concentrating on the aesthetic problems of AI such as romantic influences, technocentric approaches to creativity, black boxes, and misunderstanding about the properties of digital media. Through an approach that combines philosophical analysis, computer science, and art-based research, we compare the Dadaist Poem by Tzara with GAI to verify its coherence with contemporary art development. We found that GAI contradicts contemporary art innovations and the aesthetic potential of digital media. The results demonstrate why GAI will jeopardize the development of creative and significant art and proffer a review of theories, methods, and interdisciplinary references. Keywords: Aesthetics; Avant-gardes; Computational Creativity; Contemporary Art; Dadaism; Digital Media; Generative Artificial Intelligence; Interactivity; Postmodernism; Public Art. DOI: 10.1504/IJART.2026.10075314 Design of Three-Dimensional System of Computer Aided Dance Teaching Technology Management ![]() by Hongmei Li Abstract: Traditional dance teaching has problems such as single means, low efficiency, and difficulty in intelligent scheduling of teaching content. 
To this end, this paper designs a computer-aided three-dimensional dance teaching technology management system, constructs a three-layer system architecture with logic representation layer, business logic layer and database as the core, and integrates embedded communication protocols to achieve stable transmission and response of teaching data between multiple modules; in terms of action recognition algorithm, the system extracts multi-resolution features based on high-resolution network (HRNet), and performs convolution, interpolation and cascade operations between different resolutions through the sequence multi-scale feature fusion model to obtain highly semantic and accurately positioned joint point heat maps, and then constructs a geometric relationship estimation network with a three-layer structure, predicts the position relationship according to the trunk and limb joints, and matches the connection, completing the accurate modelling of dance movements. Experimental results show that the system has the highest accuracy of 95.3% in dance movement recognition tasks and 85.3% in posture estimation. This paper verifies the effectiveness and practical value of the technology management system that combines multi-module system design with multi-scale sequence fusion algorithm in dance teaching. Keywords: Dance Teaching; Technology Management; Embedded System; Feature Fusion Algorithm; Pose Estimation. DOI: 10.1504/IJART.2026.10075649 Modelling of Outdoor Building Facade Automated Image Decoration Design System Based on Point Cloud Semantic Segmentation ![]() by Jing Liu Abstract: This paper aims to solve the problems of insufficient accuracy of point cloud semantic segmentation and poor system scalability in the automated decoration design of building facades, and constructs a three-module system including point cloud semantic segmentation, semantic-driven image decoration design, and result mapping. 
This paper addresses heterogeneous complex facade components and boundary fuzziness by optimising point cloud semantic segmentation with multi-scale feature fusion and boundary refinement, achieving classification accuracy of 0.82–0.93, mIoU 0.87, and improved structural IoU 0.84 with 4.7-pixel keypoint error reduction. A semantic-constrained decoration module integrates style rules and geometric alignment, yielding style matching 8.0 and structural alignment 8.46. The system automates 138 tasks hourly with 92.54% completion and 7.76% manual intervention, demonstrating efficient structural-semantic-decoration integration. Keywords: Point Cloud Semantic Segmentation; Building Facade Modeling; Automated Image Decoration Design; Three-dimensional Structure Recognition; Multi-style Adaptability. DOI: 10.1504/IJART.2026.10075668 Two Dimensional Digital Art Animation Synthesis System Based on BP Neural Network ![]() by Zhouzhou Cheng, Xiao Xia Abstract: At present, 2D animation still faces problems such as difficulty in getting rid of the production methods mainly based on manual creation and low creation efficiency. This paper aims to use BP neural network to construct and study a two-dimensional digital art animation synthesis system, in order to improve the production efficiency of two-dimensional animation. This paper first introduces the structure, advantages, and algorithm principles of BP neural network, and then explains the technology of two-dimensional digital art animation production. After constructing a two-dimensional digital art animation synthesis system model, this paper performs performance and image processing of the synthesis system, checks and confirms the advantages and disadvantages of the model, and then provides a rough overview of the artificial system. The results showed that the maximum error of the BP neural network in the experiment was less than 9%, and the average error was reduced to 5%. 
Keywords: 2D Digital Art Animation; BP Neural Network; Poisson Equation; Bezier Curve. DOI: 10.1504/IJART.2026.10075881 Cross-Platform Music Recommendation Method and Innovation Path Based on the Internet of Things and Blockchain ![]() by Jin Ma, Yi Li Abstract: Traditional music recommendation systems have not yet implemented cross platform applications, and the improvement of recommendation effectiveness is constrained by the recommendation system itself. This paper aims to study how to analyse and investigate cross platform music recommendation methods based on IoT and blockchain technology. This paper tests four recommendation algorithm models for different recommendation list lengths. Experimental data shows that when the recommendation list length is 60, the accuracy, recall, and F1 score of the hybrid recommendation algorithm model are 62.14%, 50.64%, and 0.559, respectively, which are superior to the other three recommendation algorithms. In addition, when the number of users using the system is 60, the security of the system is 97.80%. A series of data proves that the cross platform music recommendation system based on the Internet of Things and blockchain designed in this paper is feasible and worthy of further promotion and application. Keywords: Music Recommendation Method; Internet of Things; Blockchain Technology; Music Platform. DOI: 10.1504/IJART.2027.10076014 Integrating Actor-Network Theory and Speculative Design: Exploring Innovations in HCI Education from a More-than-Human Perspective ![]() by Jiawei Li, Zhiyong Fu, Jiayue Wang, Lin Zhu, Jiaxuan Xu Abstract: This study explores integrating Actor-Network Theory (ANT) and speculative design into Human-Computer Interaction (HCI) education to cultivate students' More-than-Human Design capabilities. Traditional human-centered approaches are insufficient for complex socio-technical challenges; students need new frameworks to understand human and non-human actor interactions. 
ANT analyzes HCI systems, emphasizing human-technology co-construction while recognizing human agency. Speculative design provides innovative methods for exploring HCI possibilities through provocative artifacts and narratives. A workshop guided students in applying ANT and speculative design to analyze More-than-Human systems, comparing human participation and AI-assisted generation results. This demonstrated AI's potential to enrich HCI education. The research offers a comprehensive framework combining theory and practice, fostering students' critical thinking and innovative awareness for envisioning future human-computer interaction designs. Keywords: Human-Computer Interaction education; More-than-Human Design; Actor-Network Theory; Speculative Design; Design pedagogy. DOI: 10.1504/IJART.2027.10076214 Personalised News Recommendation System Based On Computer Artificial Intelligence Technology ![]() by Qiang Wang Abstract: In this paper, intelligent recommendation, an artificial intelligence algorithm, was incorporated into the design of digital media. Collaborative filtering algorithm was used to design a personalised news recommendation system. According to the current user's historical reading records, praise and sharing behaviour, collaborative filtering algorithm was used to recommend interesting news reports for users. Specific types of news were pushed to users based on algorithmic recommendations and were available to users through personalised news display pages. According to the validation of experimental data results, after algorithm processing, the existing calculated similarity was 0.85638. The top few target objects with the highest similarity were selected and pushed to users. Finally, data and opinions were collected through a survey questionnaire. 
The research results showed that integrating artificial intelligence technology into digital media can improve the design, innovation, and user experience of digital media, and provide users with more intelligent and personalised services. Keywords: Collaborative Filtering; Digital Media; Personalize News; Intellectualized System; Artificial Intelligence. DOI: 10.1504/IJART.2027.10076522 Analysis of Brushstrokes during the Creation of a Painting of a Peach ![]() by Otoniel Igno-Rosario, Claudia Hernández-Aguilar, Luis Manuel Hernández-Simón, Flavio Arturo Domínguez-Pacheco, Jose Alberto Medina-Pérez Abstract: This study presents a systematic video-based analysis of brushstrokes during the creation of a peach painting. Using a dual-camera setup that simultaneously captured top and side views, we recorded 23 preliminary strokes and the complete sequence of 281 strokes that formed the final artwork. Each stroke was processed frame by frame to extract spatial-temporal features, including three-dimensional coordinates, orientation angles, and stroke velocity. Our research was motivated by the limited availability of annotated data on brushstrokes and the need for interpretability of human stroke dynamics. The study is presented as a preliminary case painting that generates evidence-based data to support further research in the field of computational art and robotic painting. Code is available at https://github.com/oton-lab/brushstrokes. Keywords: brushstroke analysis; video processing; artistic painting; computational creativity. 
DOI: 10.1504/IJART.2027.10076644 Knowledge Mapping and Mechanistic Insights in Emotional Digital Design: An Empirical Study Combining Clustering and Thematic Analysis ![]() by Lan Ma, Yiyuan Ding, Chenxi Dong, Tang Liu, Zhenyu Li, Wenlei Mao, Tianyi Liu, Fernando Jorge Matias Sanches Oliveir, Xue Zhu, Lianfa Xu, Guangyi Tang, Feng Sha Abstract: This study maps the knowledge structure of emotional digital design using bibliometric co-word clustering and three-round thematic analysis of 3,817 Web of Science records. Six major research clusters were identified: design and technology, health and care, education and learning, mental health and emotion regulation, workplace studies, and conversational agents, revealing their evolution over time. Thematic analysis highlights research hotspots and cross-disciplinary trends that guide the field's development. By integrating quantitative mapping with qualitative interpretation, the study provides both a structural overview and detailed thematic insights. This dual-method approach enables researchers and practitioners to better understand the field's current state and future directions. It also addresses the limitations of traditional keyword co-occurrence analysis (its lack of semantic depth and mechanistic insight) by combining bibliometric clustering with multi-round thematic analysis, delivering a structural map and explanatory understanding of emotional digital design. Keywords: Emotional Digital Design; Affective Design Environments; Emotion-Centered Interaction; Experiential Design; User Engagement. DOI: 10.1504/IJART.2027.10076828 Application of Deep Learning Algorithms in the Transfer of Ethnic and Folk Art Design Styles in Cultural and Creative Products ![]() by Wanli Gu Abstract: This paper aims to address the issue of similar forms but different spirits in the process of transferring ethnic folk art styles to cultural and creative products. 
This paper proposes a deep transfer framework with cultural semantic guidance and multi-scale style decoupling. Firstly, this paper constructs annotation data covering typical artistic styles of various ethnic groups, and designs a semantic embedding module to encode intangible attributes such as pattern symbolism, colour taboos, and composition paradigms as style guidance signals. Secondly, this paper designs a dual path feature decoupling network in the content encoder to maintain the structural semantics of the product carrier. The experimental results show that the method proposed in this paper achieves FID accuracy of 27.3 Keywords: Ethnic Folk Art; Deep Learning; Style Transfer; Cultural and Creative Product Design; Cultural Semantic Embedding. DOI: 10.1504/IJART.2027.10076929 Intelligent Data Processing and Visual Design: Big Data Computing Promotes Innovation in Graphic Communication ![]() by Xinchun Wang, Dichen Li, Haolin Xiong Abstract: Traditional visual design is often limited to static presentation, lacking dynamic interaction and intelligent analysis, making it difficult to fully showcase the complexity and changing trends of data. This paper combines the contrastive language-image pre-training (CLIP) model and a generative adversarial network (GAN) to extract semantic information from text descriptions, generate high-quality graphic designs, and achieve semantic-based visualisation of creative data. This paper uses the CLIP model to analyse user-provided text descriptions, extract semantic features of the data, and convert them into high-dimensional feature vectors. These features are then input into a GAN generator, which generates high-quality graphs that conform to the semantic description through generative adversarial mechanisms. The results showed that the SSIM and PSNR values of the generated images were 0.95 and 33.5 dB, respectively, and the frame rate and response time of dynamic interaction were 45 FPS and 32 ms, respectively. 
Keywords: Intelligent Data Processing; Visual Design; Big Data Computing; Graphical Communication; Contrastive Language-Image Pretraining. DOI: 10.1504/IJART.2027.10077157 Analysis on The Construction Strategy of Intelligent Music Teaching Classroom Based on Emotional Education ![]() by Xinyu Du Abstract: This paper proposes and validates a human-computer interaction teaching scheme for emotional education, which solves the problem of teacher centred and monotonous interactive activities in middle school music classrooms. This paper combines Bayesian skin colour modelling, elliptical contour fitting, GMM tracking, and a hybrid method of multi class SVM to design and implement a gesture recognition system suitable for teaching scenarios. This paper conducted experiments at school A, including questionnaire interviews (600 students, 6 teachers) and system validation (approximately 2,400 gesture samples), to evaluate the current status and method performance. The main results showed that the proposed gesture recognition method achieved excellent accuracy on teaching terminals with a testing accuracy of 92.5%, while maintaining low latency (about 25 milliseconds) on resource limited devices, achieving a good balance between accuracy and real-time performance. Keywords: Interactive Teaching in Music Classroom; Gesture Recognition Algorithm; Human-computer Interaction; Skin Color Feature Extraction. DOI: 10.1504/IJART.2028.10077158 WayangFusionNet: Multi-Scale Cross-Modal Transformer for the Sustainability of Wayang Kulit Character Heritage Preservation ![]() by Andy Pramono, I-Cheng Chang, Betty Dewi Puspasari Abstract: The conservation of intangible cultural heritage is essential within the context of fast globalisation. Wayang Kulit, a traditional Indonesian art form, is an important entertainment medium and also an embodiment of important moral and cultural values. 
However, its existence is increasingly threatened by waning youth interest and limited digital documentation. Artificial intelligence facilitates deeper analysis, improved recognition, and sustainable digital cultural preservation. This paper describes WayangFusionNet, a new hybrid model that combines multi-scale feature extraction from EfficientNetV2B3 with a cross-modal transformer for character recognition. It outperforms existing models, achieving a test accuracy of 99.27%. Qualitative analyses, such as confusion matrices, class activation maps, and t-SNE visualisations, validate the model's ability to establish and convey distinct features. The experiment results confirm the ability of digital technology to preserve endangered art forms and revive the art's values for future use. Keywords: Multi-Scale Cross-Modal Network; Feature Pyramid Networks; Hybrid Network; Indonesian Wayang Kulit Classification. DOI: 10.1504/IJART.2027.10077160 A Self-supervised Sub-style Separation Framework for Artistic Painting Classification ![]() by Rui Huang, J.I.A. CUI, Che Jiang, Chengran Hu, Meng Qi, Zhelin Li Abstract: Art style classification remains challenging due to subjective style delineation and the coexistence of multi-stylistic features within a single painting, which limit the effectiveness of conventional feature-learning strategies. This study proposes a Self-Supervised Learning-based sub-style Modelling method (SSLM) that models the sub-style distribution in enhanced views of a painting using introduced variance and covariance losses to extract a more stable and discriminative style representation. Experiments on style databases demonstrate that SSLM outperforms state-of-the-art methods in handling style ambiguity. Furthermore, we introduce a style uncertainty index to quantify the dominance of principal styles over sub-styles. Based on this metric, we construct a new dataset, P2, using a style-cleaning algorithm to enhance style purity. 
The accuracy of supervised models on P2 is improved through experiments, demonstrating the efficiency of cleaning style uncertainty. The proposed study offers new insights into art style classification through the sub-style modelling mechanism and style uncertainty quantification. Keywords: Style classification; Sub-style modelling; Style uncertainty; Image representation learning; Art style recognition. DOI: 10.1504/IJART.2028.10077162 Image Style Transfer and Visual Expression Based on Neural Networks ![]() by Lianlian He, Wei Sun, Dongxian Yu Abstract: Given the difficulty in balancing style and content and the limitation of visual expression in current image style transfer, this paper applies an improved DiffStyler model. First, ResNet-50 extracts multi-scale features (shallow conv1, middle res3, deep res5) with spatial alignment via upsampling and channel concatenation. The SE (squeeze-and-excitation) module dynamically adjusts channel weights through sigmoid-constrained intervals. Second, a dual-path Transformer architecture uses VGG19 (visual geometry group)-extracted style features as Key vectors and content features as Query vectors, achieving cross-domain alignment via similarity matrix calculations. Third, the DDPM (denoising diffusion probabilistic models) framework injects with a Keywords: Neural Networks; Image Style Transfer; Visual Expression; Style-Content Balance; Diffusion Model. DOI: 10.1504/IJART.2027.10077167 Practice of Multimodal Music Teaching Mode Based on Artificial Intelligence ![]() by Jie Liu Abstract: This paper takes EG and CG as the research objects to explore the differences between EG and CG. The pre-test is mainly to test the difference in music between EG and CG, and to compare with the students after the experiment. The post-experiment is divided into three phases: post-test, questionnaire, and interview. 
The post-test uses students' final exam scores as indicators to compare the learning effects of EG and CG, and to test the effect of the multimodal teaching model in music teaching. After the test, the same questionnaire was distributed to EG and CG to determine whether their musical interest improved. On this basis, the teacher randomly selected 10 students from EG to investigate their interest in multimodal teaching methods. The average scores of the pre-test and post-test of EG were 21.85 and 27.03, respectively, indicating that multimodal teaching can promote students' music learning. Keywords: Multimodal Music Teaching Model in Practice; Artificial Intelligence; Mel Frequency Inversion Factor; Multimodal Teaching Model. DOI: 10.1504/IJART.2027.10077173 Emotion Recognition in Artworks: Multimodal Data Analysis Based on Deep Learning Algorithms ![]() by Ying Bai, Liping Ouyang Abstract: Traditional art emotion recognition methods have limitations such as single feature extraction, strong subjectivity, and low recognition accuracy. This paper proposes a deep multimodal feature fusion method based on cross attention mechanism, which achieves fine-grained bidirectional interaction between visual, textual, and metadata patterns. This study uses an accurate cross database matching strategy to construct a high-quality multimodal dataset containing 32178 valid samples, integrating visual images, text descriptions, and structured metadata. The accuracy of this method on the test set is 92.7%, with an F1 score of 91.4%, which is significantly better than visual input only (85.2%) and text input only (79.8%). Compared with early and late fusion strategies, it improved accuracy and F1 score by approximately 4.3 to 5.7 percentage points, demonstrating its potential for application in digital humanities, intelligent curation, and other fields. Keywords: Artworks Research; Emotion Recognition; Multimodal Fusion; Cross-Attention Mechanism; Transformer Architecture. 
DOI: 10.1504/IJART.2027.10077282 3D Geometric Reconstruction Method of Damaged Cultural Relics Based on Multimodal Data Fusion and Deep Learning ![]() by Feng Li, Yajie Bai Abstract: The current 3D geometric restoration methods used for damaged cultural relics are susceptible to noise and information loss when using single modal data. For this purpose, this paper applies multimodal point cloud feature fusion and 3D generative adversarial processing. Firstly, a multimodal mapping function is used to encode the laser scanned point cloud, image sequence, and computed tomography (CT) slices into a dense 3D feature tensor. Next, the discriminator uses multi-scale convolution kernels to determine the geometric consistency between the generated point cloud and the ground truth point cloud at both local and global levels, and approximates the true distribution of lost artefacts by jointly optimising adversarial loss and geometric reconstruction error. The results show that when the defect rate is 10%–40%, the chamfer distance of the proposed method increases from 0.30 mm to 0.47 mm; the average point spacing deviation increases from 0.13 mm to 0.22 mm. Keywords: 3D Geometric Reconstruction; Multimodal Feature Fusion; Generative Adversarial Loss; Geometric Consistency; Digital Heritage Preservation. DOI: 10.1504/IJART.2027.10077430 Design and Implementation of a Chinese Painting Style Copying System based on Image Recognition and Style Transfer Algorithm ![]() by Yifan Xue Abstract: Neural replication of Chinese ink painting demands stroke-level semantics and physics-aware modelling of ink-wash gradients and paper-fibre micro-textures. 
We integrated a stroke recogniser with a physics-guided, multi-scale style-transfer module, incorporating Darcy and anisotropic-diffusion priors, and evaluated fidelity, robustness, and efficiency. Trained on 2,900 Chinese ink paintings (2,400 internal; 500 external) spanning shan shui and hua niao genres, across resolutions up to 4,096² and various Xuan papers/inks, our multi-branch CNN stroke recogniser and stroke-conditioned encoder-decoder outperformed baselines (Gatys NST, AdaIN, dual-path diffusion). Metrics showed superior performance: SSIM 0.928 (vs. 0.902), LPIPS 0.168–0.186 (vs. 0.198–0.218), SCDS 0.816 (vs. 0.772), stroke recognition macro-F1 0.892 and mIoU 0.803. Expert ratings (n=15) favoured our method (8.1–8.4 vs. 7.3–7.8). External validation SSIM reached 0.921, with efficient 4,096² inference (12.4 s). This stroke-conditioned, physics-guided system achieves higher fidelity, cultural authenticity, cross-domain robustness, and high-resolution efficiency, advancing conservation-grade digital replication of cultural heritage. Keywords: Artificial Intelligence; Image Processing; Computer-Assisted; Pattern Recognition; Automated; Algorithms; Reproducibility of Results; Cultural Characteristics. DOI: 10.1504/IJART.2027.10077567 Digital display and virtual experience of cultural heritage based on image processing ![]() by Yuting Deng Abstract: In response to the contradiction between high fidelity reconstruction and lightweight virtual experience in digital display of cultural heritage, this paper proposes an end-to-end digital display framework that integrates multi view image processing and neural rendering optimisation. Firstly, based on the ETH 3D and MuralDH public datasets, this paper improves input consistency through preprocessing methods such as illumination normalisation, multispectral registration, and highlight separation. 
Secondly, in the 3D reconstruction stage, this paper introduces texture confidence weighting and multispectral feature fusion mechanisms, and then constructs a lightweight neural radiation field (NeRF) model, embedding a dynamic detail level mechanism for texture perception. The experiment shows that this method compresses the model volume to 16.6 MB, and at a high fidelity level of PSNR of 32.7 dB and SSIM of 0.936, the average frame rate on the network side is 42.3 fps, which is significantly better than the baseline scheme. Keywords: image processing; cultural heritage; digital display; neural radiation field; lightweight rendering; virtual experience. DOI: 10.1504/IJART.2028.10078101 Trends and Topics in Arts Education and Artificial Intelligence: a Systematic Review using a Human-In-The-Loop Model ![]() by Yue Yu, Ruolin Sun Abstract: Artificial Intelligence in Arts Education (AIAEd) has rapidly expanded, yet its conceptual evolution remains underexamined. This study systematically reviews 372 Web of Science publications using Latent Dirichlet Allocation and Computational Grounded Theory to map the field's paradigmatic development. Five thematic clusters emerge: fundamental principle design, technological and interdisciplinary expansion, AI-driven educational models, intelligent behavioural evaluation, and personalised creative learning. A six-stage evolutionary analysis reveals a trajectory from early exploratory work to systematised design, discipline-specific applications, metaverse-supported visual pedagogies, and the recent shift toward generative AI and multidimensional learner modelling. The latest stage highlights creativity-oriented AI tools, affect-aware feedback, and socio-health dimensions of evaluation. Findings indicate a delayed but decisive transition from technology-centred experimentation to human-centred, creativity-enhancing AIAEd frameworks. 
Keywords: Artificial Intelligence in Arts Education (AIAEd); Computational Grounded Theory; Topic Modelling; Generative Artificial Intelligence; Pedagogical Design and Evaluation. DOI: 10.1504/IJART.2028.10078302 Application of Style Transfer and Generative Art Based on Deep Learning in Personalized Artistic Creation ![]() by Haitao Pu, Yuang Pu Abstract: Traditional artistic creation faces challenges in personalisation and diversity, and it is not easy to realise the growing individual needs. This paper proposes a novel deep learning method that integrates conditional generative adversarial networks and multi-scale convolutional neural networks for personalised artistic creation. The technique utilises cGAN to process user-input text descriptions, sketches, or style labels as conditional vectors, generating artworks with specific styles and content. MS-CNN improves diversity and refinement in style transfer, achieving precise fusion through an adaptive style extraction mechanism. Experimental outcomes demonstrate a seamless integration of style and content; image similarity exceeds 0.8, and the artistic style score exceeds 6 on a ten-point scale, enhancing personalised expression. The core innovation is the collaborative framework that simultaneously optimises content generation and style fusion, addressing the imbalance in traditional approaches. This research provides a new technical path for personalised artistic creation and advances the application of deep learning in art. Keywords: Deep Learning; Generative Adversarial Networks; Style Transfer; Personalized Artistic Creation; Multi-scale Convolutional Neural Networks. 
DOI: 10.1504/IJART.2028.10078325 Wearable Whole Body Dance Action Evaluation System Based on Artificial Intelligence Movement Guidance ![]() by Nuo Li, Yushan Liu, Jun Niu Abstract: With the development of computers, artificial intelligence technology and wearable devices are advancing rapidly and are becoming more and more widely used in daily life. In order to evaluate the dance movements of the whole body and improve the accuracy of the dance posture, this article trains dancers through wearable devices guided by artificial intelligence sports, collects the changes before and after different dance positions under the guidance of wearable devices, forms a continuous-frame trend curve of human body posture change based on human body posture information, and determines the extreme value positions of the curve. The similarity between the resulting segmented action sequences is then calculated to determine whether different action sequences belong to the same or similar actions. The experimental results found that wearable devices based on artificial intelligence motion guidance can effectively improve the accuracy of dance poses. Compared with dancers without wearable devices and artificial intelligence guidance, the standard of dance poses has increased by more than 50%. This shows that wearable devices guided by artificial intelligence can effectively improve dance accuracy. Keywords: Artificial Intelligence; Wearable Devices; Dance Moves; Evaluation Systems. DOI: 10.1504/IJART.2027.10078502 Construction of Gender Identities through Cosmetic Advertisement: A Case Study ![]() by Uzma Nazar, Hussain Othman, Sadia Deep, Nazia Suleman, Hina Shaheen Abstract: This study examines how cosmetic advertisements construct and reshape female identity within an Islamic societal context, focusing on the linguistic strategies used to promote idealised images of women. 
Using a qualitative case study approach, the research analyses the Golden Rose cosmetic booklet through textual and sociolinguistic analysis to identify how vocabulary, imagery, and embedded messages contribute to identity formation. The analysis reveals that the advertisements employ persuasive linguistic devices, such as metaphors, personification, and exaggerated descriptors, to promote unrealistic standards of beauty associated with glamour, youthfulness, desirability, and perfection. These linguistic and visual cues collectively construct an artificial female identity that contrasts sharply with women's real-life attributes and living standards. The study's key contribution lies in demonstrating how cosmetic advertising language subtly manipulates perceptions of femininity, encouraging women to internalise commoditised and idealised identities. The findings highlight the need for greater awareness of the socio-cultural impact of advertising discourse and its role in shaping women's self-perception in contemporary Islamic societies. Keywords: Gender Identity Construction; Cosmetic Advertising Discourse; Linguistic Strategies; Idealised Femininity; Sociocultural Representation. DOI: 10.1504/IJART.2027.10078673
Investigating the Digital Exhibition Experience at the Huangmei Opera Museum through MOS by Li Yang, Liliana Soares, Rute Gomes, Nankai Cheng
Abstract: This study examines visitor perceptions of the Huangmei Opera Museum's digital exhibition through an MOS-based post-visit questionnaire. Rather than treating Mean Opinion Score (MOS) as a complete theory of user experience, the study uses it as a descriptive aggregation method to summarize visitors' ratings across five evaluation dimensions adapted from Kaasinen et al.'s UX goal-setting framework: brand, theory, empathy, technology, and vision.
Based on 38 valid responses collected in Anhui Province, the results show relatively positive evaluations of the brand, theory, empathy, and vision dimensions, while the technology dimension received comparatively lower ratings. The findings suggest that digital exhibition design in opera museums should move beyond the mere presence of technology and place greater emphasis on interpretive depth, narrative engagement, and meaningful interaction. As an exploratory case study, the paper offers a context-specific contribution to user experience evaluation in digital heritage exhibitions and provides practical implications for museum design. Keywords: Huangmei Opera Museum; User Experience (UX); Mean Opinion Score (MOS); Cultural heritage; Museum evaluation. DOI: 10.1504/IJART.2028.10078677
A Study on the Accessibility Analysis and Optimization Strategies for Public Facilities of Shanghai's Architectural Heritage by Lu Chen, Xueqing Zhang
Abstract: This study investigates the architectural heritage of Shanghai, with a specific focus on four central districts (Huangpu, Xuhui, Changning, and Hongkou) where such heritage is densely concentrated. Guided by the 15-minute city concept, the research examines and systematically evaluates the distribution of public facilities surrounding these heritage sites. Based on the evaluation findings and considering the distinct developmental orientations of the four districts, targeted strategies are proposed for enhancing the public facility network. These strategies are designed to address the daily needs of local residents and tourists, facilitate efficient urban management, and contribute to the sustainable development of the respective areas. Keywords: Fifteen-Minute City; Cultural Heritage; Public Facilities; Accessibility Analysis.
DOI: 10.1504/IJART.2028.10078680
IVTI-Based Artwork: Multisensory Technology as Aesthetic Fulfillment for People with Visual Disabilities in Indonesia by Nur Fajrie, Imaniar Purbasari, Slamet Khoeron, Ika Yuni Purnama, Hisbulloh Als Mustofa
Abstract: This study addresses art accessibility for visually impaired individuals in Indonesia by developing the Interactive Voice Touch Interface (IVTI), a multisensory system combining tactile reliefs and localized audio narratives. Employing a mixed-method Research and Development approach based on the ADDIE (Analysis, Design, Development, Implementation, and Evaluation) model and involving 150 participants, including 15 core testers, the research used iterative design cycles to refine the prototype. Needs analysis identified tactile access (94%) and descriptive audio (91%) as critical features for art appreciation. Evaluation results indicated a 93% functional reliability rate, alongside significant improvements in user confidence, tactile sensitivity, and emotional engagement. The findings confirm that IVTI bridges perceptual and cognitive gaps between sighted and visually impaired users. The study concludes that culturally localized multisensory technologies are essential for transforming art accessibility, promoting inclusive education, and advancing social equity. Keywords: Multisensory; user-centered design; haptic interaction; visual impairment; Indonesia. DOI: 10.1504/IJART.2028.10078978
Narratives without Borders: Reviewing Transmedia Storytelling as a Tool for Brand Expansion by Aman Vats, Eshan Parikh, Dhanashree Giri Amatya
Abstract: The evolving media industry is increasingly organized around platform-driven and segmented ecosystems, where traditional single-medium storytelling has become inadequate for sustaining long-term brand growth and audience engagement.
This paradigm shift positions transmedia storytelling as a strategic method of brand expansion, particularly within the context of digital convergence and the rapid proliferation of OTT platforms. This review article critically examines transmedia storytelling by integrating insights from academic literature, media industry practices, and case studies. It addresses global developments, including major Western franchises such as the Marvel Cinematic Universe and Star Wars, alongside emerging practices in South Asia, where OTT platforms like Netflix, Amazon Prime Video, and Disney+ Hotstar experiment with serial and transmedia content strategies. Drawing on established theoretical frameworks such as convergence culture, transmedia grammar, and brand equity theory, the article adopts a multidisciplinary approach to analyze interconnected narrative worlds across platforms while maintaining coherence. Keywords: Transmedia storytelling; Brand extension; Convergence culture; Participatory media; Transmedia Grammar; OTT platforms; Global branding. DOI: 10.1504/IJART.2028.10078979
Special Issue on: OA Intelligent Media Arts Convergence of Technology, Creativity, and Performance
Abstract: To address structural mismatch, the lack of controllability over harmony and texture, and the insufficient realism of generated samples caused by exposure bias in existing artificial intelligence (AI) models for polyphonic music generation, this study designs a novel polyphonic AI music generation algorithm based on transformer and adversarial mechanisms. Results show that the controllable choral transformer model achieves a test accuracy of up to 93.52% in the alto part, with a note error rate as low as 6.48%. The proposed model also achieves a training accuracy of 94.86%, a note-chord consistency score of 0.752, and a melody-chord pitch distance of only 0.853. By combining relative position attention, multidimensional conditional control, and adversarial training, the proposed algorithm effectively improves the structural rationality and harmonic consistency of generated music, providing an efficient and feasible technical solution for AI-assisted music creation and personalised music generation. Keywords: transformer; music generation; polyphony; adversarial mechanism; relative position attention mechanism. DOI: 10.1504/IJART.2026.10078013
Abstract: Music style defines a musical work's overall characteristics, while beats carry emotional undertones, and dance consists of rhythmic movements. This study introduces a music style and beat recognition model based on a genetic algorithm, optimising feature extraction and model selection to improve recognition accuracy. The model classifies music into three styles and reclassifies the beat dataset accordingly. Using the preceding style recognition results, it then selects the most suitable of multiple beat recognition models through adaptive selection of the fitness function, and applies that model to recognise the beats of the song under test. Experimental results show that the model achieves good performance on F-measure and other metrics, and can effectively identify beat characteristics across different music styles. Error analysis identifies failure patterns in blues, rock and other styles, providing data support for accurately matching performance actions to music emotions. Keywords: biological model; style recognition; beat detection; deep learning. DOI: 10.1504/IJART.2026.10078102
Abstract: With the advancement of AI in artistic creation, deep learning-based music generation has become a point of convergence for intelligent media and digital art. This paper proposes an automatic melody generation and arrangement auxiliary system for music production, based on the transformer architecture. By constructing a self-attention mechanism and a style embedding model, the system captures long-term dependencies between notes during melody generation, thus realising the dynamic coordination of rhythm and pitch. The results demonstrate the effectiveness of the transformer structure in music sequence generation, and show the application potential of artificial intelligence in music creation assistance, automatic arrangement and personalised melody generation. The system is applied in fields such as intelligent arrangement, digital music education, and film and game music, providing a feasible path and technical basis for human-machine collaborative creation. Keywords: transformer architecture; music melody; automatically generated; arranging auxiliary system. DOI: 10.1504/IJART.2026.10078229
Open Access