Title: Video inpainting using hybrid DMN-EfficientNet with LGSR
Authors: Manjunath R. Hudagi; Lingaraj A. Hadimani; Sachin A. Urabinahatti; Ramesh A. Medar; Abhinandan P. Shirahatti
Addresses: Department of Computer Science and Business Systems, Kolhapur Institute of Technology's College of Engineering (Empowered Autonomous), Kolhapur 416234, Maharashtra, India ' Department of Computer Science and Engineering, Kolhapur Institute of Technology's College of Engineering (Empowered Autonomous), Kolhapur 416234, Maharashtra, India ' Department of AI&ML, BMS Institute of Technology & Management, Bengaluru 560064, Karnataka, India ' Department of Computer Science and Engineering, Kolhapur Institute of Technology's College of Engineering (Empowered Autonomous), Kolhapur 416234, Maharashtra, India ' Department of Computer Science and Engineering, Kolhapur Institute of Technology's College of Engineering (Empowered Autonomous), Kolhapur 416234, Maharashtra, India
Abstract: Video inpainting attempts to take off a specific area of a video or object and replace it in a visually consistent way. Identifying any interesting object or event has become a very time-consuming task because there is far more recorded video than the operators can watch. To overcome this, the video inpainting approach, Deep Maxout Network-EfficientNet (DMN-EfficientNet) is proposed in this paper. The input videos are subjected to video frame extraction. Then, residual frame extraction is performed using the Deep Residual Network (DRN). The extracted residual frames are then subjected to initial level restoration using Low-rankness guided Group Sparse Representation (LGSR). The output generated and the extracted residual frames are used as the input to the DMN-EfficientNet for video inpainting. The DMN-EfficientNet is devised using Deep Maxout Network (DMN) and EfficientNet. The performance of DMN-EfficientNet is estimated with metrics, like Second Derivative Measure of Enhancement (SDME), Structural Similarity (SSIM) and Peak Signal to Noise Ratio (PSNR), and the values attained are 78.07, 0.986 and 40.19 dB, respectively.
Keywords: video inpainting; deep residual network; low-rankness guided group sparse representation; deep maxout network; EfficientNet.
DOI: 10.1504/IJWMC.2025.148109
International Journal of Wireless and Mobile Computing, 2025 Vol.29 No.2, pp.170 - 183
Received: 14 Mar 2024
Accepted: 12 Aug 2024
Published online: 25 Aug 2025 *