Fast and Effective AI Approaches for Video Quality Improvement

Marco Bertini, Leonardo Galteri, Lorenzo Seidenari, Tiberio Uricchio, Alberto Del Bimbo∗
[name.surname]@unifi.com
[name]@small-pixels.com
Università di Firenze - MICC
Small Pixels
Firenze, Italy

ABSTRACT
In this work we present solutions based on AI techniques to the problem of real-time video quality improvement, addressing both video super resolution and compression artefact removal. These solutions can be used to revamp video archive materials, allowing their reuse in modern video production, and to improve the end user experience by playing streaming videos in higher quality while requiring less bandwidth for their transmission. The proposed approaches can be used on a variety of devices as a post-processing step, without requiring any change in existing video encoding and transmission pipelines. Experiments on standard video datasets have shown that the proposed approaches improve video quality metrics considering either fixed bandwidth budgets or fixed quality goals.

Figure 1: System overview: left) GAN-based training; right) use of the network to improve video quality of a generic video (top), use of a network specialized on a specific video (bottom)

CCS CONCEPTS
• Computing methodologies → Learning from critiques; Image compression; • Computer systems organization → Neural networks.

KEYWORDS
Video quality enhancement, GANs, video players, real-time video enhancement

ACM Reference Format:
Marco Bertini, Leonardo Galteri, Lorenzo Seidenari, Tiberio Uricchio, Alberto Del Bimbo. 2022. Fast and Effective AI Approaches for Video Quality Improvement. In Mile-High Video Conference (MHV ’22), March 1–3, 2022, Denver, CO, USA. ACM, New York, NY, USA, 2 pages. https://doi.org/10.1145/3510450.3517270

∗ These authors contributed equally to the paper.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.
MHV ’22, March 1–3, 2022, Denver, CO, USA
© 2022 Association for Computing Machinery.
ACM ISBN 978-1-4503-9222-8/22/03...$15.00
https://doi.org/10.1145/3510450.3517270

1 INTRODUCTION AND PROPOSED METHODS
Lossy video compression algorithms such as H.264, H.265, AV1, etc. are the foundation of video streaming but, in order to optimize available bandwidth and transmission costs, they introduce visual artefacts like blocking, mosquito noise, posterization, etc. that hamper user experience. In this work, we present a set of techniques based on AI that can be used to revamp video archive materials [5, 6] or increase the visual quality of streaming videos [1, 7]. The developed neural networks, trained using the Generative Adversarial Networks (GANs) framework [3] (Fig. 1 left), can be optimized to run in real-time [2, 4, 7] or faster than real-time even on mid-level GPUs, allowing their deployment for video restoration. They can be further optimized to run in real-time on mobile devices, exploiting CoreML and Neural Engine hardware on iOS devices [4], and exploiting WebGL and mobile GPU acceleration on Android and web browsers. The main scientific contributions of our work are:
(1) development of losses that combine perceptual and signal based metrics and help to reconstruct perceptually pleasant frames;
(2) development of neural network designs that reduce computational cost;
(3) development of GAN training regimes that generate realistic details.
Furthermore, we present a set of products, based on these contributions, that can be deployed on a variety of end user devices. These products can be embedded in video players, to process the frames immediately before showing them to the user (Fig. 1 right-top). They can upscale video frames, thus effectively reducing the bandwidth required to stream videos, and at the same time eliminate compression artefacts and add image details that were lost due to lossy compression. Our networks can be customized to specific video types (e.g. soccer, cartoons, documentaries) or on a per-title basis (Fig. 1 right-bottom), making it possible to reach a required video quality with a lower bitrate. The per-title approach requires sending the weights of the network for each title; this is possible thanks to the compactness of the designed networks, which require an extremely limited space. The proposed method can also be adapted and effectively used for video conferencing applications.
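As an illustration of contribution (1), a loss that mixes a signal-based term with a perceptual term could be sketched as follows. This is a minimal sketch under stated assumptions, not the loss used in this work: the function names, the weights `w_signal` and `w_perceptual`, and the toy feature extractor (horizontal pixel differences standing in for features of a trained network) are all hypothetical.

```python
from typing import List

# A grayscale frame as rows of pixel values in [0, 1] (illustrative type only).
Frame = List[List[float]]


def mse(a: Frame, b: Frame) -> float:
    """Signal-based term: mean squared error over all pixels."""
    total = 0.0
    count = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            total += (pa - pb) ** 2
            count += 1
    return total / count


def toy_features(frame: Frame) -> Frame:
    """Hypothetical stand-in for a deep feature extractor.

    It returns horizontal pixel differences, so it responds to edges and
    ignores uniform brightness shifts. A real perceptual loss would compare
    activations of a trained network instead.
    """
    return [[row[i + 1] - row[i] for i in range(len(row) - 1)] for row in frame]


def combined_loss(
    restored: Frame,
    target: Frame,
    w_signal: float = 1.0,      # assumed weight, not taken from the paper
    w_perceptual: float = 0.5,  # assumed weight, not taken from the paper
) -> float:
    """Weighted sum of a pixel-space error and a feature-space error."""
    signal = mse(restored, target)
    perceptual = mse(toy_features(restored), toy_features(target))
    return w_signal * signal + w_perceptual * perceptual


if __name__ == "__main__":
    target = [[0.0, 1.0], [1.0, 0.0]]
    blurred = [[0.5, 0.5], [0.5, 0.5]]  # all edges smoothed away
    print(combined_loss(target, target))
    print(combined_loss(blurred, target))
```

In this toy setting the blurred frame is penalized by both terms: the pixel error measures the deviation from the target, while the feature error measures the lost edges. In GAN-based training an adversarial term from the discriminator would typically be added as a further weighted term.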


REFERENCES
[1] Leonardo Galteri, Marco Bertini, Lorenzo Seidenari, Tiberio Uricchio, and Alberto Del Bimbo. 2020. Increasing video perceptual quality with GANs and semantic coding. In Proceedings of the 28th ACM International Conference on Multimedia. ACM, 862–870.
[2] Leonardo Galteri, Lorenzo Seidenari, Marco Bertini, and Alberto Del Bimbo. 2019. Towards real-time image enhancement GANs. In International Conference on Computer Analysis of Images and Patterns. Springer, 183–195.
[3] Leonardo Galteri, Lorenzo Seidenari, Marco Bertini, and Alberto Del Bimbo. 2019. Deep universal generative adversarial compression artifact removal. IEEE Transactions on Multimedia 21, 8 (2019), 2131–2145.
[4] Leonardo Galteri, Lorenzo Seidenari, Marco Bertini, Tiberio Uricchio, and Alberto Del Bimbo. 2019. Fast video quality enhancement using GANs. In Proceedings of the 27th ACM International Conference on Multimedia. ACM, 1065–1067.
[5] Filippo Mameli, Marco Bertini, Leonardo Galteri, and Alberto Del Bimbo. 2020. Image and video restoration and compression artefact removal using a NoGAN approach. In Proceedings of the 28th ACM International Conference on Multimedia. ACM, 4539–4541.
[6] Filippo Mameli, Marco Bertini, Leonardo Galteri, and Alberto Del Bimbo. 2021. A NoGAN approach for image and video restoration and compression artifact removal. In 2020 25th International Conference on Pattern Recognition (ICPR). IEEE, 9326–9332.
[7] Federico Vaccaro, Marco Bertini, Tiberio Uricchio, and Alberto Del Bimbo. 2021. Fast Video Visual Quality and Resolution Improvement using SR-UNet. In Proceedings of the 29th ACM International Conference on Multimedia. ACM, 1221–1229.
