A contourlet transform based algorithm for real-time video encoding

Stamos Katsigiannis, Georgios Papaioannou, Dimitris Maroulis

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In recent years, real-time video communication over the internet has been widely utilized for applications like video conferencing. Streaming live video over heterogeneous IP networks, including wireless networks, requires video coding algorithms that can support various levels of quality in order to adapt to the network end-to-end bandwidth and transmitter/receiver resources. In this work, a scalable video coding and compression algorithm based on the Contourlet Transform is proposed. The algorithm allows for multiple levels of detail, without re-encoding the video frames, by just dropping the encoded information referring to higher resolution than needed. Compression is achieved by means of lossy and lossless methods, as well as variable bit rate encoding schemes. Furthermore, due to the transformation utilized, it does not suffer from blocking artifacts that occur with many widely adopted compression algorithms. Another highly advantageous characteristic of the algorithm is the suppression of noise induced by low-quality sensors usually encountered in web-cameras, due to the manipulation of the transform coefficients at the compression stage. The proposed algorithm is designed to introduce minimal coding delay, thus achieving real-time performance. Performance is enhanced by utilizing the vast computational capabilities of modern GPUs, providing satisfactory encoding and decoding times at relatively low cost. These characteristics make this method suitable for applications like video-conferencing that demand real-time performance, along with the highest visual quality possible for each user. Through the presented performance and quality evaluation of the algorithm, experimental results show that the proposed algorithm achieves better or comparable visual quality relative to other compression and encoding methods tested, while maintaining a satisfactory compression ratio. Especially at low bitrates, it provides more human-eye friendly images compared to algorithms utilizing block-based coding, like the MPEG family, as it introduces fuzziness and blurring instead of artificial block artifacts.
Original languageEnglish
Title of host publicationReal-Time Image and Video Processing 2012
EditorsNasser Kehtarnavaz, Matthias F. Carolsohn
PublisherSociety of Photo-Optical Instrumentation Engineers
Number of pages12
ISBN (Print)9780819491299
DOIs
Publication statusPublished - 1 May 2012
Externally publishedYes
EventSPIE Photonics Europe, Real-Time Image and Video processing Conference - Brussels, Belgium
Duration: 16 Apr 201219 Apr 2012

Publication series

NameProceedings of SPIE
PublisherSPIE
Volume8437
ISSN (Print)0277-786X

Conference

ConferenceSPIE Photonics Europe, Real-Time Image and Video processing Conference
CountryBelgium
CityBrussels
Period16/04/1219/04/12

Keywords

  • Contourlet transform
  • real-time video encoding
  • GPU computing
  • denoising
  • video-conferencing
  • surveillance video

Cite this

Katsigiannis, S., Papaioannou, G., & Maroulis, D. (2012). A contourlet transform based algorithm for real-time video encoding. In N. Kehtarnavaz, & M. F. Carolsohn (Eds.), Real-Time Image and Video Processing 2012 [843704] (Proceedings of SPIE; Vol. 8437). Society of Photo-Optical Instrumentation Engineers. https://doi.org/10.1117/12.924327
Katsigiannis, Stamos ; Papaioannou, Georgios ; Maroulis, Dimitris. / A contourlet transform based algorithm for real-time video encoding. Real-Time Image and Video Processing 2012. editor / Nasser Kehtarnavaz ; Matthias F. Carolsohn. Society of Photo-Optical Instrumentation Engineers, 2012. (Proceedings of SPIE).
@inproceedings{e5660cba39cf47bb92154b1fa3680f9e,
title = "A contourlet transform based algorithm for real-time video encoding",
abstract = "In recent years, real-time video communication over the internet has been widely utilized for applications like video conferencing. Streaming live video over heterogeneous IP networks, including wireless networks, requires video coding algorithms that can support various levels of quality in order to adapt to the network end-to-end bandwidth and transmitter/receiver resources. In this work, a scalable video coding and compression algorithm based on the Contourlet Transform is proposed. The algorithm allows for multiple levels of detail, without re-encoding the video frames, by just dropping the encoded information referring to higher resolution than needed. Compression is achieved by means of lossy and lossless methods, as well as variable bit rate encoding schemes. Furthermore, due to the transformation utilized, it does not suffer from blocking artifacts that occur with many widely adopted compression algorithms. Another highly advantageous characteristic of the algorithm is the suppression of noise induced by low-quality sensors usually encountered in web-cameras, due to the manipulation of the transform coefficients at the compression stage. The proposed algorithm is designed to introduce minimal coding delay, thus achieving real-time performance. Performance is enhanced by utilizing the vast computational capabilities of modern GPUs, providing satisfactory encoding and decoding times at relatively low cost. These characteristics make this method suitable for applications like video-conferencing that demand real-time performance, along with the highest visual quality possible for each user. Through the presented performance and quality evaluation of the algorithm, experimental results show that the proposed algorithm achieves better or comparable visual quality relative to other compression and encoding methods tested, while maintaining a satisfactory compression ratio. Especially at low bitrates, it provides more human-eye friendly images compared to algorithms utilizing block-based coding, like the MPEG family, as it introduces fuzziness and blurring instead of artificial block artifacts.",
keywords = "Contourlet transform, real-time video encoding, GPU computing, denoising, video-conferencing, surveillance video",
author = "Stamos Katsigiannis and Georgios Papaioannou and Dimitris Maroulis",
year = "2012",
month = "5",
day = "1",
doi = "10.1117/12.924327",
language = "English",
isbn = "9780819491299",
series = "Proceedings of SPIE",
publisher = "Society of Photo-Optical Instrumentation Engineers",
editor = "Nasser Kehtarnavaz and Carolsohn, {Matthias F.}",
booktitle = "Real-Time Image and Video Processing 2012",
address = "United States",

}

Katsigiannis, S, Papaioannou, G & Maroulis, D 2012, A contourlet transform based algorithm for real-time video encoding. in N Kehtarnavaz & MF Carolsohn (eds), Real-Time Image and Video Processing 2012., 843704, Proceedings of SPIE, vol. 8437, Society of Photo-Optical Instrumentation Engineers, SPIE Photonics Europe, Real-Time Image and Video processing Conference, Brussels, Belgium, 16/04/12. https://doi.org/10.1117/12.924327

A contourlet transform based algorithm for real-time video encoding. / Katsigiannis, Stamos; Papaioannou, Georgios; Maroulis, Dimitris.

Real-Time Image and Video Processing 2012. ed. / Nasser Kehtarnavaz; Matthias F. Carolsohn. Society of Photo-Optical Instrumentation Engineers, 2012. 843704 (Proceedings of SPIE; Vol. 8437).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - A contourlet transform based algorithm for real-time video encoding

AU - Katsigiannis, Stamos

AU - Papaioannou, Georgios

AU - Maroulis, Dimitris

PY - 2012/5/1

Y1 - 2012/5/1

N2 - In recent years, real-time video communication over the internet has been widely utilized for applications like video conferencing. Streaming live video over heterogeneous IP networks, including wireless networks, requires video coding algorithms that can support various levels of quality in order to adapt to the network end-to-end bandwidth and transmitter/receiver resources. In this work, a scalable video coding and compression algorithm based on the Contourlet Transform is proposed. The algorithm allows for multiple levels of detail, without re-encoding the video frames, by just dropping the encoded information referring to higher resolution than needed. Compression is achieved by means of lossy and lossless methods, as well as variable bit rate encoding schemes. Furthermore, due to the transformation utilized, it does not suffer from blocking artifacts that occur with many widely adopted compression algorithms. Another highly advantageous characteristic of the algorithm is the suppression of noise induced by low-quality sensors usually encountered in web-cameras, due to the manipulation of the transform coefficients at the compression stage. The proposed algorithm is designed to introduce minimal coding delay, thus achieving real-time performance. Performance is enhanced by utilizing the vast computational capabilities of modern GPUs, providing satisfactory encoding and decoding times at relatively low cost. These characteristics make this method suitable for applications like video-conferencing that demand real-time performance, along with the highest visual quality possible for each user. Through the presented performance and quality evaluation of the algorithm, experimental results show that the proposed algorithm achieves better or comparable visual quality relative to other compression and encoding methods tested, while maintaining a satisfactory compression ratio. Especially at low bitrates, it provides more human-eye friendly images compared to algorithms utilizing block-based coding, like the MPEG family, as it introduces fuzziness and blurring instead of artificial block artifacts.

AB - In recent years, real-time video communication over the internet has been widely utilized for applications like video conferencing. Streaming live video over heterogeneous IP networks, including wireless networks, requires video coding algorithms that can support various levels of quality in order to adapt to the network end-to-end bandwidth and transmitter/receiver resources. In this work, a scalable video coding and compression algorithm based on the Contourlet Transform is proposed. The algorithm allows for multiple levels of detail, without re-encoding the video frames, by just dropping the encoded information referring to higher resolution than needed. Compression is achieved by means of lossy and lossless methods, as well as variable bit rate encoding schemes. Furthermore, due to the transformation utilized, it does not suffer from blocking artifacts that occur with many widely adopted compression algorithms. Another highly advantageous characteristic of the algorithm is the suppression of noise induced by low-quality sensors usually encountered in web-cameras, due to the manipulation of the transform coefficients at the compression stage. The proposed algorithm is designed to introduce minimal coding delay, thus achieving real-time performance. Performance is enhanced by utilizing the vast computational capabilities of modern GPUs, providing satisfactory encoding and decoding times at relatively low cost. These characteristics make this method suitable for applications like video-conferencing that demand real-time performance, along with the highest visual quality possible for each user. Through the presented performance and quality evaluation of the algorithm, experimental results show that the proposed algorithm achieves better or comparable visual quality relative to other compression and encoding methods tested, while maintaining a satisfactory compression ratio. Especially at low bitrates, it provides more human-eye friendly images compared to algorithms utilizing block-based coding, like the MPEG family, as it introduces fuzziness and blurring instead of artificial block artifacts.

KW - Contourlet transform

KW - real-time video encoding

KW - GPU computing

KW - denoising

KW - video-conferencing

KW - surveillance video

U2 - 10.1117/12.924327

DO - 10.1117/12.924327

M3 - Conference contribution

SN - 9780819491299

T3 - Proceedings of SPIE

BT - Real-Time Image and Video Processing 2012

A2 - Kehtarnavaz, Nasser

A2 - Carolsohn, Matthias F.

PB - Society of Photo-Optical Instrumentation Engineers

ER -

Katsigiannis S, Papaioannou G, Maroulis D. A contourlet transform based algorithm for real-time video encoding. In Kehtarnavaz N, Carolsohn MF, editors, Real-Time Image and Video Processing 2012. Society of Photo-Optical Instrumentation Engineers. 2012. 843704. (Proceedings of SPIE). https://doi.org/10.1117/12.924327