Webinar: Visual quality assessment for decision making in standardization projects

Springer, EURASIP Journal on Image & Video Processing, and European Association for Biometrics present a free monthly author series on the first Thursday of each month. We hope you can join us!

Published in Electrical & Electronic Engineering and Computational Sciences

Aug 06, 2024

Esinu Abadjivor


Explore the Research

SpringerLink

Remote expert viewing, laboratory tests or objective metrics: which one(s) to trust? - EURASIP Journal on Image and Video Processing

We present a study on the validity of quality assessment in the context of the development of visual media coding schemes. The work is motivated by the need for reliable means for decision-taking in standardization efforts of MPEG and JVET, i.e., the adoption or rejection of coding tools during the development process of the coding standard. The study includes results considering three means: objective quality metrics, remote expert viewing, which is a method designed in the context of MPEG standardization, and formal laboratory visual evaluation. The focus of this work is on the comparison of pairs of coded video sequences, e.g., a proposed change and an anchor scheme at a given rate point. An aggregation of performance measurements across multiple rate points, such as the Bjøntegaard Delta rate, is out of the scope of this paper. The paper details the test setup for the subjective assessment methods and the objective quality metrics under consideration. The results of the three approaches are reviewed, analyzed, and compared with respect to their suitability for the decision-taking task. The study indicates that, subject to the chosen test content and test protocols, the results of remote expert viewing using a forced-choice scale can be considered more discriminatory than the results of naïve viewers in the laboratory tests. The results further indicate that, in general, the well-established quality metrics, such as PSNR, SSIM, or MS-SSIM, exhibit a high rate of correct decision-making when their results are compared with both types of viewing tests. Among the learning-based metrics, VMAF and AVQT appear to be most robust. For the development process of a coding standard, the selection of the most suitable means must be guided by the context, where a small number of carefully selected objective metrics, in combination with viewing tests for unclear cases, appears recommendable.
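To make the idea of a "rate of correct decision-making" concrete, the following is a minimal sketch of how agreement between an objective metric and a viewing test could be counted over pairs of coded sequences. It is an illustration only and not the procedure used in the paper: the function name `correct_decision_rate`, the encoding of viewer preferences as +1, -1, and 0, the choice to skip pairs without a clear viewer preference, and all numbers are assumptions made for this example.

```python
from typing import Sequence


def correct_decision_rate(
    metric_a: Sequence[float],
    metric_b: Sequence[float],
    viewer_pref: Sequence[int],
    higher_is_better: bool = True,
) -> float:
    """Fraction of pairs where the metric prefers the same sequence as the viewers.

    viewer_pref holds +1 if viewers preferred sequence A, -1 if they preferred B,
    and 0 if the viewing test showed no clear preference (such pairs are skipped).
    This encoding is a hypothetical convention chosen for the sketch.
    """
    agree = 0
    decided = 0
    for a, b, pref in zip(metric_a, metric_b, viewer_pref):
        if pref == 0:
            continue  # no clear subjective preference, nothing to compare against
        diff = (a - b) if higher_is_better else (b - a)
        metric_pref = (diff > 0) - (diff < 0)  # sign of the metric difference
        decided += 1
        if metric_pref == pref:
            agree += 1
    if decided == 0:
        raise ValueError("no pairs with a clear viewer preference")
    return agree / decided


# Made-up illustrative values: PSNR in dB for a proposed scheme (A) and an anchor (B).
psnr_proposed = [36.2, 34.1, 38.0, 33.5]
psnr_anchor = [35.8, 34.5, 37.1, 33.9]
viewers = [1, -1, 0, 1]

rate = correct_decision_rate(psnr_proposed, psnr_anchor, viewers)
print(f"correct decision rate: {rate:.2f}")
```

In this toy data the metric agrees with the viewers on two of the three pairs that have a clear preference. A real evaluation would also need a rule for deciding when a viewer preference counts as statistically significant, which this sketch leaves out.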

Our next 1-hour webinar will take place Thursday, September 5 at 2:00pm CEST with esteemed author, Dr.-Ing. Mathias Wien.

RSVP here to join: https://cassyni.com/events/3gasWoMkxJY86L2SWn7Hdw?cb=functi

Title: Visual quality assessment for decision making in standardization projects

Abstract: In the development of compression standards for visual media, most decision making typically relies on measurements with one or more objective quality metrics. In many cases, a small number of very simple metrics, such as PSNR or SSIM, are applied in decision-making processes, e.g., in the adoption of coding tools into a draft specification. This applies to a variety of visual media under consideration, such as classical 2D video or various representations of immersive visual media like dynamic point clouds or meshes. Given the rise of learning-based coding tools and (apparently) competitive end-to-end learned coding schemes, as well as the increasing number of filtering blocks inside or outside of the coding loop of conventional coding schemes, the suitability of such metrics may be questioned. This is due to a potential lack of correlation with mean opinion scores acquired by subjective assessment, especially if specific artifacts, such as those affecting temporal consistency, are not well reflected by the metric. This problem can be even more significant for more advanced, potentially learning-based metrics, which may show unexpected behavior when applied to compression artifacts that were not known or seen when the corresponding metric was trained.
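As a reminder of how simple a metric like PSNR is, here is a minimal sketch that computes it from the mean squared error between two lists of sample values. The function name `psnr`, the flat-list input, and the example pixel values are assumptions made for illustration; production tools operate on full frames and typically handle bit depth and colour components separately.

```python
import math
from typing import Sequence


def psnr(reference: Sequence[float], distorted: Sequence[float],
         max_value: float = 255.0) -> float:
    """Peak signal-to-noise ratio in dB between two equally sized sample lists.

    max_value is the peak sample value (255 for 8-bit content).
    Returns infinity when the two inputs are identical.
    """
    if len(reference) != len(distorted) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all samples
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)


# Made-up 8-bit sample values for a tiny "frame"
ref = [100, 120, 130, 140]
dist = [101, 119, 132, 138]
print(f"PSNR: {psnr(ref, dist):.2f} dB")
```

Because the value depends only on sample-wise differences within a frame, a per-frame PSNR of this kind does not account for how errors change from one frame to the next, which is one way a metric can miss temporal effects.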

Advisory Group ISO/IEC SC 29/AG 5 MPEG Visual Quality Assessment is tasked with evaluating and recommending metrics and testing procedures for use in standardization projects within the MPEG Working Groups that develop compression standards for visual media. This webinar presents recent insights into the performance of metrics and subjective assessment methods for a variety of visual media types. The evaluation includes laboratory tests as well as remote and on-site expert viewing sessions, which are frequently conducted during MPEG standardization meetings. The results and the performance of such subjective tests are assessed and used to benchmark objective metrics commonly used or considered for application in the development process. Furthermore, an outlook is provided on the dataset of compressed video for study of quality metrics (CVQM), which is currently being developed in AG 5 and which includes reconstructed video sequences from a set of conventional and learning-based coding schemes.

Speaker Bio: Mathias Wien received the Diploma and Dr.-Ing. degrees from Rheinisch-Westfälische Technische Hochschule Aachen (RWTH Aachen University), Aachen, Germany, in 1997 and 2004, respectively. In 2018, he completed his habilitation, which makes him an independent scientist in the field of visual media communication. His research interests include image and video processing, immersive, space-frequency adaptive and scalable video compression, and visual quality assessment. Since 2020, Mathias has served as Convenor of ISO/IEC JTC1 SC29/AG5 “MPEG Visual Quality Assessment”. Mathias has been an active contributor to H.264/AVC, HEVC, and VVC. He has participated in and contributed to ITU-T VCEG, ISO/IEC MPEG, the Joint Video Experts Team (JVET), and preceding joint teams of VCEG and ISO/IEC MPEG. Mathias has published more than 80 scientific articles and conference papers in the area of video coding and has co-authored several patents in this area. Mathias has further authored and co-authored more than 250 standardization documents. He has published the Springer textbook “High Efficiency Video Coding: Coding Tools and Specification”, which fully covers Version 1 of HEVC.

We look forward to seeing you there!

Esinu Abadjivor


