Webinar: Visual quality assessment for decision making in standardization projects
Published in Electrical & Electronic Engineering and Computational Sciences
Our next 1-hour webinar will take place Thursday, September 5 at 2:00pm CEST with esteemed author, Dr.-Ing. Mathias Wien.
RSVP here to join: https://cassyni.com/events/3gasWoMkxJY86L2SWn7Hdw?cb=functi
Title: Visual quality assessment for decision making in standardization projects
Abstract: In the context of the development of compression standards for visual media, typically, most decision making relies on the measurement with one or more objective quality metrics. In many cases, a small number of very simple metrics, such as the PSNR or the SSIM, are applied in decision making processes, e.g., in the context of adoption of coding tools in to a draft specification. THis applies to a variety of visual media under consideration, such as classical 2D video or various representations of immersive visual media like dynamic point clouds or meshes. Given the rise of learning-based coding tools and -apparently- competitive end-to-end learned coding schemes, as well as the increasing number of filtering blocks inside or outside of the coding loop of conventional coding schemes, the suitability of such metrics may be questioned. This is due to a potential lack of correlation with mean opinion scores acquired by subjective assessment, especially if specific artifacts, such as temporal consistency, are not well reflected by the metric. This problem can be even more significant for more advanced, potentially learning-based metrics, which may show unexpected behavior if being applied to compression artifacts which have not been known or seen by the time of training the corresponding metric.
Advisory Group ISO/IEC SC 29/AG 5 MPEG Visual Quality Assessment is tasked with evaluating and recommending metrics and testing procedures for the use in standardization projects inside the body of MPEG Working Groups developing compression standards for visual media. This webinar presents recent insights in the performance of metrics and subjective assessment methods for a variety of visual media types. The evaluation includes laboratory tests as well as remote and on-site expert viewing sessions which are frequently conducted during MPEG standardization meetings. The results and the performance of such subjective tests are assessed and used to benchmark objective metrics commonly used or considered for application in the development process. Furthermore and outlook is provided to the dataset of compressed video for study of quality metrics (CVQM) which is currently being developed in AG 5 and which includes reconstructed video sequences from a set of conventional and learning-based coding schemes.
Speaker Bio: Mathias Wien received the Diploma and Dr.-Ing. degrees from Rheinisch-Westfälische Technische Hochschule Aachen (RWTH Aachen University), Aachen, Germany, in 1997 and 2004, respectively. In 2018, he achieved the status of the habilitation, which makes him an independent scientist in the field of visual media communication. His research interests include image and video processing, immersive, space-frequency adaptive and scalable video compression, and visual quality assessment. Since 2020, Mathias serves as Convenor of ISO/IEC JTC1 SC29/AG5 “MPEG Visual Quality Assessment”. Mathias has been an active contributor to H.264/AVC, HEVC, and VVC. He has participated and contribute to ITU-T VCEG, ISO/IEC MPEG, the Joint Video Experts Team (JVET) and preceding joint teams of VCEG and ISO/IEC MPEG. Mathias has published more than 80 scientific articles and conference papers in the area of video coding and has co-authored several patents in this area. Mathias has further authored and co-authored more than 250 standardization documents. He has published the Springer textbook “High Efficiency Video Coding: Coding Tools and Specification”, which fully covers Version 1 of HEVC.
We look forward to seeing you there!
Follow the Topic
Related Collections
With Collections, you can get published faster and increase your visibility.
Computational Imaging in the AI Era: Efficient Coding and Representation Techniques
As we navigate the complexities of the digital age, computational imaging has emerged as a pivotal discipline, integrating advanced algorithms and artificial intelligence to enhance image processing techniques. With the rapid proliferation of high-resolution imaging technologies, there is an increasing demand for efficient coding and representation techniques that can optimize storage, transmission, and analysis of visual data. This Collection aims to explore the intersection of computational imaging and AI, investigating how novel methodologies can transform the way we capture, process, and interpret images.
The significance of this research extends beyond technical advancements; it is crucial for various fields, including medical imaging, autonomous vehicles, and entertainment. Recent breakthroughs in machine learning and neural networks have enabled unprecedented levels of image enhancement and classification. Techniques such as deep learning-based compression and generative adversarial networks (GANs) have revolutionized how we approach image representation. As industries increasingly rely on image data, developing efficient coding techniques becomes essential for enhancing data throughput while minimizing resource consumption.
Looking ahead, the continued exploration of computational imaging holds immense potential for future innovations. We may witness the development of real-time image processing solutions that integrate seamlessly with augmented reality and virtual reality systems. Furthermore, as AI continues to evolve, we can expect smarter algorithms that not only compress and enhance images but also learn from user interactions, leading to more intuitive and personalized imaging experiences.
- Advanced compression techniques using deep learning
- Machine learning applications in medical imaging
- Efficient coding methods for high-resolution video
- Representation techniques in augmented and virtual reality
Researchers are invited to submit their work to this Collection, which aims to showcase innovative research that contributes to the advancements in computational imaging and efficient coding techniques.
The "Computational Imaging in the AI Era: Efficient Coding and Representation Techniques" Collection seeks contributions that explore the integration of artificial intelligence with computational imaging. This Collection welcomes research on advanced compression methods, machine learning applications in various imaging domains, and innovative representation techniques. By fostering collaboration and knowledge sharing, this Collection aims to advance the field of computational imaging and enhance our understanding of efficient coding in the digital age.
Publishing Model: Open Access
Deadline: Dec 30, 2026
AI-empowered Multimedia Processing for Behavioral and Social Computing
Behavioral and social computing aims to understand human actions, interactions, and societal dynamics through computational methods. The advent of AI has ignited a transformative era for behavioral and social computing, creating unprecedented urgency to advance human-centric multimedia processing that harnesses synthetic intelligence for societal benefit. As multimedia data (video, audio, physiological signals, etc.) becomes the cornerstone of understanding human behavior, advanced processing techniques to capture nuanced aspects of human behavior in real-world settings are essential to decode complex social and individual patterns. This special issue focuses on the intersection of multimedia processing and behavioral/social computing, exploring novel methodologies, applications, and ethical considerations in leveraging multimodal data to model human behavior, emotions, social dynamics, and cultural trends. The special issue aims to pioneer next-generation multimedia processing methodologies for human-centric AI research that understand and respond to human behaviors and social dynamics, empowering researchers to tackle real-world challenges in human-AI interaction, healthcare, education, psychology, computational social sciences, business and marketing, and any other related areas.
The special issue seeks high-quality, original contributions from researchers, practitioners, and industry experts to present novel methodologies, real-world applications, comprehensive reviews, and experimental validations that leverage multimodal data to advance behavioral insights and social intelligence. Additionally, extended versions of selected high-quality papers from BESC2025, as well as notable conferences, such as MM, KDD, CIKM, AAAI, IJCAI, ICML, CVPR, ICCV, ECCV, ICIP, ICASSP, ICME, ICMR, ICLR, within the field will be invited to enrich the scope of this special issue.
The topics of interest for this special issue include, but are not limited to:
• Multimodal behavioral and social sensing and analytics
• Generative AI for synthetic behavioral and social data generation
• AI-driven large-scale social dynamics mining and discovery
• Human-AI collaboration for behavioral and social computing
• Human-centered affective computing
• Immersive multimedia solutions for training or therapy
• Conversational AI for social cue interpretation
• Ethical human-AI alignment in behavioral and social interventions
• Multimedia applications for healthcare, education, psychology, computational social sciences, business and marketing, etc.
All submitted papers will undergo rigorous peer-review by specialists in the field, with accepted papers set to be published as soon as they are ready.
Publishing Model: Open Access
Deadline: Jul 31, 2026
Please sign in or register for FREE
If you are a registered user on Research Communities by Springer Nature, please sign in