Unveiling the Ace in the Hole: Leveraging Uncertainty Quantification for Computer Vision Systems

URI: http://hdl.handle.net/10900/147788
http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-1477889
http://dx.doi.org/10.15496/publikation-89129
Document type: PhDThesis
Date: 2023-11-16
Language: English
Faculty: 7 Mathematisch-Naturwissenschaftliche Fakultät
Department: Informatik
Advisor: Akata, Zeynep (Prof. Dr.)
Day of Oral Examination: 2023-11-09
DDC Classification: 004 - Data processing and computer science
510 - Mathematics
Keywords: Artificial intelligence, machine learning, machine vision, uncertainty quantification
License: http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=de (German), http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=en (English)

Abstract:

As machine learning systems become increasingly complex and autonomous, the integration of uncertainty quantification becomes crucial, especially in high-stakes domains such as healthcare and autonomous driving, where ambiguity can lead to severe consequences. By offering a clear gauge of prediction confidence, uncertainty quantification supports informed decision-making and risk management.

Within healthcare, where diagnostic procedures often depend on various imaging modalities, modern machine learning methods are being harnessed to aid diagnosis. Recent advances in generative machine learning explore the synthesis of different medical imaging modalities, predominantly through image-to-image translation. Our work demonstrates that integrating aleatoric uncertainty into Generative Adversarial Networks (GANs) for these translation tasks can improve both interpretability and accuracy. This, in turn, supports healthcare professionals in making better diagnostic and treatment decisions, enhancing patient outcomes.

In autonomous driving and similar applications, resilience to unforeseen perturbations is vital. Traditional deterministic models may falter when confronted with new situations, constituting a safety hazard. We address this by adopting a probabilistic approach to dense computer vision tasks and by using the Likelihood Annealing technique for uncertainty estimation. These methods improve robustness to unexpected situations and provide a calibrated uncertainty measure, contributing to the development of safer autonomous systems.

While creating new probabilistic machine learning solutions for vital applications is a key research area, it is equally important to develop methods that leverage large-scale pretrained models. These deterministic models can be adapted to estimate uncertainties in a manner that is cost-efficient with respect to data, computation, and other resources, a direction we explore in this thesis. The work presented here addresses this issue within the context of current computer vision systems, including the large-scale vision-language models crucial for enabling intelligent multimodal systems.
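To make the first idea concrete, the following is a minimal PyTorch sketch of how aleatoric uncertainty can be folded into an image-to-image translation generator: the network predicts a per-pixel mean and log-variance, and the reconstruction term becomes a heteroscedastic Gaussian negative log-likelihood. All class names, layer sizes, and the toy data are illustrative assumptions; the thesis's actual architectures and losses may differ.

```python
import torch
import torch.nn as nn

class UncertaintyGenerator(nn.Module):
    """Toy encoder-decoder that outputs a translated image and a per-pixel
    log-variance map (the log parameterization keeps the variance positive)."""
    def __init__(self, in_ch: int = 1, out_ch: int = 1, width: int = 32):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(in_ch, width, 3, padding=1), nn.ReLU(),
            nn.Conv2d(width, width, 3, padding=1), nn.ReLU(),
        )
        self.mean_head = nn.Conv2d(width, out_ch, 3, padding=1)
        self.logvar_head = nn.Conv2d(width, out_ch, 3, padding=1)

    def forward(self, x):
        h = self.backbone(x)
        return self.mean_head(h), self.logvar_head(h)

def heteroscedastic_nll(mean, logvar, target):
    """Gaussian negative log-likelihood: pixels the model is unsure about get
    a large predicted variance, which down-weights their residuals."""
    return (0.5 * torch.exp(-logvar) * (target - mean) ** 2
            + 0.5 * logvar).mean()

# Usage on random stand-in tensors; in a full GAN this likelihood term
# would be added to the adversarial loss.
gen = UncertaintyGenerator()
src = torch.randn(4, 1, 64, 64)   # source-modality batch
tgt = torch.randn(4, 1, 64, 64)   # target-modality batch
mean, logvar = gen(src)
loss = heteroscedastic_nll(mean, logvar, tgt)
loss.backward()
```

The log-variance head is a common design choice here: it avoids a positivity constraint on the variance, and the predicted map doubles as an interpretable per-pixel uncertainty image that a clinician could inspect alongside the translated scan.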

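The last paragraph of the abstract, on adapting frozen pretrained models, can likewise be illustrated with a small sketch. Below, a large deterministic model is kept frozen while only a lightweight auxiliary head is trained to predict an error scale around its outputs. This is an assumption-laden simplification of the idea (a Gaussian head on frozen predictions), not the thesis's exact method, and all names are hypothetical.

```python
import torch
import torch.nn as nn

# Stand-in for a large pretrained deterministic model; frozen throughout.
frozen = nn.Sequential(nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
                       nn.Conv2d(16, 1, 3, padding=1))
for p in frozen.parameters():
    p.requires_grad_(False)          # no gradients through the base model

# Small trainable module that maps the frozen prediction to a log-variance.
var_head = nn.Sequential(nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
                         nn.Conv2d(16, 1, 3, padding=1))

opt = torch.optim.Adam(var_head.parameters(), lr=1e-3)
x, y = torch.randn(4, 1, 64, 64), torch.randn(4, 1, 64, 64)

with torch.no_grad():
    y_hat = frozen(x)                # frozen prediction, treated as the mean
logvar = var_head(y_hat)             # predicted per-pixel log-variance
nll = (0.5 * torch.exp(-logvar) * (y - y_hat) ** 2 + 0.5 * logvar).mean()
nll.backward()                       # updates only the uncertainty head
opt.step()
```

Because only the small head is optimized, the data and compute cost is a fraction of retraining the base model, which is the cost-efficiency argument the abstract makes for leveraging large-scale pretrained systems.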