Datenbestand vom 15. November 2024
Tel: 0175 / 9263392 Mo - Fr, 9 - 12 Uhr
Impressum Fax: 089 / 66060799
aktualisiert am 15. November 2024
978-3-8439-3995-9, Reihe Elektrotechnik
Daniel Merget Robust Facial Landmark Detection in the Wild
164 Seiten, Dissertation Technische Universität München (2019), Softcover, A5
Facial landmark detection is a well-studied topic in the field of computer vision that aims to find important key points in human faces. In the wild, the task is particularly challenging due to the high variability of shapes, expressions, poses, lighting conditions, and occlusions. This work presents a state-of-the-art approach to robustly solve the problem of facial landmark detection even under such difficult conditions.
A key novelty of the presented approach lies in the fact that it is based on a fullyconvolutional architecture, making it invariant to translation. Translation invariance is particularly useful when a separate face detector is not available, desirable, or reliable (enough). Fully-convolutional architectures, however, suffer from a comparatively narrow receptive field. This shortcoming is mitigated by a novel implicit kernel convolution. Multiple experiments verify that the implicit kernel convolution improves both landmark detection performance and convergence speed in comparison to other state-of-the-art approaches. Moreover, a proof of concept for face detection-free landmark detection based on the novel approach is provided. High resolutions are handled by a pyramid-like multi-resolution fusion approach, whereas low resolutions are handled by a super resolution mechanism. The presented approach therefore constitutes a generalizable way of robustly detecting facial landmarks in the wild.