pernillekopperuddefense
Abstract of her thesis:
Today, most modern face recognition systems are based on deep learning techniques that require a large amount of labeled training data in order to perform adequately. With the introduction of privacy laws and regulations such as the GDPR, it has become increasingly difficult to collect large and diverse datasets of face images sufficient to train well-performing face recognition systems. Consequently, the use of synthetic data to train face recognition systems has gained attention. This is because synthetic data has the potential to both alleviate the privacy issues faced when collecting real-world data but also offer greater control over the data used to train the face recognition systems. In this thesis, the use of synthetic data to train face recognition systems is explored. Specifically, the thesis aims to explore if it is possible to build an unbiased synthetic dataset that can be used to fine-tune existing face recognition systems to reduce bias with respect to age. Further, the thesis aims to investigate how the use of synthetic face images affects the performance of face recognition systems. The results show that there exists a prominent domain gap between synthetic and real face images that causes the performance of face recognition systems fine-tuned on synthetic data to generalize poorly to real-world data. Introducing real face images to the synthetic training dataset can help close the domain gap and boost the performance of the system. Furthermore, the results show that using an unbiased synthetic dataset has the potential to reduce bias with respect to age if the domain gap is closed. Code implementation is available at https://github.com/pernilko/MSc_FR_Bias.