Facial identity recognition using StyleGAN3 inversion and improved tiny YOLOv7 model.

Advanced Search

Akhil Kumar, Swarnava Bhattacharjee, Ambrish Kumar, Dushantha Nalin K Jayakody

Author Information

Akhil Kumar: School of Computer Science Engineering and Technology, Bennett University, Greater Noida, India.
Swarnava Bhattacharjee: Liverpool John Moores University, Liverpool, England.
Ambrish Kumar: School of Computer Science Engineering and Technology, Bennett University, Greater Noida, India.
Dushantha Nalin K Jayakody: COPELABS, Lusófona University, Lisboa, Portugal. dushantha.jayakody@ulusofona.pt.

PMID: 40097614 DOI: 10.1038/s41598-025-93096-0

Facial identity recognition is one of the challenging problems in the domain of computer vision. Facial identity comprises the facial attributes of a person's face ranging from age progression, gender, hairstyle, etc. Manipulating facial attributes such as changing the gender, hairstyle, expressions, and makeup changes the entire facial identity of a person which is often used by law offenders to commit crimes. Leveraging the deep learning-based approaches, this work proposes a one-step solution for facial attribute manipulation and detection leading to facial identity recognition in few-shot and traditional scenarios. As a first step towards performing facial identity recognition, we created the Facial Attribute Manipulation Detection (FAM) Dataset which consists of twenty unique identities with thirty-eight facial attributes generated by the StyleGAN3 inversion. The Facial Attribute Detection (FAM) Dataset has 11,560 images richly annotated in YOLO format. To perform facial attribute and identity detection, we developed the Spatial Transformer Block (STB) and Squeeze-Excite Spatial Pyramid Pooling (SE-SPP)-based Tiny YOLOv7 model and proposed as FIR-Tiny YOLOv7 (Facial Identity Recognition-Tiny YOLOv7) model. The proposed model is an improvised variant of the Tiny YOLOv7 model. For facial identity recognition, the proposed model achieved 10.0% higher mAP in the one-shot scenario, 30.4% higher mAP in the three-shot scenario, 15.3% higher mAP in the five-shot scenario, and 0.1% higher mAP in the traditional 70% - 30% split scenario as compared to the Tiny YOLOv7 model. The results obtained with the proposed model are promising for general facial identity recognition under varying facial attribute manipulation.

Face detection Facial attribute manipulation Facial identity recognition StyleGAN3 Tiny YOLOv7

Multimed Tools Appl. 2023;82(6):9243-9275 [PMID: 35968414]
IEEE Trans Image Process. 2019 Nov;28(11):5464-5478 [PMID: 31107649]
IEEE Trans Pattern Anal Mach Intell. 2021 Dec;43(12):4217-4228 [PMID: 32012000]
Cognition. 2024 Feb;243:105668 [PMID: 38043180]
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14777-14788 [PMID: 37616132]
IEEE Trans Image Process. 2023;32:5893-5908 [PMID: 37889810]
Arch Comput Methods Eng. 2021;28(7):4503-4521 [PMID: 33824572]
IEEE Trans Pattern Anal Mach Intell. 2015 Sep;37(9):1904-16 [PMID: 26353135]
IEEE Trans Image Process. 2004 Apr;13(4):600-12 [PMID: 15376593]

SPARC/2024-2025/NXTG/P3524/Scheme for Promotion of Academic and Research Collaboration

Humans

Male

Female

Facial Recognition

Deep Learning

Automated Facial Recognition

Face

Adult

Young Adult

Algorithms

Image Processing, Computer-Assisted

Journal Article

No available data.

OpenLB
Open Library of Bioscience