Autonomous face mask detection using single shot multibox detector, and ResNet-50 with identity retrieval through face matching using deep siamese neural network.

S Vignesh Baalaji, S Sandhya, S A Sajidha, V M Nisha, M D Vimalapriya, Amit Kumar Tyagi
Author Information
  1. S Vignesh Baalaji: Present Address: School of Compter Science and Engineering, Vellore Institute of Technology, Chennai, India. ORCID
  2. S Sandhya: Present Address: School of Compter Science and Engineering, Vellore Institute of Technology, Chennai, India. ORCID
  3. S A Sajidha: Present Address: School of Compter Science and Engineering, Vellore Institute of Technology, Chennai, India. ORCID
  4. V M Nisha: Present Address: School of Compter Science and Engineering, Vellore Institute of Technology, Chennai, India. ORCID
  5. M D Vimalapriya: Post Graduate Department of Computer Science and Technology, Women's Christian College, Chennai, India. ORCID
  6. Amit Kumar Tyagi: Present Address: School of Compter Science and Engineering, Vellore Institute of Technology, Chennai, India. ORCID

Abstract

The COVID-19 pandemic poses a global health challenge. The World Health Organization states that face masks are proven to be effective, especially in public areas. Real-time monitoring of face masks is challenging and exhaustive for humans. To reduce human effort and to provide an enforcement mechanism, an autonomous system has been proposed to detect non-masked people and retrieve their identity using computer vision. The proposed method introduces a novel and efficient method that involves fine-tuning the pre-trained ResNet-50 model with a new head layer for classification between masked and non-masked people. The classifier is trained using adaptive momentum optimization algorithm with decaying learning rate and binary cross-entropy loss. Data augmentation and dropout regularization are employed to achieve best convergence. During real-time application of our classifier on videos, a Caffe face detector model based on Single Shot MultiBox Detector is used to extract the face regions of interest from each frame, on which the trained classifier is applied for detecting the non-masked people. The faces of these people are then captured, which is passed on to a deep siamese neural network, based on VGG-Face model for face matching. The captured faces are compared with the reference images from the database, by extracting the features and calculating cosine distance. If the faces match, the details of that person are retrieved from the database and displayed on the web application. The proposed method has secured best results where the trained classifier has achieved 99.74% accuracy, and the identity retrieval model achieved 98.24% accuracy.

Keywords

References

  1. Sustain Cities Soc. 2021 Feb;65:102600 [PMID: 33200063]
  2. IEEE Trans Instrum Meas. 2021 Mar 30;70:5009612 [PMID: 37982043]
  3. Sustain Cities Soc. 2021 Jan;64:102568 [PMID: 33110743]
  4. Methods Mol Biol. 2021;2190:73-94 [PMID: 32804361]
  5. Appl Soft Comput. 2021 Oct;110:107610 [PMID: 36569211]
  6. Measurement (Lond). 2021 Jan 1;167:108288 [PMID: 32834324]
  7. Sustain Cities Soc. 2021 Mar;66:102692 [PMID: 33425664]

Word Cloud

Created with Highcharts 10.0.0facepeopleusingmodelclassifierproposednon-maskedidentitymethodtrainedfacessiameseneuralmatchingCOVID-19pandemicmasksvisionResNet-50learningbestapplicationdetectorbasedcaptureddeepnetworkdatabaseachievedaccuracyretrievaldetectionDeepposesglobalhealthchallengeWorldHealthOrganizationstatesproveneffectiveespeciallypublicareasReal-timemonitoringchallengingexhaustivehumansreducehumaneffortprovideenforcementmechanismautonomoussystemdetectretrievecomputerintroducesnovelefficientinvolvesfine-tuningpre-trainednewheadlayerclassificationmaskedadaptivemomentumoptimizationalgorithmdecayingratebinarycross-entropylossDataaugmentationdropoutregularizationemployedachieveconvergencereal-timevideosCaffeSingleShotMultiBoxDetectorusedextractregionsinterestframeapplieddetectingpassedVGG-Facecomparedreferenceimagesextractingfeaturescalculatingcosinedistancematchdetailspersonretrieveddisplayedwebsecuredresults9974%9824%AutonomousmasksingleshotmultiboxComputernetworksFaceMask

Similar Articles

Cited By

No available data.