This research compares the performance of two popular datasets, Labelled Faces in the Wild (LFW) and faces94, in the task of face recognition using Convolutional Neural Networks (CNN) algorithms. The LFW dataset is known for its high variation in pose, lighting, and expression, while faces94 is more structured with more uniform lighting and pose conditions. CNNs were chosen for their ability to extract important features from face images for classification. In this study, a CNN model was trained on both datasets and its performance was evaluated using accuracy, precision, and recall metrics. The experimental results showed that the model trained on the faces94 dataset achieved higher accuracy compared to the model trained on the LFW dataset. However, the model on the LFW dataset demonstrated better resilience to variations in lighting and pose conditions. These findings indicate that while a more structured dataset like faces94 can produce a model with high accuracy under testing conditions similar to the training data, a dataset with greater variation like LFW is more suitable for real-world applications involving diverse conditions. This study provides important insights into the selection of datasets for developing robust and accurate face recognition systems.
Read full abstract