Start by interpreting the research question clearly and planning a manageable workflow. Define the dataset: a curated set of 1,000 photographed film actors from 1990–1999, and specify how you will source images (publicly available archives, licensed photo collections, or screenshots) while documenting provenance and consent/usage limits. Create a spreadsheet recording each subject’s metadata: name, birth year, gender, perceived race/ethnicity, and any other demographic categories you will analyze. Draft a protocol for preprocessing (resolution, cropping, face alignment) so the input to FaceNet (v1) is consistent. Write an ethics statement that explains how you will protect privacy, why this historical dataset is being used, and how you will handle sensitive demographic labeling; submit this to your supervisor and follow IB academic honesty rules for using open-source software and external data. Schedule time to install and validate FaceNet (v1) on a controlled machine and run small pilot tests to confirm the model outputs embeddings and matching scores as expected before processing the full dataset.
Design the experimental methodology tightly: decide on the matching task (verification, identification, or both), the similarity metric and thresholding approach, and how you will split the dataset into probe/gallery sets or generate genuine/impostor pairs. Compute standard quantitative performance metrics: accuracy, precision, recall, F1, false accept/reject rates, ROC and DET curves, and area under curve (AUC). For demographic bias, report performance stratified by gender, age cohort, and race/ethnicity categories, and calculate fairness metrics such as disparate impact ratios, differences in false positive/negative rates, and equal opportunity gaps. Use statistical tests (chi-square, t-tests or bootstrap confidence intervals) to assess whether observed differences across groups are significant. Record all code, parameter settings, and random seeds so your experiments are reproducible and transparent.
When writing the essay, structure it around the research question: introduction of context and significance, detailed methods, quantitative results, fairness analysis, and interpretation. Present tables and clear graphs (confusion matrices, ROC curves, bar charts of group-wise metrics) and describe what they reveal in accessible language. Critically evaluate limitations: dataset representativeness, labeling errors, FaceNet (v1) architecture constraints, and potential confounders from image quality or makeup/costume. Conclude by relating findings back to the research question, discussing implications for real-world use and ethical deployment, and suggesting realistic next steps. Follow IB formatting, cite all software and data sources, and include an appendix with code snippets and raw results for examiner verification.