Skip to main content

Estimating sex using discriminant analysis of mandibular measurements from a modern Greek sample

Abstract

Background

Sex determination is an integral and fundamental step in biological profile construction. The mandible, which holds many dimorphic traits, can be useful for sex estimation in the forensic context. However, reliable sex estimation usually relies on population-specific mandibular morphometric criteria. To date, no such criteria are available for the modern Greek population, and the present study aims to fill this lack of data by identifying the most sexually dimorphic mandibular traits on a modern Greek population sample and reporting the discriminant functions that can most effectively be used for estimating sex.

Materials and methods

For the purposes of this research, the 3D models of 194 adult mandibles (105 males and 89 females) from the Athens skeletal reference collection were used. A battery of 20 linear and 3 angular measurements was calculated from the 3D coordinates of anatomical landmarks positioned on the respective models and was analyzed by means of ANOVA and discriminant function analysis to investigate the expression of sexual dimorphism.

Results

The coronoid height, the ramus height, and the maximum mandibular length are the most sexually dimorphic metric traits of the mandible, while the produced sex discriminant functions yielded cross-validated classification accuracy up to 85.7% for the Greek sample. Furthermore, most of the examined combinations of measurements exhibited the same sex discriminant capacity between different reference samples, despite their respective discriminant functions being population specific.

Conclusions

Our findings indicate that the produced sex discriminant functions can be effectively used for sex determination in forensic casework and to verify the population specificity of these functions but also suggest that the expression of sexual dimorphism in the mandible shares certain features across different populations.

Background

It is well established that the estimation of various biological profile parameters, such as age at death and stature, is affected by sex (Bidmos et al. 2010; Krishan et al. 2016). Therefore, sex determination is an integral and fundamental step in biological profile construction from skeletal remains (Lin et al. 2014; Lopez-Capp et al. 2017). Although sex can be determined by forensic anthropologists with an accuracy of 90–100% when the entire skeleton is available (Byers 2016), in cases where the skeleton is incomplete or fragmented, sex determination can be more difficult (Passalacqua et al. 2013). Hence, the need of reliable sex discriminatory methods based on individual parts of the skeleton is evident.

Most human bones exhibit sexual dimorphism, while the pelvis is the most reliable sex estimator, followed by the skull and the humerus (France 1998). However, when the pelvis and the skull are fragmented or missing, the mandible can be useful for estimating sex in a forensic context, since it is often well preserved and it holds many dimorphic traits (Franklin et al. 2007; Franklin et al. 2008; Saini et al. 2011). The mandible exhibits sexual dimorphism from an early stage of life (Kano et al. 2015). More specifically, earlier studies have shown that the expression of sexual dimorphism can be detected from as early as 5 months old (Schutkowski 1993; Loth and Henneberg 2001; Scheuer 2002). Nevertheless, mandibular sexual dimorphism is reduced during childhood (between the ages of 4 and 14), while it becomes again most prominent during adulthood (Coquerelle et al. 2011). Furthermore, it has been demonstrated that the sex-related morphological characteristics exhibit both inter- and intra-population variability, which is influenced by a range of factors including genetics, nutrition, secular change, growth, and morbidity as well as other general environmental factors (Ubelaker and DeGaglia 2017).

Various sex discriminating morphological and morphometric traits of the mandible have been described in the literature, such as the “mandibular ramus flexure” or the “ramus length,” respectively (Hu et al. 2006; Dong et al. 2015; Sharma et al. 2016). However, morphometric analyses, as opposed to qualitative sex discriminant methods, are more commonly used due to its inherent objectivity, accuracy, reproducibility, and lower level of inter- and intra-observer errors (Saini et al. 2011; Carvalho et al. 2013; Sharma et al. 2016).

Linear discriminant analysis is a simple-to-implement yet accurate classifier for normally distributed datasets (Dixon and Brereton 2009). Hence, it is commonly used in forensic applications for developing sex discriminant functions (Franklin et al. 2005; Dayal et al. 2008; Ogawa et al. 2013). Over the years, numerous sex discriminant functions based on mandibular measurements have been reported for various populations (Giles 1964; Steyn and İşcan 1998; Franklin et al. 2008; Saini et al. 2011; Lin et al. 2014; Dong et al. 2015). Nevertheless, the subtlety of metric assessments renders any derived sex discriminant function highly population specific as well as sensitive to secular trends (Dayal et al. 2008). Hence, their sex discriminating capacity usually diminishes when applied on samples different from the reference population. To date, no such study has been conducted on a modern Greek population sample and the aim of the present study was to fill this gap by identifying those mandibular morphometric traits that can be most effectively used in sex assessment. Additionally, this study investigated whether and to what extent discriminant functions derived from different population samples produce reliable sex estimation on the modern Greek population sample as a means to identify possible similarities in the expression of sexual dimorphism among diverse population groups. Sex discriminant functions representative for four contemporary populations from South Africa, India, and Korea have been examined to this end.

Materials and methods

For the purposes of this research, 194 mandibles of adult Greek individuals were used. The sample consisted of 105 males and 89 females. Only individuals without any evidence of pathological deformation in the mandible and skull were selected. These individuals constitute part of the modern skeletal reference collection, also known as the Athens Collection, which is housed in the Department of Animal and Human Physiology at the National and Kapodistrian University of Athens. For each individual in the collection, sex, age at death, and cause of death as well as place and year of birth are known from death records (Eliopoulos et al. 2007). The age at death of these individuals ranges from 19 to 99 years old and their respective years of birth span from 1879 to 1965. Previous work has shown that the Athens collection is a fairly representative reference sample of the twentieth century Greek population (Bertsatos and Chovalopoulou 2017).

Data acquisition

The mandibles were previously digitized by means of 3D photogrammetry (Photoscan Pro version 1.4, Agisoft LLC, Russia) with a cross-validated model accuracy of 0.2 mm between repetitive digital and caliper-based measurements used for scaling. The ongoing digital documentation of the Athens Collection aims to the preservation of the physical reference collection; hence, the 3D models were utilized in the present study. The three-dimensional coordinates of 16 landmarks (3 midline and 13 bilateral) were extracted from the respective models with MeshLab software (Cignoni et al. 2008). All landmarks used in the present study are illustrated in Fig. 1, and their selection aimed to most appropriately capture the shape of the mandible but also facilitate comparisons with specific sets of measurements used in the existing literature. Most of the landmark definitions were obtained from Franklin et al. (2008), while mental foramen and lingula mandibulae were defined by Bejdová et al. (2013). The non-standard landmarks mandibular tubercle, aMRB, pMRB, and prp were incorporated to facilitate specific measurements. All measurements are listed in Table 1 along with their definitions. In total, 23 measurements were used in the subsequent analysis, from which three were angular and 11 bilateral linear measurements. Bilateral measurements were calculated for both sides of each mandible.

Fig. 1
figure 1

Landmarks and corresponding locations

Table 1 Definitions of the measurements used in this study

Data analysis

Intra- and inter-observer error was estimated using absolute and relative technical error measurement (ΤΕΜ) as well as reliability coefficient R for each measurement based on a subsample of 20 randomly chosen individuals. The expression of sexual dimorphism was identified by means of ANOVA and univariate linear discriminant analysis on each measurement to identify those that can be most effectively used in sex assessment regarding the Greek population. Subsequently, the measurements with correct cross-validated classification in excess of 75% were analyzed by means of multivariate discriminant function analysis, using both enter and stepwise methods, to identify the combination of metric traits that can be most effectively used in sex assessment.

In order to investigate whether and to what extent the mandible-based sex discriminant functions (DFs) are population specific, previously published DFs (from different population samples) were produced from the Greek population sample for comparing their cross-validated classification accuracy with that reported on the original population samples. More specifically, the combination of measurements of one DF from a South African Whites population sample (Steyn and İşcan 1998), three DFs from a South African Blacks population sample (Franklin et al. 2008), two DFs from an Indian population sample (Saini et al. 2011), two DFs from a Korean population sample (Lin et al. 2014), and one DF from a contemporary Han Chinese population sample (Dong et al. 2015) were used for this comparison. Additionally, the classification accuracy of the previously reported DFs was also tested on the Greek sample as a measure of how their sex discriminant capacity changes when applied on a different sample than the reference population.

The evaluation of intra- and inter-observer error as well as the correct classification accuracy of applying previously published DFs on the Greek population sample were performed with the GNU Octave programming language (Eaton et al. 2016), while ANOVA and discriminant function analysis (DFA) were performed with SPSS (IBM SPSS version 23.0, Armonk, NY).

Results

Intra- and inter-observer error, by means of absolute technical error measurement (TEM), relative TEM and R coefficient, are presented in Table 2. Regarding the intra-observer error, absolute TEMs ranged within 0.16–0.43 mm, while the values of the relative TEM ranged within 0.27–1.36%. Our findings showed that the most precise measurement was BicBR and the least precise one was SyH. The corresponding ranges concerning the inter-observer error were 0.28–0.84 mm and 0.46–2.77%, respectively. According to the results, the most consistent measurement was MaxML (left side) while MinRBR (right side) provided the lowest consistency. R coefficients showed good reliability (R > 0.95) for almost all measurements with the exception of six measurements (Agn–cdl, BilBR, SyBR, left-side NL, left- and right-side MinRBR), whose R values ranged from 0.909 to 0.948.

Table 2 Intra- and inter-observer error results

Descriptive statistics for each measurement variable are presented in Table 3, whereas Table 4 lists the respective results of ANOVA and linear discriminant analysis. Males exhibit higher mean values in all linear measurements over females and lower mean values in angular measurements with the sole exception of the angle Agn–go. The univariate linear discriminant analysis revealed that only five metric traits exhibited a cross-validated correct classification percentage over 75%. These traits concerned the right-side measurements, RH, MRH, CH, CGL and MaxML, which were subsequently used in multivariate DFA.

Table 3 Descriptive statistics for each measurement variable
Table 4 ANOVA and linear discriminant analysis for each measurement variable

Tables 5 and 6 summarize the statistics of the discriminant function analysis for all the DFs derived from the Greek population sample for the right- and the left-side measurements, respectively. The first nine DFs in each table concern the corresponding functions based on previously published works, whereas the latter concern the statistically permissible DFs produced by enter and stepwise methods with the most sexually dimorphic measurements found on the Greek population sample. Although no statistically significant differences were observed between the measurements of the right and left side, the corresponding equations based on the left-side measurements were produced in order to detect any differences in classification accuracy, but also for comparative consistency with previously published research.

Table 5 Discriminant functions based on the right-side measurements
Table 6 Discriminant functions based on the left-side measurements

The cross-validated classification accuracy of the examined DFs ranged from 77.1 to 85.7% with Function 7 (based on MRH and MinRBR measurements) exhibiting the lowest accuracy for both the left- and the right-side measurements and Function 4 (based on BigBR, CGL, MaxBL, and MaxML measurements) exhibiting the highest. Regarding the DFs produced by combining the most sexually dimorphic measurements from the Greek population sample, their accuracy was consistently over 82%. The DFs based on the right-side measurements exhibited slightly better overall performance, although Function 15 yielded the best classification score (84.3%) on the left-side measurements. Regarding the nine DFs produced from our sample, which correspond to previously published functions, in most of the cases, the equations based on the right-side measurements outperform their left-side counterparts. The only exceptions concern the Functions 5 and 7, which showed negligible improvement on the left side.

Table 7 summarizes the classification accuracy of the application of the previously reported DFs on the Greek sample. As expected, their discriminant capacity is reduced as compared to the classification accuracy of the respective DFs derived from the Greek sample. More specifically, the accuracy ranged from 63.78 to 83.84% when applied on the left-side measurements and from 68.11% to 83.84% when the right-side measurements were used. The highest sex discriminant capacity was observed with Function 3, which incorporated nine variables and reached 83.84% correct classification for both sides. The lowest correct classification accuracy was exhibited by Function 1, which correctly classified 63.79% and 68.11% of the Greek sample, when using the left-and the right-side measurements, respectively. Finally, it should be noted that the DF reported by Dong et al. (2015) could not be applied on our sample, since they omitted to report the DF’s constant value in their paper.

Table 7 The accuracy of other researchers’ equations as well as their accuracy when applied on the Greek population sample

Discussion and conclusions

In anthropometric studies, the determination of measurement error is essential (Ulijaszek and Lourie 1994). According to this study, all measurements exhibited an acceptable measurement error indicating high repeatability and precision, whereas their respective R coefficients demonstrated that all measurements may be regarded as reliable. These findings are in agreement with those reported by Toneva et al. (2016). Furthermore, most of the measurements are below the intra- and inter-observer %TEM thresholds of 1% and 1.5%, respectively, which are usually considered acceptable for skillful anthropometrists (Perini et al. 2005). Despite that the photogrammetric 3D modeling of the mandibles was not part of the present study, the reported cross-validated accuracy between digital and manual measurements, which was part of the digital documentation process, is smaller than the inter- and intra-observer absolute TEMs reported in the present study. Hence, the present results and DFs can be applied to either digital or manual measurements without any inter-method measurement error weakening their utility.

The aim of the present study has been twofold. We aimed to identify the mandibular measurements based on a modern Greek population sample that yield the highest sex discriminant capacity and use them to produce multivariate DFs that most successfully can estimate sex. Meanwhile, we investigated the performance of previously published DFs derived from different population samples. Population specificity is a long and well-established observation in osteometric studies also evident from our results (Giles 1964; İşcan and Steyn 1999; Franklin et al. 2007). Nevertheless, the rationale of our approach was not limited to a mere validation of population specificity but further aimed to identify particular morphometric traits that consistently yield high sex classification results among different population groups even though their expression patterns may vary among these groups.

Our findings regarding the most sex discriminant mandibular traits were consistent with earlier studies by Franklin et al. (2006, 2008), who found that the coronoid height, the ramus height, and the maximum mandibular length univariately exhibited the most pronounced sexual dimorphism. Furthermore, the produced DF (Function 10, Table 5), when combining these measurements from the Greek population sample, yielded a cross-validated accuracy of 84.1%. Although the cranium and the pelvis provide much more reliable sex estimates (Oikonomopoulou et al. 2017; Bertsatos et al. 2018), our results verify that the mandible can be useful for estimating sex in a forensic context, when cranial and pelvic elements are missing or deteriorated.

Franklin and colleagues in 2008 studied 225 individuals (120 male; 105 female) from five local populations of indigenous South Africans, in order to produce a series of mandibular metric standards for sex estimation, which resulted in Functions 2, 3, and 4 (see Table 7). The corresponding DFs based on the Greek sample (Tables 5 and 6) yielded the highest sex discriminant scores observed in the present study ranging from 84.3 to 85.7%. More specifically, Function 3, which includes nine mandibular measurements, yielded similar classification accuracy on our sample (84.8%) with that reported by Franklin and colleagues on their population sample (84%). Furthermore, applying the original DF (Franklin et al. 2008) on the Greek population sample also resulted to similar accuracy (83.84%). However, the accuracy of Functions 2 and 4, which utilize three and four mandibular measurements, respectively, was higher for the DFs derived from the Greek population (Function 2: right side 85.2%, left side 84.3%; Function 4: right side 85.7%, left side 84.8%) as opposed to their counterparts derived from the indigenous South African population (Function 2: 81.8%; Function 4: 82.7%). Additionally, applying the original DFs on the Greek population sample yielded even lower classification scores (see Table 7). Despite that Functions 2 and 4 resulted in differential performance between the two distinct population groups, which can be attributed to population specificity, the overall performance of these three functions implies some merit to the proposed metric standards by Franklin et al. (2008).

Steyn and İşcan in 1998 evaluated sexual dimorphism in the cranium and the mandible of South African Whites and developed osteometric standards to determine sex. They studied 91 South African Whites (44 males, 47 females) from cadaver collections housed at the Universities of Pretoria and Witwatersrand (Dart Collection). Although comparing their results to our corresponding DF results (Function 1) revealed similar classification accuracy (~ 81.5%), applying the original DF (Steyn and İşcan 1998) on the modern Greek population sample exhibited much lower accuracy (left side: 63.78%; right side: 68.11%). The same pattern was observed on the rest of the comparisons between different population samples. The corresponding DFs produced similar classification accuracy between their respective reference population samples, but applying the original DFs (from other population samples) on the Greek sample resulted in significantly reduced accuracy.

More specifically, Saini et al. (2011), working on 116 dry adult mandibles of a Northern Indian population sample from the Department of Forensic Medicine in India, reported classification accuracy of 80.2% for both Functions 5 and 6, which on our sample yielded approximately 82% and 79%, respectively, with small deviations on each side. Lin et al. (2014) produced their DFs from cranial CT scans of 120 males and 120 females from Seoul St. Mary’s Hospital. Despite most of their reported DFs were based on different set of measurements than these utilized in the present study, which restricted a direct comparison with their most accurate DFs in terms of correct sex classification, the available comparisons showed similar classification score for Function 7 (77.2% on Greek sample; 80.8% on Korean sample) and identical accuracy for Function 8 (80.4%). Similarly, the DF reported by Dong and colleagues based on a contemporary Han Chinese population sample yielded 83.3% classification accuracy (Dong et al. 2015), whereas the corresponding DF from the Greek sample produced 82.7% and 84.9% cross-validated classification scores for the left- and right-side measurements, respectively.

The present study produced a number of suitable DFs based on mandibular measurements that can be used for sexing unidentified individuals assumed to belong to the modern Greek population. Apart from verifying the fact that most often the morphometric DFs based on human bones exhibit population specificity, hence the need for studies on different population samples, the comparative part of this work also revealed some interesting aspects. The observation that the same combinations of measurements yield the same sex discriminant capacity between different population samples, despite their respective DFs being population specific, implies that the magnitude of expression of sexual dimorphism is similar on certain mandibular morphometric traits and shared across different populations, although their expression may follow different patterns in each population group. Regarding the identical results of Function 3 (Franklin et al. 2008) and especially the identical classification accuracy when applying the DF derived from indigenous South Africans on the Greek population, we cannot conclude whether this observation is a mere statistical coincidence between the two reference samples or the result of both population groups sharing similar environmental and developmental factors that led to similar expression of sexual dimorphism in their mandibles. Further work is necessary to this end, and studies on shared datasets comprising diverse population samples may expand our insight on this matter and provide more reliable sex discriminant tools for the forensic practice.

Abbreviations

DF:

Discriminant function

DFA:

Discriminant function analysis

TEM:

Technical Error Measurement

References

Download references

Acknowledgements

Not applicable.

Funding

This research did not receive any grant from funding agencies in the public, commercial, or not-for-profit sectors.

Availability of data and materials

Please contact author for data requests.

Declarations

The authors declare that this manuscript is original, has not been published before, and is not currently being considered for publication elsewhere.

Author information

Authors and Affiliations

Authors

Contributions

AB contributed to the data collection, statistical analysis, and manuscript preparation. KA contributed to the data collection and manuscript preparation. M-EC contributed to the experimental design, statistical analysis, and manuscript preparation. All authors have read and approved the final article.

Corresponding author

Correspondence to Maria-Eleni Chovalopoulou.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bertsatos, A., Athanasopoulou, K. & Chovalopoulou, ME. Estimating sex using discriminant analysis of mandibular measurements from a modern Greek sample. Egypt J Forensic Sci 9, 25 (2019). https://doi.org/10.1186/s41935-019-0133-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s41935-019-0133-7

Keywords