Skip to main content

Sex and age estimation with machine learning algorithms with parameters obtained from cone beam computed tomography images of maxillary first molar and canine teeth

Abstract

Background

The aim of this study is to obtain a highly accurate and objective sex and age estimation by using the parameters of maxillary molar and canine teeth obtained from cone beam computed tomography images in the input of machine learning algorithms. Cone beam computed tomography images of 240 people aged between 25 and 54 were randomly selected from the archive systems of the hospital and transferred to Horos Medikal. 3D curved multiplanar reconstruction was applied to these images and a 3D image was obtained. The resulting image was brought to the orthogonal plane and the measurements were made by superimposing them.

Results

The results were grouped in four different age groups (25–30, 31–36, 37–49, 50–54) and recorded. As a result of our study, the highest accuracy rate was found as 0.81 in sex estimation with ADA Boost Classifier algorithm, while in age estimation, the highest accuracy rate was found as 0.84 between 25–30 and 31–36 age groups with random forest algorithm, as 0.74 between 25–30 and 37–49 age groups with random forest and ADA Boost Classifier algorithms and as 0.85 between 25–30 and 50–54 age groups with random forest algorithm.

Conclusions

Our study differs from other studies in two aspects; the first is the selection of a sensitive method such as cone beam computed tomography, and the second is the selection of machine learning algorithms. As a result of our study, the highest accuracy rate was found as 0.81 in sex estimation and as 0.85 in age estimation with parameters of maxillary canine and molar teeth.

Background

Sex and age estimation of individuals has an important place in forensic anthropology, forensic odontology and forensic medicine studies. Sex and age estimation both facilitate and accelerate identification of individuals; therefore, while sex is the first biomarker sought in forensic studies, the second one is age (Capitaneanu et al. 2017; Pereira et al. 2020).

Skull, pelvis, sternum, phalanx, patella, and mandible bones are used in forensic studies (Oner et al. 2019; Senol et al. 2022; Toy et al. 2022; Turan et al. 2019). However, teeth, mandible, and maxilla are also frequently preferred. Due to their compact structure and the fact that they are surrounded by facial muscles, teeth are among the structures that preserve their integrity for a long time after death. Thanks to this compact structure, teeth come to the forefront in cases such as fire and war when the body integrity is impaired (Narang et al. 2014).

Age estimation is critically important not only in post-mortem cases, but also in living individuals. Especially after the 2000s, it has been used in criminal cases to estimate the age of individuals who do not want to tell their real age, or in civil cases to estimate the exact age of individuals who want to learn their full age (Azevedo et al. 2015).

Although genetic, histological, radiological, and biochemical methods are used in forensic analyses, most of these are not economical and fast. Radiological methods are widely used both because they are fast and economical and also because they allow for segmentation. Of these radiological methods, computed tomography (CT) stands out with its widespread use (Alias et al. 2018; Nayar et al. 2017).

Machine learning algorithm (ML) is an engineering based method that can give more objective and better results than classical methods and it has begun to be used in the field of health today (Erickson et al. 2017; Rajkomar et al. 2019). Logistic regression (LR) is an extension of ordinary regression analysis and it is a strong classification algorithm that can model which class the data belongs to. Decision tree (DT) is a strong, old, quick-to-learn, and easy to interpret tree algorithm that classifies data with a structure similar to tree. Random forest (RF) is an ML algorithm with a feature that classifies ensembles consisting of multiple tree structures (Uddin et al. 2019). Linear discriminant analysis (LDA) is an advanced version of principal component analysis and it is frequently preferred in the classification of linear data. Quadratic discriminant analysis (QDA) is a version of LDA used in non-linear data and it is more flexible than LDA (Jeng et al. 2020). AdaBoost classifier (ADA) is an iterative ensemble classifier algorithm that analyzes data by combining classifiers. Extra tree classifier (ETC) is a classification algorithm that generates pruned decision trees by forming random cut off points (Khan and Ramsahai 2021).

The aim of this study was to obtain highly accurate and objective results by using ML with parameters obtained from CTs of maxillary molar and canine teeth.

Methods

Study population

This study was conducted with the 2020/1099 numbered decision of non-interventional local ethics committee. Study population included 120 female and 120 male individuals between the ages of 25 and 54 who did not have any pathologies or interventional operation in maxillary tooth structure. The individuals included in the study were grouped in 4 different age groups as 25–30, 31–36, 37–49, 50–54 years of age.

Cone beam computed tomography (CBCT) scanning protocol

NewTom 5G CBCT device (Verona, Italy) was used to obtain CBCT images with the following parameters: 110 kVp, 1–11 mA, 3.6 s. Images containing artifacts that prevent segmentation (metal artifact, artifacts caused by position errors during shooting, etc.) were excluded from the study.

Image processing

The images in Digital Imaging and Communications in Medicine (DICOM) format taken from Picture Archiving Communication Systems (PACS) of our hospital were transferred to personal work station Horos Medical Image Viewer (Version 3.0, USA) program. Axial, coronal and sagittal plane images were obtained by applying 3D curved multiplanar reconstruction (MPR) to transferred images (Fig. 1). The line passing through the pars maxillaris of palatum durum on the images of three planes obtained were determined and all images were brought to the orthogonal plane. These images brought to orthogonal plane were overlapped and length, angle, and curvature length measurements were performed (Fig. 2).

Fig. 1
figure 1

Demonstration of the process of bringing to orthogonal plane (a: sagittal, b: coronal, c: axial)

Fig. 2
figure 2

Demonstration of measurements (1 CT-L, 2 FM-L, 3 LM-A, 4 IFCT-A 5 PNSCT-A, 6 RFMTM-L, 7 LFMTM-L, 8 FMRCT-L, 9 FMLCT-L, 10 RM-A, 11 LFM-CL, 12 IFFM-A, 13 PNSFM-A, 14 RFM-CL)

Measurement parameters are as follows:

  • The distance between canine teeth (CT-L)

  • The angle of canine teeth to posterior nasal spine (PNSCT-A)

  • The angle of canine teeth to incisive foramen (IFCT-A)

  • The distance of right canine tooth to first molar (FMRCT-L)

  • The distance of left canine tooth to first molar (FMLCT-L)

  • The angle of first molar to posterior nasal spine (PNSFM-A)

  • The angle of first molar to incisive foramen (IFFM-A)

  • Curvature length of the right first molar (RFM-CL)

  • Curvature length of the left first molar (LFM-CL)

  • The distance between the first molars (FM-L)

  • The distance between the right first molar and third molar (RFMTM-L)

  • The distance between the left first molar and third molar (LFMTM-L)

  • Right molar angle (RM-A)

  • Left molar angle (LM-A)

Machine learning algorithm application

Python programming language (Version 3.7.1) and scikit-learn framework (Version 0.20.0) were used for ML modelling. Modelling was performed by using a computer with Hp-Folio 1040 model, i7 operating system and 8 Gb Ram. In algorithms, training set was determined as 80%, while the test set was determined as 20% and logistic regression (LR), decision tree (DT), linear discriminant analysis (LDA), AdaBoost classifier (ADA), quadratic discriminant analysis (QDA), extra tree classifier (ETC), random forest (RF) algorithms were preferred. Algorithms were applied in 25–30 vs 31–36, 25–30 vs 37–49, 25–30 vs 50–54 age groups and between sexes. In our study, age groups were divided into many different combinations and used as input in algorithm analysis. Among these combinations, those with an Accurucy (Acc) ratio above 0.70 were included in the study. Other combinations provided Acc ratio between 0.50 and 0.70. The factors affecting the accuracy rate can be briefly shown as the number of individuals, the fact that the teeth structure of the individuals does not show values in their own age group, the age of the individual is very close to another age group. In order to eliminate the negativity due to closeness between age groups, individuals can be divided into months, weeks and days, not years, but this requires a huge number of individuals. This will adversely affect situations where quick decisions are required. For this reason, year classification was preferred in our study. Acc, specificity (Spe), sensitivity (Sen), and F1 score (F1) values were used as performance criteria. These performance criteria obtained for sex and age estimation belong to the test set and the training set was used for learning algorithms.

$$\mathrm{Acc}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}+\mathrm{FP}+\mathrm{TN}}$$
$$\mathrm{Sen}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}}$$
$$\mathrm{Spe}=\frac{TN}{TN+FP}$$
$$F1=2\frac{Spe\times Sen}{Spe+Sen}$$

Equation 1 (FP; false positive, FN; false negative, TP; true positive, TN; true negative).

In order for the results obtained to reflect the truth, no extraction or cleaning was performed in the entire data set. In addition, tenfold cross-validation was applied in our study and the Acc value of each algorithm was given as mean ± standard deviation. In our study, the contribution of the parameters to the overall result was revealed by using the SHAP analyzer of the RF algorithm.

Statistical analysis

Two hundred forty individuals and 14 parameters were used in the present study; median, minimum and maximum values were included in the descriptive statistics for each age group of the parameters. Descriptive statistics were performed by using Minitab 17 statistics program.

Results

Mean age of the men in the study were found as 39.29 ± 10.98, while mean age of the women was found as 39.11 ± 10.60 and no significant difference was found between ages in terms of sex (p ≥ 0.05). Of the 240 individuals whose images were evaluated in the study, measurement results of 72 individuals between the ages of 25 and 30 are shown in Table 1. Median value of men was found as 25, while median value of women was found as 26 in terms of sex.

Table 1 Descriptive statistics of individuals between the ages of 25 and 30

Of the 240 individuals whose images were evaluated in the study, measurement results of 24 individuals between the ages of 31 and 36 are shown in Table 2. Median value of men was found as 36, while median value of women was found as 36 in terms of sex.

Table 2 Descriptive statistics of individuals between the ages of 31 and 36

Of the 240 individuals whose images were evaluated in the study, measurement results of 84 individuals between the ages of 37 and 49 are shown in Table 3. Median value of men was found as 40, while median value of women was found as 43 in terms of sex.

Table 3 Descriptive statistics of individuals between the ages of 37 and 49

Of the 240 individuals whose images were evaluated in the study, measurement results of 60 individuals between the ages of 50 and 54 are shown in Table 4. Median value of men was found as 53, while median value of women was found as 53 in terms of sex.

Table 4 Descriptive statistics of individuals between the ages of 50 and 54

In the estimation of 25–30 vs 31–36 age groups, the highest Acc rate was found as 0.84 with RF algorithm. In the estimation of 25–30 vs 37–49 age groups, the highest Acc rate was found as 0.74 with RF and ADA algorithms. The highest Acc rate of 25–30 vs 50–54 age groups was found as 0.85 with RF algorithm. In total sex comparison, the highest Acc rate was found as 0.81 with ADA algorithm (Table 5).

Table 5 Accuracy table of machine learning algorithms

In the estimation of 25–30 vs 31–36 age groups, the highest Spe rate was found as 0.87 with RF algorithm. In the estimation of 25–30 vs 37–49 age groups, the highest Spe rate was found as 0.75 with RF and ADA algorithms. The highest Spe rate of 25–30 vs 50–54 age groups was found as 0.85 with RF algorithm. In total sex comparison, the highest Spe rate was found as 0.81 with ADA algorithm (Table 6).

Table 6 Specificity table of machine learning algorithms

In the estimation of 25–30 vs 31–36 age groups, the highest Sen rate was found as 0.84 with RF algorithm. In the estimation of 25–30 vs 37–49 age groups, the highest Sen rate was found as 0.75 with RF and ADA algorithms. The highest Sen rate of 25–30 vs 50–54 age groups was found as 0.85 with RF algorithm. In total sex comparison, the highest Sen rate was found as 0.81 with ADA algorithm (Table 7).

Table 7 Sensitivity table of machine learning algorithms

In the estimation of 25–30 vs 31–36 age groups, the highest F1 rate was found as 0.80 with RF algorithm. In the estimation of 25–30 vs 37–49 age groups, the highest F1 rate was found as 0.75 with RF and ADA algorithms. The highest F1 rate of 25–30 vs 50–54 age groups was found as 0.85 with RF algorithm. In total sex comparison, the highest F1 rate was found as 0.81 with ADA algorithm (Table 8).

Table 8 F1 table of machine learning algorithms

The highest Acc rate was obtained with RF algorithm between 25–30 and 50–54 age groups and the confusion matrix of this algorithm is shown in Fig. 3. Thirteen of the 14 individuals in the test group between the ages of 25 and 30 were estimated correctly, while 1 was estimated incorrectly. Twelve of the 9 individuals in the test group between the ages of 50 and 54 were estimated correctly, while 3 were estimated incorrectly.

Fig. 3
figure 3

Confusion matrix of the highest accuracy

In our study, the SHAP analyzer belonging to the 25–30 vs 50–54 group with the highest accuracy rate with the RF algorithm was included and it was found that the PNSFM-A parameter contributed the highest to the overall result (Fig. 4).

Fig. 4
figure 4

SHAP analyzer of the RF algorithm (for group 25–30 vs 50–54, feature 0 CT-L, 1 PNSCT-A, 2 IFCT-A, 3 FMRCT-L, 4 FMLCT-L, 5 PNSFM-A, 6 IFFM-A, 7 RFM-CL, 8 LFM-CL, 9 FM-L, 10 RFMTM-L, 11 LFMTM-L, 12 RM-A, 13 LM-A)

In addition, tenfold cross-validation was applied to increase the reliability of our study, and the highest accuracy rate was found to be 84.55 ± 2.06 in the 25–30 vs 50–54 group with the RF algorithm (Table 9).

Table 9 Tenfold cross-validation accuracy value

Discussion

In the present study, sex and age estimation was performed by using ML algorithms with parameters of maxillary canine and molar teeth obtained from CBCT images. In sex estimation, an accuracy rate of 0.81 was obtained with ADA analysis, while in age estimation an accuracy of 0.85 was obtained between 25–30 and 50–54 groups with RF analysis.

Estimation of age from teeth is a research topic that started in 1837 and has survived to the present day. We can group methods of age estimation from teeth in three main categories; these are morphohistological, radiological, and biochemical methods. Of these methods, radiological methods have become more important both since they can be easily used in living and deceased individuals and also because of some laws that prohibit getting tissue from deceased individuals (Verma et al. 2019). When estimating age from teeth, radiological method CT comes to the fore because it provides thinner section and spatial resolution (Bjørk and Kvaal 2018). Different metric measurements of teeth have been used in radiological methods. Asif et al. (2019) evaluated the volumetric analysis of pulp/tooth ration by dividing individuals between 16 and 65 years of age into 5 different age groups and found that this volumetric analysis was valuable in age estimation. In a study, they used the pulp volume of the second molar in individuals between the ages of 21 and 50, Helmy et al. (2020) found that the second molar could be used in age estimation. In the present study, we determined 14 metric parameters from images of maxillary molar and canine teeth obtained with CBCT and obtained 0.85 Acc in age estimation with RF algorithm.

In a study they examined the maxillary and mandibular molar teeth of 1586 individuals by using convolutional neural network, Kim et al. (2021) divided individuals between 0 and 60 years of age into 7 different groups and obtained an accuracy rate between 0.89 and 0.90 in age estimation. In the present study, we performed age and sex estimation by using ML algorithms and found an accuracy rate of up to 0.85. When compared with Kim et al.’s study, we think that the reason for low Acc rate in our study is due to the high number of individuals in the study, and due to the fact that only molars were used and a different computer based algorithm was used.

In a study, they conducted on 100 female and 100 male individuals with parameters of the first molar, Sonika et al. (2011) found that the first molar showed sexual dimorphism. In a study they conducted on 100 women and 100 men, Manchanda et al. (2015) used seven maxillary and seven mandibular teeth and obtained sexual dimorphism between 0.51 and 0.80. Both studies in literature and our study show that maxillary molar and canine teeth show strong sexual dimorphism. In our study, 0.81 sexual dimorphism was found with ADA algorithm. The rates of other algorithms were found to vary between 0.69 and 0.77.

Limitations of the present study can be listed as scarcity of images, variety of algorithms used and the quality of CT used. However, despite these limitations, an Acc rate of 0.85 was obtained in age estimation, while an Acc rate of 0.81 was obtained in sex estimation.

Conclusions

As a result of our study, it was found that images of maxillary molar and canine teeth obtained from CBCT can be used as important biomarkers in sex and age estimation by using ML algorithms.

Availability of data and materials

The datasets generated and/or analyzed during the current study are not publicly available due [reason why data are not public] but are available from the corresponding author on reasonable request.

Abbreviations

CT:

Computed tomography

ML:

Machine learning algorithm

LR:

Logistic regression

DT:

Decision tree

RF:

Random forest

LDA:

Linear discriminant analysis

QDA:

Quadratic discriminant analysis

ADA:

AdaBoost classifier

ETC:

Extra tree classifier

DICOM:

Digital Imaging and Communications in Medicine

PACS:

Picture Archiving Communication Systems

MPR:

3D curved multiplanar reconstruction

Acc:

Accuracy

Spe:

Specificity

Sen:

Sensitivity

F1:

F1 score

FP:

False positive

FN:

False negative

TP:

True positive

TN:

True negative

References

  • Alias A, Ibrahim ANM, Bakar SNA, Rokhani F, Shafie MS, Nor FM (2018) Morphometric analysis of coronoid process of mandible by CT in the Malaysian population: an important step for determination of sex. J Dent Sci Res Ther 5:1–8

    Google Scholar 

  • Asif MK, Nambiar P, Mani SA, Ibrahim NB, Khan IM, Lokman NB (2019) Dental age estimation in Malaysian adults based on volumetric analysis of pulp/tooth ratio using CBCT data. Leg Med 36:50–58

    Article  Google Scholar 

  • Azevedo A, d. C. S., Alves, N. Z., Michel-Crosato, E., Rocha, M., Cameriere, R., & Biazevic, M. G. H. (2015) Dental age estimation in a Brazilian adult population using Cameriere’s method. Braz Oral Res 29:1–9

    Google Scholar 

  • Bjørk MB, Kvaal SI (2018) CT and MR imaging used in age estimation: a systematic review. J Forensic Odontostomatol 36(1):14

    PubMed  PubMed Central  Google Scholar 

  • Capitaneanu C, Willems G, Thevissen P (2017) A systematic review of odontological sex estimation methods. J Forensic Odontostomatol 35(2):1

    CAS  PubMed  PubMed Central  Google Scholar 

  • Erickson BJ, Korfiatis P, Akkus Z, Kline TL (2017) Machine learning for medical imaging. Radiographics 37(2):505

    Article  PubMed  Google Scholar 

  • Helmy MA, Osama M, Elhindawy MM, Mowafey B, Taalab YM, Abd ElRahman HA (2020) Volume analysis of second molar pulp chamber using cone beam computed tomography for age estimation in Egyptian adults. J Forensic Odontostomatol 38(3):25

    CAS  PubMed  PubMed Central  Google Scholar 

  • Jeng M-J, Sharma M, Chao T-Y, Li Y-C, Huang S-F, Chang L-B, Chow L (2020) Multiclass classification of autofluorescence images of oral cavity lesions based on quantitative analysis. PLoS ONE 15(2):e0228132

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Khan K, Ramsahai E (2021) Maintaining proper health records improves machine learning predictions for novel 2019-nCoV. BMC Med Inform Decis Mak 21(1):1–13

    Article  Google Scholar 

  • Kim S, Lee Y-H, Noh Y-K, Park FC, Auh Q (2021) Age-group determination of living individuals using first molar images based on artificial intelligence. Sci Rep 11(1):1–11

    Google Scholar 

  • Manchanda AS, Narang RS, Kahlon SS, Singh B (2015) Diagonal tooth measurements in sex assessment: a study on North Indian population. J Forensic Dent Sci 7(2):126

    Article  PubMed  PubMed Central  Google Scholar 

  • Narang R, Manchanda A, Malhotra R, Bhatia H (2014) Sex determination by mandibular canine index and molar odontometrics: a comparative study. Indian J Oral Sci 5(1):16

    Article  Google Scholar 

  • Nayar AK, Parhar S, Thind G, Sharma A, Sharma D (2017) Determination of age, sex, and blood group from a single tooth. J Forensic Dent Sci 9(1):10

    PubMed  PubMed Central  Google Scholar 

  • Oner Z, Turan MK, Oner S, Secgin Y, Sahin B (2019) Sex estimation using sternum part lenghts by means of artificial neural networks. Forensic Sci Int 301:6–11

    Article  PubMed  Google Scholar 

  • Pereira JGD, Lima KF, da Silva RHA (2020) Mandibular measurements for sex and age Estimation in Brazilian sampling. Acta Stomatol Croat 54(3):294

    Article  Google Scholar 

  • Rajkomar A, Dean J, Kohane I (2019) Machine learning in medicine. N Engl J Med 380(14):1347–1358

    Article  PubMed  Google Scholar 

  • Senol D, Bodur F, Seçgin Y, Babysitter R, Sahin N, Toy S, et al (2022) Sex prediction with morphometric measurements of first and fifth metatarsal and phalanx obtained from X-ray images by using machine learning algorithms. Folia Morphol. Accepted

  • Sonika V, Harshaminder K, Madhushankari G, Kennath JAS (2011) Sexual dimorphism in the permanent maxillary first molar: a study of the Haryana population (India). J Forensic Odontostomatol 29(1):37

    CAS  PubMed  PubMed Central  Google Scholar 

  • Toy S, Secgin Y, Oner Z, Turan MK, Oner S, Senol D (2022) A study on sex estimation by using machine learning algorithms with parameters obtained from computerized tomography images of the cranium. Sci Rep 12(1):1–11

    Article  Google Scholar 

  • Turan MK, Oner Z, Secgin Y, Oner S (2019) A trial on artificial neural networks in predicting sex through bone length measurements on the first and fifth phalanges and metatarsals. Comput Biol Med 115:103490

    Article  PubMed  Google Scholar 

  • Uddin S, Khan A, Hossain ME, Moni MA (2019) Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inform Decis Mak 19(1):1–16

    Article  Google Scholar 

  • Verma M, Verma N, Sharma R, Sharma A (2019) Dental age estimation methods in adult dentitions: an overview. J Forensic Dent Sci 11(2):57

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

Thanks to all the authors.

Funding

No financial support has been received from any institution or organization.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the article writing. Image acquisition: B.S.D; data analysis: Y.S; designed by D.S, S.T, Z.O, Y.S. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Deniz Senol.

Ethics declarations

Ethics approval and consent to participate

Approval was obtained from the Inonu University Non-Interventional Ethics Committee with the decision no. 2020/1099.

Informed consent include appropriate statements.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Senol, D., Secgin, Y., Duman, B.S. et al. Sex and age estimation with machine learning algorithms with parameters obtained from cone beam computed tomography images of maxillary first molar and canine teeth. Egypt J Forensic Sci 13, 27 (2023). https://doi.org/10.1186/s41935-023-00346-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s41935-023-00346-1

Keywords