Age estimation using DNA methylation technique in forensics: a systematic review
Egyptian Journal of Forensic Sciences volume 10, Article number: 38 (2020)
In addition to the DNA sequence, epigenetic markers have become substantial forensic tools during the last decade. Estimating the age of an individual from human biological remains may provide information for a forensic investigation. Age estimation in molecular strategies can be obtained by telomere length, mRNa mutation, or by sjTRECs but the accuracy is not sufficient in forensic practice because of high margin error.
One solution to this problem is to use DNA methylation methods. DNA methylation markers for tissue identification at age-associated CpG sites have been suggested as the most informative biomarkers for estimating the age of an unknown donor. This review aims to give an overview of DNA methylation profiling for estimating the age in cases of forensic relevance and the important aspects in determining the mean absolute deviation (MAD) or mean absolute error (MAE) of the estimated age. Online database searching was performed through PubMed, Scopus, and Google Scholar with keywords selected for forensic age estimation. Thirty-two studies were included in the review, with variable DNA samples but blood commonly as a source. Pyrosequencing and EpiTYPER were methods mostly used in DNA analysis. The MAD in the estimates from DNA methylation was about 3 to 5 years, which was better than other methods such as those based on telomere length or signal-joint T-cell receptor excision circles. The ELOVL2 gene was a commonly used DNA methylation marker in age estimation.
DNA methylation is a favorable candidate for estimating the age at the time of death in forensic profiling, with an uncertainty mean absolute deviation of about 3 to 5 years in the predicted age. The sample type, platform techniques used, and methods to construct age predictive models were important in determining the accuracy in mean absolute deviation or mean absolute error. The DNA methylation outcome suggests good potential to support conventional STR profiling in forensic cases.
The study of epigenetics refers to the heritable changes in gene function that cannot be explained by DNA sequence changes (Deans and Maggert 2015; Felsenfeld 2014). The term “epigenome” refers to the overall epigenetic status of a cell, parallel to the term “genome”. The epigenome is the set of chemical modifications to the DNA that alter gene expression. Epigenetic changes control how and when the genes are turned on or off which regulate the protein production in certain cells. Epigenetic modification types include DNA methylation, histone modification, and chromatic structuring. DNA methylation is a common type of epigenetic modification. The chromatin proteins associated with DNA may be activated or silenced and therefore, only express necessary genes for an activity such as certain protein production (Bird 2007; Vidaki et al. 2013). DNA methylation plays an important role in embryonic development, reprogramming, transcription, imprinting, chromosomal stability, and X-chromosome inactivation. The epigenetic pattern is preserved during cell division in the same way as the DNA sequence is inherited from one generation to the next. However, during an individual’s lifetime, they can change over time (Kanherkar et al. 2014). Epigenetics can be affected by environmental exposure, such as diet and smoking (Lee and Pausova 2013).
In mammalian cells, chemical modification of DNA methylation primarily affects cytosines, followed by guanines in a 5′-3′ direction in the DNA double helix, resulting in the addition of a methyl group (-CH3) to their 5′ carbon (C5). These 5′-3′ CG methylation sites in DNA are called “CpG” dinucleotides, which are mostly methylated in the human genome (Ehrlich et al. 1982). Unmethylated CpGs called “CpG islands” are predominantly encountered in groups of 300–3000 bp with high CG density (>55% CG content), mostly located at the promoter of housekeeping genes (Antequerra and Bird 1993; Espada and Esteller 2010). In the last decades, studies have shown that certain CpG sites are often either hypermethylated or hypomethylated when age increases (Zhang et al. 2011). Hypermethylation (excessive methylation) or hypomethylation (loss of appropriate methylation) can promote carcinogenesis within a living individual (Auerkari 2006).
In a crime investigation scene where highly limited biological remains are found, such as blood, semen, tissue, or saliva, accurate age estimation can be important for the police to narrow down the identity of a victim or criminal. The traditional materials required for age estimation, such as large pieces of skeletal remains, are not always available in crime scenes (Feng et al. 2018). In order to estimate human age, several molecular-based strategies have been proposed, such as telomere repeat length that decreases with increasing age (Weidner et al. 2014), mRNA mutations that accumulate with increasing age, T-cell DNA rearrangements (sjTREC) (Zubakov et al. 2016), age-dependent deletions of mitochondrial DNA, or protein alterations such as the racemization of aspartic acid and advanced glycation end-products (Wochna et al. 2018). Nonetheless, only DNA methylation has provided an acceptable accuracy that is clinically useful (Freire-Aradas et al. 2017).
A study of DNA methylation has provided a forensic method for epigenetic female sex typing. The method is based on the methylation pattern at a repetitive DXZ4 locus that is highly methylated on the active X chromosome but hypomethylated on the inactive X-chromosome. The PCR protocol to detect the latter is very sensitive and only requires 50 pg of DNA for female sex typing (Naito et al. 1993). DNA methylation marks a methyl group at the 5′ position of cytosine residues remaining in the extracted DNA, so this epigenetic marker is compatible with the standard procedures of forensics (Bird 2002; Sijen 2015). The analysis of DNA methylation patterns in forensics may give hints on pathological conditions (Klutstein et al. 2016) or circumstances that lead to death (Virani et al. 2016) and indicate the age of the DNA donor (Feng et al. 2018). This review aims to address the DNA methylation-based age estimation and the important aspect of its uncertainty in forensic applications.
The online literature search in the Scopus, Google Scholar, and Pubmed/Medline databases was applied to define keywords of “age estimation” OR “age determination” AND “DNA methylation” AND “forensic”. The guidelines of the Preferred Reporting Items for Systematic reviews and Meta-analyses (PRISMA) were used for the systematic review (Moher et al. 2009).
The inclusion and exclusion criteria were determined as follows. The inclusion criteria were studies describing the DNA methylation analysis for age estimation combined with or without other methods, with no restriction of sample size or age ranges, but restricted to reporting in the English language, publication within 2014–2019, and topics related to forensic studies. The exclusion criteria were satisfied by review studies, age estimation without molecular analysis, and abstracts without full paper available.
Reading the full articles for possible inclusion in the review followed the initial screening of the titles and abstracts. The full articles that matched the inclusion criteria and none of the exclusion criteria were set as eligible.
Literature search and screening
The analysis initially included 495 studies, and after the removal of duplicate articles, 462 were left for screening. After excluding 340 articles by the relevance of title and abstract, 122 full text articles were left. After further careful screening for more detailed contents, 91 full articles were excluded, leaving 31 eligible full articles. The procedure of the data selection is presented in Fig.1.
The studies comparing the different methods of forensic age estimation were extracted as follows: name of the first author, year of publication, methods, source of samples, number of samples, age (in years), age prediction (in years) as MAD and RSME/SEE (Table 1). In the DNA methylation method’s studies, the following data were extracted: name of the first author, year of publication, population, source of samples, age range (in years), sample size, CpG coverage, gene(s), technique/input DNA for bisulfite conversion, statistical model, age prediction by MAD or MAE, in years (Table 2).
Table 1 shows previous studies about DNA methylation-based age estimation together with other methods based on telomere length, mRNA methylation, and signal-joint T-cell receptor excision circles (sjTRECs). The uncertainty levels (as MAD) in age prediction are compiled in Fig.2.
In the included studies (Table 2), the population and the sample types used were varied, as seen in Figs. 3 and 4, respectively. The study age range was 0–104 years old, and the range of the number of samples was 16–725, as shown in Figs.5 and 6, respectively. The number of CpG coverage in this study was from 1 CpG to 32 CpGs. Variable candidate genes date was used for age prediction. The ELOVL2 gene was most frequently used in studies with different body fluids and teeth (Fig. 7). The techniques used in the study are compiled in Fig. 8.
The platforms in age prediction used the multivariate linear regression model (MLRM), SNaPshot, methylation sensitive-high resolution melting (MS-HRM), EpiTYPER, next-generation sequencing (NGS), massively parallel sequencing (MPS), support vector regression model (SVRM), multivariate quadratic regression model (MQDRM), multivariate quantile regression model (MQTRM), random forest regression (RFR), generalized regression neural networks (GRNN), neural network (NN), artificial neural network (ANN), R-models, or combinations, as shown in Fig. 9.
The uncertainty in the predicted age ranged in MAD from ±1.2 (Giuliani et al. 2016) to 7.87 years (Huang et al. 2015), in MAE from ±0.94 (Freire-Aradas et al. 2018) to 7.45 years (Vidaki et al. 2017), and in RSME from ±4.03 (Hong et al. 2019) to 11.1 years (Aliferi et al. 2018). Levels of mixed sample MAD and MAE are presented in Fig. 10.
Age estimation is important to investigate in forensic cases on persons of unknown age, in fraud cases, and other legal affairs of victim identification. Several DNA-based methods can be used to estimate human age, such as those based on telomere length, mRNA, DNA rearrangement or sjTREC, and aspartic amino acid (Asp) racemization, which decrease along with increasing age (Zubakov et al. 2016).
Telomeres are located at the terminal regions of chromosomes and protect chromosome ends. Shortening of the telomeres will lead to cell senescence, characterized by the incapacity of the cell to replicate. The measurement of the telomere length for the estimation of human age was first published using the Southern blot technique (Butler et al. 1998). The current methods in measuring the telomere length for age prediction have been presented in two studies (Weidner et al. 2014; Zubakov et al. 2016), with MAD of more than 10 years for the predicted age prediction was more than 10 years, while it was 5 years for the method of DNA methylation. The telomere length-based approach is hence not sufficient in forensic practice because of the high margin of error.
By identifying the mRNA markers via microarray screening and validating with TaqMan qRT-PCR profiling, the results can provide an age prediction model. The correlation between gene expression and age has been used to find the strongest of nine mRNA candidate markers. The MAD for mRNA methylation-based age prediction was about 9 years, i.e., more than that the 5 years for DNA methylation-based prediction (Zubakov et al. 2016).
The sjTREC levels in the blood decrease with increasing age. The sjTRECs are episomal DNA molecules, by-products of T-cell somatic rearrangements in the T-cell receptor loci in order to recognize a wide range of foreign antigens. These molecules do not replicate and are progressively lost during subsequent cell divisions (Yamanoi et al. 2018). The MAD for sjTREC-based age prediction is 9–10 years, again more than the 4–5 years for DNA methylation-based age prediction (Cho et al. 2017).
The sjTREC-based methods are only applicable with a limited range of tissues under specific conditions (fresh blood samples and tissues of fresh cadavers) and do not meet the requirements of robustness under variable environmental factors and accurate estimation models (Zubakov et al. 2016).
Other methods of molecular age determination include Asp racemization (Hartomo et al. n.d.). The racemization is a first-order kinetics reaction where the amino acid changes from the levo (l) to the dextro (d) form. The aspartic amino acid (Asp) is a protein compound in many human tissues, including the teeth. Asp is most prone to racemization, which is optically active change because of an asymmetric carbon atom arrangement. Asp has the highest racemization reaction rate of all amino acids (Ogino and Onino 1988). In cartilage, bone, and teeth, the turnover accumulation of the d form proceeds at a low temperature-dependent rate linearly with age. The ratio of d/l may be used to estimate chronological age. In dentin, the MAD of the estimated chronological age was approximately 3 years (Ohtani and Yamamoto 2010).
The DNA methylation-based methods developed rapidly since the first relevant studies on DNA methylation and age estimation were published (Naito et al. 1993). The studies comparing DNA methylation-based age estimation with other methods showed that, e.g., sjTREC-based methods alone give MAD of about 10 years, while DNA methylation gave MAD of about 4 years. Combining SjTRECs and DNA methylation exhibited even higher predictive accuracy with MAD of about 3.3 years (Cho et al. 2017), while a combination set of five DNA methylation markers and one mRNA marker gave MAD of 4.6 years (Zubakov et al. 2016).
In line with the increasing age, DNA hypomethylation increases in the distribution of the genome (affecting intronic, exonic, promoters, and intergenic regions) or, in other words, the global level of methylated genomic DNA decreases as a person is aging (Wilson et al. 1987). However, DNA methylation is also susceptible to reproducibility variation in the assays according to the type of tissue used in the analysis, because some of the 5mC methylation marks in DNA are specific (Rana 2018). To scrutinize further on DNA methylation in different types of tissue, Horvath (2013) developed a multi-tissue age predictor, which allowed estimating the DNA methylation age in most tissues. The age predictor used 8000 samples from 82 Illumina DNA methylation array datasets, covering 51 healthy tissues and cell types. The multi-tissue age predictor is freely available (Horvath 2013).
Different sample sources can be modified in CpG coverage, such as buccal swabs as DNA methylation source of age prediction. The buccal epithelial cells with leukocytes by two additional CpGs provided age prediction with a multivariate model, showing that two cell type-specific CpGs actually improve epigenetic age prediction (Eipel et al. 2016). Different oral tissue sources showed different MADs: MAD was 1.2 years for cementum, 2.3 years for dental pulp, 7 years for dentin (Giuliani et al. 2016), 6 years for saliva, and 7.7 years for cigarette butts (Hamano et al. 2017).
Predicting younger age was more accurate and the accuracy decreased with increasing age. Five years prediction achieved 86.7% in the 2–19 years of age category and decreased to 50% in the 60–75 years of age category (Zbiec-Piekarska et al. 2015a). Validation of the age-prediction model for young age ranges showed MAE ± 1.25 years in the 2–18 years of donor age range while it was MAE ± 3.07 years in the adult populations (Freire-Aradas et al. 2018). The CpG site methylation markers with reduced methylation with age were CCDC102B, ASP, C1orf132, and chr16:85395429, while ELOVL2, FHL2, and PDE4C progressed with increasing DNA methylation with increasing age (Park et al. 2016). On the other hand, young age tends to be overestimated, while older age tends to be underestimated more often (Naue et al. 2017). An experimental study showed that ELOVL2, FHL2, PENK, and KLF14 did not display an age-related change in gene expression in peripheral blood mononucleated cells (Steegenga et al. 2014).
ELOVL2 locus provides a very good blood source of information of human chronological age and did not change significantly after 4 weeks of storage at room temperature, although along with increasing time, the positive result determined by PCR was gradually decreased (Zbiec-Piekarska et al. 2015b). The ELOVL2 gene was mostly used especially in blood and bloodstain samples (75%) as seen in Fig. 7. ELOVL2 also appeared to be an excellent age predictor across multiple ethnic groups such as Polish (Zbiec-Piekarska et al. 2015a,b), Koreans (Cho et al. 2017), and Singaporeans (Thong et al. 2017). ELOVL2 was not affected by the disease, so it appears suitable for forensic age prediction (Spolnicka et al. 2018). ELOVL2 is also a stable gene and has a strong positive correlation between methylation and age across other samples such as teeth (Giuliani et al. 2016; Bekaert et al. 2015b), buccal swabs (Bekaert et al. 2015a; Giuliani et al. 2016; Jung et al. 2019), saliva (Jung et al. 2019), and even cigarette butts (Hamano et al. 2017). The PDE4C gene was used in 33.3% of studies using blood samples, teeth, and buccal swabs. Eipel et al. demonstrated that methylation of PDE4C (cg17861230) has a higher correlation to chronological age with saliva and buccal swabs than blood. While semen samples were detected mostly by NOX4 (cg06979108) then TTC7B (cg06304190) and cg12837463 with no gene associated (Lee et al. 2015; Li et al. 2017; Lee et al. 2018; Richards et al. 2019).
For the target sites or CpG coverage and the age prediction accuracy, three target sites have been suggested as a preferable number for practical reasons (Weidner et al. 2014; Park et al. 2016), while one study suggested two target sites (Hamano et al. 2017). The age differential in methylation might be similar or significantly disparate between different tissues, depending on the specific CpG site. Therefore, in designing an age-prediction model, the method should be investigated thoroughly for multi-tissue forensic applicability (Aliferi et al. 2018).
Epigenetic studies are best in comparing monozygotic twins because they share the same genetic basis. They both display the same methylation and gradually show more differences in the methylation patterns (Kader and Ghai 2015). There is a specific forensic marker in discriminating monozygotic twins by the differences of LINE-1 in interspersed repeat sequences (Xu et al. 2015b). Buccal samples from 31 CpG sites from three loci in identical twins have demonstrated that at least one CpG site with DNA methylation was significantly different in all twin pairs (p < 0.05) and the highest number of significantly different CpG sites was six (Richards et al. 2019). The sampling of reference subjects from monozygotic twin pairs is often favored for investigating environmental influences on age prediction models since monozygotic twins usually have a similar growth environment (Xu et al. 2015a; Vidaki et al. 2017).
The collected samples in the studies mostly use blood from a donor or cadaver, but one study used samples from both healthy subjects and cadavers collected within 10 days and found no significant changes between living and dead body samples in the methylation status (Hamano et al. 2016). DNA methylation is also stable in bloodstains obtained from peripheral blood in both FTA cards and gauze exposed at room temperature for about 3 months (Peng et al. 2019).
Chronological age prediction from a forensic setting usually gives no information regarding possible disease status; therefore, age prediction is also performed in deceased subjects (Spolnicka et al. 2018; Vidaki et al. 2017; Horvath 2013). The biological age is relevant for the onset and progression rate of many diseases. Chronological age and biological age differences are important in forensic studies. Biological aging is influenced by cellular and molecular aging including changes in dysregulated nutrition, cell senescence, stem cell exhaustion, and disease-related factors (Bell et al. 2019). In one study, blood-related diseases showed high MAEs of the predicted age, with the highest MAE for anemia at 14.38 years, while schizophrenia showed the lowest age-prediction error of 5.03 years (Vidaki et al. 2017). In another study, a group with early-onset Alzheimer’s disease was predicted to be 1.7 years older than the chronological age of patients. The genes C1orf132 and ELOVL2 were stable in the three groups of early-onset Alzheimer’s disease, late-onset Alzheimer’s disease, and Grave’s disease. Therefore, they can be used as predictors of chronological age in forensic investigations (Spolnicka et al. 2018). ELOVL2 or ELOVL fatty acid elongase 2, also known as SSC2, is located in human chromosome 6 (6p24.2) (Jakobsson et al. 2006). In forensics, ELOVL2 is a promising candidate marker for age estimation because of its strong correlation with age prediction and a wide range of changes in methylation in aging (Zbiec-Piekarska et al. 2015b).
The pyrosequencing method was used in 13 out of 32 studies and is considered as a gold standard to detect DNA methylation. Pyrosequencing gives a detailed profile and accurate pattern of DNA methylation within 100 bases from the pyrosequencing binding sites. The ratio of nucleotides T and C determine the methylation degree at each CpG site in a sequence. Bisulfite conversion methods change unmethylated cytosines to uracil, while methylated cytosines remain cytosines. This is a quantitative technique, which can detect low methylation of up to 5%, and it can be used for multiplex assays (Kurdyukov and Bullock 2016).
The NGS is capable to detect DNA methylation differences in bisulfite-converted DNA fragments with overall performance <0.05 standard deviation. Other advantages include high sensitivity, multiplexing capabilities, and the potential for merging with other DNA marker analysis (Vidaki et al. 2017; Horvath 2013).
The disadvantage of pyrosequencing and NGS is that they are time-consuming and expensive (Mawlood et al. 2016); therefore, new methods were developed, such as MS-HRM, which can indicate methylation status more effectively in terms of labor, time, and cost (Hamano et al. 2016; Hamano et al. 2017). MS-HRM is a method to measure methylation profiles where the PCR amplification of bisulfite-treated DNA is followed by melting analysis. MS-HRM only requires qPCR, less time, and a gDNA amount of 20 ng/gene, whereas pyrosequencing needs 150 ng of gDNA. However, MS-HRM cannot measure the individual methylation rates and the issue of PCR bias such as intrinsic differences in the amplification efficiency of templates or by the self-annealing templates in the late stages of amplification (Hamano et al. 2017).
Other methylation detection methods include EpiTYPER (Feng et al. 2018; Zubakov et al. 2016; Freire-Aradas et al. 2016; Freire-Aradas et al. 2018; Peng et al. 2019), massive parallel sequencing or MPS (Naue et al. 2018), and single-base extension such as the SNaPshot technique. EpiTYPER is a sequencing method based on mass spectrometry-based bisulfite analysis. This technique indicates regional-specific DNA methylation, is fast and accurate but carries high cost in forensic service (Suchiman et al. 2015). A single EpiTYPER run yields 126 triplicate measurements that with the required controls are provided from a 384-well PCR plate (Suchiman et al. 2015). Therefore, EpiTYPER is useful for measuring relatively large numbers of samples. The MPS is a high throughput approach to DNA sequencing. Millions of short reads are sequenced per instrument run (Richards et al. 2018). The main advantage of MPS is its multiplexing capability, which allows simultaneous detection of multiple CpG sites from different genomic locations in a single reaction. MPS also has high sensitivity with single-base resolution, successfully applied to forensic analysis (Aliferi et al. 2018). The disadvantages include the high recommended DNA input (∼200–500 ng) due to the extensive DNA fragmentation and loss during the bisulfite conversion process (Richards et al. 2018).
The small amount of DNA commonly found in forensic cases increases margins of error of DNA methylation levels (Naue et al. 2018). The degraded and forensic relevant materials mostly contain inhibitors that can prevent DNA amplification of those samples and STR typing often fails to produce full DNA profiles. Therefore, shorter markers such as single-nucleotide polymorphisms (SNPs) and mini-STRs can be used with the SNaPshot approach (Zar et al. 2018). The SNP genotyping allows the identification of highly degraded biological samples. In the multiplex methylation SNaPshot method, the needed amount of bisulfite-converted DNA is only about 4 ng; therefore, it can be used in a routine forensic laboratory analysis (Hong et al. 2017). The average value of gDNA input before bisulfite conversion is 50 ng as the optimum input (Aliferi et al. 2018), but regarding the samples, the reliable identification of blood and saliva was possibly down to 10 and 0.1 ng for semen (Silva et al. 2016).
Identifying age-associated DNA methylation sites require prediction models. MLRM was used in most studies of this review. Weidner et al. proposed an age-prediction model with only three CpG sites with MLRM and pyrosequencing (Weidner et al. 2014). Constructed models for blood data by applying MLRM with pyrosequencing achieved MAD of about 3–4 years (Zubakov et al. 2016; Zbiec-Piekarska et al. 2015a; Park et al. 2016). The combination of MLRM based on SNaPshot data also provided predicted age from semen (Lee et al. 2015; Lee et al. 2018) or saliva (Hong et al. 2017; Hong et al. 2019; Jung et al. 2019) with MAD of 3–5 years. A disadvantage of the multivariate linear model was oversimplicity to explain the relationship between DNA methylation and age. The relationship between DNA methylation and age showed much faster (3- to 4-fold) change during childhood than as adults, so the changes were more accurately modeled with a logarithmic age function (Alisch et al. 2012). Therefore, some studies proposed the MQDRM, which performed well in both living individuals and deceased samples (Bekaert et al. 2015b).
Another statistical model is MQTRM that the prediction is not hindered by the prediction error which increases with age, which establishes by age-specific prediction intervals each time the new data contribute to the model (Freire-Aradas et al. 2016).
Hong et al. suggested that different platforms give different MADs between chronological and predicted ages. The predicted age obtained by applying MPS and SNaPshot data from the same individuals differed greatly, so they used platform-independent age predictive models using a neural network (NN) and MLRM. NN was tuned to have five and two neurons on layer 1 and layer 2 concurrently with the MLRM method tuned as well. The results demonstrated different MADs: 3.19 years for NN and 3.69 years for MLRM analysis (Hong et al. 2019).
The ANN model was believed to improve the prediction accuracy because it has the ability to recognize complex patterns in chronological age traits and seems to be a good alternative compared with the traditional parametric methods such as multiple linear regression models (Vidaki et al. 2017). ANN could eliminate the problem of nonlinear patterns but had a slightly lower prediction accuracy than NN (Spolnicka et al. 2018).
Aliferi et al. used GRNN and ANN modeling for age prediction, and the R project was employed to test 14 regression methods. After using the same sample, both GRNN networks and R model subsets were trained and blind-tested. GRNN has a disadvantage in using a small training dataset (n < 1000) for its susceptibility to overfitting and loss of generalizability (Vidaki et al. 2017; Aliferi et al. 2018).
Xu et al. compared age-prediction models in selected 11 CpG loci, including MLRM, multivariate nonlinear regression, back-propagation NN, and SVRM. They found that SVRM was the best model with the least MAD and superior to MLRM (Xu et al. 2015a). Other studies have used RFR, which allowed the selection and incorporation of linear and nonlinear markers (Naue et al. 2017). The established models from several studies provide an online calculator that is freely accessible to calculate predicted age (Feng et al. 2018; Weidner et al. 2014; Horvath 2013).
The methods and their advantages, limitations, and observed performance in DNA methylation-based age prediction in the studies of this review were hence quite variable. In general, the best-performing methods of DNA methylation-based age prediction showed MAD of around 3 to 5 years. In the forensic field, DNA methylation should therefore provide fair information about the remains of an unknown individual and his/her age. As before, it remains likely that future development in the assessment methods and techniques will reduce the associated limitations such as time and cost of analysis, and possibly allow for improved accuracy in the predicted age.
DNA methylation is not only age-specific but also influenced by diet, lifestyle, smoking, ancestry, and other factors that cannot be excluded in the studies. Lifestyle and genetic factors are associated with the level of variation in DNA methylation despite their stability as epigenetic markers (Xia et al. 2014). Therefore, further study is suggested on DNA methylation markers for age estimation in e.g. different ethnic groups.
DNA methylation is a favorable candidate in estimating the age at the time of death in forensic profiling. DNA methylation changes rapidly up to adulthood and the uncertainty (e.g., as mean absolute deviation or MAD) of the age estimates is under favorable circumstances about 3 to 5 years. The important aspects that influence the MAD include the available tissue or body fluid used for samples, analysis methods and platforms used according to the type of samples, and ways to construct the age predictive models. Developments in the methods of DNA methylation profiling and these studies are important in supporting conventional STR profiling to solve forensic cases in the future.
Availability of data and materials
Artificial neural network
Genomic deoxyribonucleic acid
Generalized regression neural networks
Mean absolute deviation
Median absolute error
Multivariate linear regression method
Massively parallel sequencing
Multivariate quadratic regression model
Multivariate quantile regression model
Methylation sensitive-high resolution melting
Number of samples
Next generation sequencing
Polymerase chain reaction
Preferred reporting items for systematic reviews and meta-analyses
Random forest regression
Root mean square error
Real-time polymerase chain reaction
Standard error of the estimate
Signal-joint T-cell receptor excision circles
Support vector regression model
Alghanim H, Antunes J, Silva DSBS, Alho CS, Balamurugan K, McCord B (2017) Detection and evaluation of DNA methylation markers found at SCGN and KLF14 loci to estimate human age. Forensic Sci Int Genet 31:81–88 https://doi.org/10.1016/j.fsigen.2017.07.011
Aliferi A, Ballard D, Gallidabino MD, Thurtle H, Barron L, Syndercombe Court D (2018) DNA methylation-based age prediction using massively parallel sequencing data and multiple machine learning models. Forensic Sci Int Genet 37:215–226 https://doi.org/10.1016/j.fsigen.2018.09.003
Alisch R, Barwick B, Chopra P, Myrick L, Satten G, Conneely K et al (2012) Age-associated DNA methylation in pediatric populations. Genome Res 22:623–632 https://doi.org/10.1101/gr.125187.111
Antequerra F, Bird A (1993) Number of CpG islands and genes in human and mouse. Proc Natl Acad Sci 90:11995–11999 https://doi.org/10.1073/pnas.90.24.11995
Auerkari EI (2006) Methylation of tumor suppressor genes p16(INK4a), p27(Kip1) and E-cadherin in carcinogenesis. Oral Oncol 42(1):4–12 https://doi.org/10.1016/j.oraloncology.2005.03.016
Bekaert B, Kamalandua A, Zapico SC, Van de Voorde W, Decorte R (2015a) A selective set of DNA-methylation markers for age determination of blood, teeth and buccal samples. Forensic Sci Int Genet Suppl Ser 5:e144–e145 https://doi.org/10.1016/j.fsigss.2015.09.058
Bekaert B, Kamalandua A, Zapico SC, Van de Voorde W, Decorte R (2015b) Improved age determination of blood and teeth samples using a selected set of DNA methylation markers. Epigenetics 10(10):922–930 https://doi.org/10.1080/15592294.2015.1080413
Bell CG, Lowe R, Adams PD, Baccarelli AA, Beck S, Bell JT et al (2019) DNA methylation aging clocks: challenges and recommendations. Genome Biol 20(1):249 https://doi.org/10.1186/s13059-019-1824-y
Bird A (2002) DNA methylation patterns and epigenetic memory. Genes Dev 16:6–21 https://doi.org/10.1101/gad.947102
Bird A (2007) Perceptions of epigenetics. Nature 447(7143):396–398 https://doi.org/10.1038/nature05913
Butler MG, Tilburt J, DeVries A, Muralidhar B, Aue G, Hedges L et al (1998) Comparison of Chromosome Telomere Integrity in Multiple Tissues from Subjects at Different Ages. Cancer Genet Cytogenet 105(2):138–144 https://doi.org/10.1016/s0165-4608(98)00029-6
Cho S, Jung SE, Hong SR, Lee EH, Lee JH, Lee SD et al (2017) Independent validation of DNA-based approaches for age prediction in blood. Forensic Sci Int Genet 29:250–256 https://doi.org/10.1016/j.fsigen.2017.04.020
Deans C, Maggert KA (2015) What do you mean, "epigenetic"? Genetics 199(4):887–896 https://doi.org/10.1534/genetics.114.173492
Ehrlich M, Gama-Sosa MA, Huang LH, Midgett RM, Kuo KC, McCune RA et al (1982) Amount and distribution of 5-methylcytosine in human DNA from different types of tissues or cels. Nucleic Acids Res 10(8):2709–2721 https://doi.org/10.1093/nar/10.8.2709
Eipel M, Mayer F, Arent T, Ferreira MR, Birkhofer C, Gerstenmaier U et al (2016) Epigenetic age predictions based on buccal swabs are more precise in combination with cell type-specific DNA methylation signatures. Aging (Albany NY) 8(5):1034–1048 https://doi.org/10.18632/aging.100972
Espada J, Esteller M (2010) DNA methylation and the functional organization of the nuclear compartment. Semin Cell Dev Biol 21(2):238–246 https://doi.org/10.1016/j.semcdb.2009.10.006
Felsenfeld G (2014) A brief history of epigenetics. Cold Spring Harb Perspect Biol 6(1) https://doi.org/10.1101/cshperspect.a018200
Feng L, Peng F, Li S, Jiang L, Sun H, Ji A et al (2018) Systematic feature selection improves accuracy of methylation-based forensic age estimation in Han Chinese males. Forensic Sci Int Genet 35:38–45 https://doi.org/10.1016/j.fsigen.2018.03.009
Freire-Aradas A, Phillips C, Giron-Santamaria L, Mosquera-Miguel A, Gomez-Tato A, Casares de Cal MA et al (2018) Tracking age-correlated DNA methylation markers in the young. Forensic Sci Int Genet 36:50–59 https://doi.org/10.1016/j.fsigen.2018.06.011
Freire-Aradas A, Phillips C, Lareu MV (2017) Forensic Individual Age Estimation with DNA: From Initial Approaches to Methylation Tests. Forensic Sci Rev 29(2):121–144 https://www.researchgate.net/publication/319103107_Forensic_Individual_Age_Estimation_with_DNA_From_Initial_Approaches_to_Methylation_Tests
Freire-Aradas A, Phillips C, Mosquera-Miguel A, Giron-Santamaria L, Gomez-Tato A, Casares de Cal M et al (2016) Development of a methylation marker set for forensic age estimation using analysis of public methylation data and the Agena Bioscience EpiTYPER system. Forensic Sci Int Genet 24:65–74 https://doi.org/10.1016/j.fsigen.2016.06.005
Giuliani C, Cilli E, Bacalini MG, Pirazzini C, Sazzini M, Gruppioni G et al (2016) Inferring chronological age from DNA methylation patterns of human teeth. Am J Phys Anthropol 159(4):585–595 https://doi.org/10.1002/ajpa.22921
Hamano Y, Manabe S, Morimoto C, Fujimoto S, Ozeki M, Tamaki K (2016) Forensic age prediction for dead or living samples by use of methylation-sensitive high resolution melting. Leg Med (Tokyo) 21:5–10 https://doi.org/10.1016/j.legalmed.2016.05.001
Hamano Y, Manabe S, Morimoto C, Fujimoto S, Tamaki K (2017) Forensic age prediction for saliva samples using methylation-sensitive high resolution melting: exploratory application for cigarette butts. Sci Rep 7(1):10444 https://doi.org/10.1038/s41598-017-10752-w
Hartomo BT, Soedarsono N, Adrianto AWD, Auerkari EI (n.d.) Review of biomolecular methods for age estimation in application of forensic odontology. The 4th Biomedical Engineering’s Recent Progress In Biomaterials, Drugs Development, Health, and Medical Devices 2019. Proc Int Symp Biomed Eng (ISBE). https://doi.org/10.1063/1.5139364
Hong SR, Jung SE, Lee EH, Shin KJ, Yang WI, Lee HY (2017) DNA methylation-based age prediction from saliva: High age predictability by combination of 7 CpG markers. Forensic Sci Int Genet 29:118–125 https://doi.org/10.1016/j.fsigen.2017.04.006
Hong SR, Shin KJ, Jung SE, Lee EH, Lee HY (2019) Platform-independent models for age prediction using DNA methylation data. Forensic Sci Int Genet 38:39–47 https://doi.org/10.1016/j.fsigen.2018.10.005
Horvath S (2013) DNA methylation age of human tissues and cell types. Genome Biol 14:R115 https://doi.org/10.1186/gb-2013-14-10-r115
Huang Y, Yan J, Hou J, Fu X, Li L, Hou Y (2015) Developing a DNA methylation assay for human age prediction in blood and bloodstain. Forensic Sci Int Genet 17:129–136 https://doi.org/10.1016/j.fsigen.2015.05.007
Jakobsson A, Westerberg R, Jacobsson A (2006) Fatty acid elongases in mammals: their regulation and roles in metabolism. Prog Lipid Res 45(3):237–249 https://doi.org/10.1016/j.plipres.2006.01.004
Jung SE, Lim SM, Hong SR, Lee EH, Shin KJ, Lee HY (2019) DNA methylation of the ELOVL2, FHL2, KLF14, C1orf132/MIR29B2C, and TRIM59 genes for age prediction from blood, saliva, and buccal swab samples. Forensic Sci Int Genet 38:1–8 https://doi.org/10.1016/j.fsigen.2018.09.010
Kader F, Ghai M (2015) DNA methylation and application in forensic sciences. Forensic Sci Int 249:255–265 https://doi.org/10.1016/j.forsciint.2015.01.037
Kanherkar RR, Bhatia-Dey N, Csoka AB (2014) Epigenetics across the human lifespan. Front Cell Dev Biol 2:49 https://doi.org/10.3389/fcell.2014.00049
Klutstein M, Nejman D, Greenfield R, Cedar H (2016) DNA Methylation in Cancer and Aging. Cancer Res 76(12):3446–3450 https://doi.org/10.1158/0008-5472.Can-15-3278
Kurdyukov S, Bullock M (2016) DNA Methylation Analysis: Choosing the Right Method. Biology (Basel) 5(1) https://doi.org/10.3390/biology5010003
Lee HY, Jung SE, Oh YN, Choi A, Yang WI, Shin KJ (2015) Epigenetic age signatures in the forensically relevant body fluid of semen: a preliminary study. Forensic Sci Int Genet 19:28–34 https://doi.org/10.1016/j.fsigen.2015.05.014
Lee JW, Choung CM, Jung JY, Lee HY, Lim SK (2018) A validation study of DNA methylation-based age prediction using semen in forensic casework samples. Leg Med (Tokyo) 31:74–77 https://doi.org/10.1016/j.legalmed.2018.01.005
Lee KW, Pausova Z (2013) Cigarette smoking and DNA methylation. Front Genet 4:132 https://doi.org/10.3389/fgene.2013.00132
Li L, Song F, Huang Y, Zhu H, Hou Y (2017) Age-associated DNA methylation determination of semen by pyrosequencing in Chinese Han population. Forensic Sci Int Genet Suppl Ser 6:e99–e100 https://doi.org/10.1016/j.fsigss.2017.09.042
Mawlood S, Dennany L, Watson N, Pickard B (2016) The EpiTect Methyl qPCR Assay as novel age estimation method in forensic biology. Forensic Sci Int 264:132–138 https://doi.org/10.1016/j.forsciint.2016.03.047
Moher D, Liberati A, Tetzlaff J, Altman D (2009) Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. Ann Intern Med 151:264–269 https://doi.org/10.1371/journal.pmed.1000097
Naito E, Dewa K, Yamanouchi H, Takagi S, Kominami R (1993) Sex determination using the hypomethylation of a human macro-satellite DXZ4 in female cells. Nucleic Acids Res 21(10):2533–2534 https://doi.org/10.1093/nar/21.10.2533
Naue J, Hoefsloot HCJ, Kloosterman AD, Verschure PJ (2018) Forensic DNA methylation profiling from minimal traces: How low can we go? Forensic Sci Int Genet 33:17–23 https://doi.org/10.1016/j.fsigen.2017.11.004
Naue J, Hoefsloot HCJ, Mook ORF, Rijlaarsdam-Hoekstra L, van der Zwalm MCH, Henneman P et al (2017) Chronological age prediction based on DNA methylation: Massive parallel sequencing and random forest regression. Forensic Sci Int Genet 31:19–28 https://doi.org/10.1016/j.fsigen.2017.07.015
Ogino T, Onino H (1988) Application to Forensic Odontology of Aspartic Acid Racemization in Unerupted and Supernumerary Teeth. J Dent Res 68(10):1319–1322 https://doi.org/10.1177/00220345880670101501
Ohtani S, Yamamoto T (2010) Age estimation by amino acid racemization in human teeth. J Forensic Sci 55(6):1630–1633 https://doi.org/10.1111/j.1556-4029.2010.01472.x
Park JL, Kim JH, Seo E, Bae DH, Kim SY, Lee HC et al (2016) Identification and evaluation of age-correlated DNA methylation markers for forensic use. Forensic Sci Int Genet 23:64–70 https://doi.org/10.1016/j.fsigen.2016.03.005
Peng F, Feng L, Chen J, Wang L, Li P, Ji A et al (2019) Validation of methylation-based forensic age estimation in time-series bloodstains on FTA cards and gauze at room temperature conditions. Forensic Sci Int Genet 40:168–174 https://doi.org/10.1016/j.fsigen.2019.03.006
Rana AK (2018) Crime investigation through DNA methylation analysis: methods and applications in forensics. Egypt J Forensic Sci 8(1) https://doi.org/10.1186/s41935-018-0042-1
Richards R, Patel J, Stevenson K, Harbison S (2018) Evaluation of massively parallel sequencing for forensic DNA methylation profiling. Electrophoresis 39(21):2798–2805 https://doi.org/10.1002/elps.201800086
Richards R, Patel J, Stevenson K, Harbison S (2019) Assessment of DNA methylation markers for forensic applications. Aust J Forensic Sci 51(sup1):S99-S102. https://doi.org/10.1080/00450618.2019.1574898.
Sijen T (2015) Molecular approaches for forensic cell type identification: On mRNA, miRNA, DNA methylation and microbial markers. Forensic Sci Int Genet 18:21–32 https://doi.org/10.1016/j.fsigen.2014.11.015
Silva D, Antunes J, Balamurugan K, Duncan G, Alho CS, McCord B (2016) Developmental validation studies of epigenetic DNA methylation markers for the detection of blood, semen and saliva samples. Forensic Sci Int Genet 23:55–63 https://doi.org/10.1016/j.fsigen.2016.01.017
Spolnicka M, Pospiech E, Peplonska B, Zbiec-Piekarska R, Makowska Z, Pieta A et al (2018) DNA methylation in ELOVL2 and C1orf132 correctly predicted chronological age of individuals from three disease groups. Int J Legal Med 132(1):1–11 https://doi.org/10.1007/s00414-017-1636-0
Steegenga WT, Boekschoten MV, Lute C, Hooiveld GJ, de Groot PJ, Morris TJ et al (2014) Genome-wide age-related changes in DNA methylation and gene expression in human PBMCs. Age (Dordr) 36(3):9648 https://doi.org/10.1007/s11357-014-9648-x
Suchiman HE, Slieker RC, Kremer D, Slagboom PE, Heijmans BT, Tobi EW (2015) Design, measurement and processing of region-specific DNA methylation assays: the mass spectrometry-based method EpiTYPER. Front Genet 6:287 https://doi.org/10.3389/fgene.2015.00287
Thong Z, Chan XLS, Tan JYY, Loo ES, Syn CKC (2017) Evaluation of DNA methylation-based age prediction on blood. Forensic Sci Int: Genetics Supplement Series 6:e249–e251 https://doi.org/10.1016/j.fsigss.2017.09.095
Vidaki A, Ballard D, Aliferi A, Miller TH, Barron LP, Syndercombe Court D (2017) DNA methylation-based forensic age prediction using artificial neural networks and next generation sequencing. Forensic Sci Int Genet 28:225–236 https://doi.org/10.1016/j.fsigen.2017.02.009
Vidaki A, Daniel B, Court DS (2013) Forensic DNA methylation profiling--potential opportunities and challenges. Forensic Sci Int Genet 7(5):499–507 https://doi.org/10.1016/j.fsigen.2013.05.004
Virani S, Rentschler KM, Nishijo M, Ruangyuttikarn W, Swaddiwudhipong W, Basu N et al (2016) DNA methylation is differentially associated with environmental cadmium exposure based on sex and smoking status. Chemosphere 145:284–290 https://doi.org/10.1016/j.chemosphere.2015.10.123
Weidner CI, Lin Q, Koch CM, Eisele L, Beier F, Ziegler P et al (2014) Aging of blood can be tracked by DNA methylation changes at just three CpG sites. Genome Biol 15:R24 https://doi.org/10.1186/gb-2014-15-2-r24
Wilson V, Smith R, Ma S, Cutler R (1987) Genomic 5-Methyldeoxycytidine Decreases with Age. J Biol Chem 262:9948–9951 https://www.jbc.org/content/262/21/9948.long
Wochna K, Bonikowski R, Smigielski J, Berent J (2018) Aspartic acid racemization of root dentin used for dental age estimation in a Polish population sample. Forensic Sci Med Pathol 14(3):285–294 https://doi.org/10.1007/s12024-018-9984-8
Xia YY, Ding YB, Liu XQ, Chen XM, Cheng SQ, Li LB et al (2014) Racial/ethnic disparities in human DNA methylation. Biochim Biophys Acta 1846(1):258–262 https://doi.org/10.1016/j.bbcan.2014.07.001
Xu C, Qu H, Wang G, Xie B, Shi Y, Yang Y et al (2015a) A novel strategy for forensic age prediction by DNA methylation and support vector regression model. Sci Rep 5:17788 https://doi.org/10.1038/srep17788
Xu J, Fu G, Yan L, Craig JM, Zhang X, Fu L et al (2015b) LINE-1 DNA methylation: A potential forensic marker for discriminating monozygotic twins. Forensic Sci Int Genet 19:136–145 https://doi.org/10.1016/j.fsigen.2015.07.014
Yamanoi E, Uchiyama S, Sakurada M, Ueno Y (2018) sjTREC quantification using SYBR quantitative PCR for age estimation of bloodstains in a Japanese population. Leg Med (Tokyo) 32:71–74 https://doi.org/10.1016/j.legalmed.2018.03.003
Zar MS, Shahid AA, Shahzad MS, Shin KJ, Lee HY, Lee SS et al (2018) Forensic SNP Genotyping with SNaPshot: Development of a Novel In-house SBE Multiplex SNP Assay. J Forensic Sci 63(6):1824–1829 https://doi.org/10.1111/1556-4029.13783
Zbiec-Piekarska R, Spolnicka M, Kupiec T, Makowska Z, Spas A, Parys-Proszek A et al (2015b) Examination of DNA methylation status of the ELOVL2 marker may be useful for human age prediction in forensic science. Forensic Sci Int Genet 14:161–167 https://doi.org/10.1016/j.fsigen.2014.10.002
Zbiec-Piekarska R, Spolnicka M, Kupiec T, Parys-Proszek A, Makowska Z, Paleczka A et al (2015a) Development of a forensically useful age prediction method based on DNA methylation analysis. Forensic Sci Int Genet 17:173–179 https://doi.org/10.1016/j.fsigen.2015.05.001
Zhang FF, Cardarelli R, Carroll J, Fulda KG, Kaur M, Gonzalez K et al (2011) Significant differences in global genomic DNA methylation by gender and race/ethnicity in peripheral blood. Epigenetics 6(5):623–629 https://doi.org/10.4161/epi.6.5.15335
Zubakov D, Liu F, Kokmeijer I, Choi Y, van Meurs JBJ, van IWFJ et al. (2016) Human age estimation from blood using mRNA, DNA methylation, DNA rearrangement, and telomere length. Forensic Sci Int Genet 24:33-43. https://doi.org/10.1016/j.fsigen.2016.05.014.
The authors would like to thank Universitas Indonesia for the library facility support and Enago (www.enago.com) for the English language review
Ethics approval and consent to participate
Consent for publication
The authors declare that there are no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Maulani, C., Auerkari, E.I. Age estimation using DNA methylation technique in forensics: a systematic review. Egypt J Forensic Sci 10, 38 (2020). https://doi.org/10.1186/s41935-020-00214-2
- Age estimation
- DNA methylation