Revealing Time of Death: The Microbiome Edition

Forensic analysts have long sought precision when determining time of death. While on crime scene investigation television shows, the presence of insects always seems to reveal when a person died, there are many elements to account for, and the probable date may still not be accurate. Insects arrive days after death if at all (e.g., if the body is found indoors or after burial), and the stage of insect activity is influenced by temperature, weather conditions, seasonal variation, geographic location and other factors. All this makes it difficult to estimate the postmortem interval (PMI) of a body discovered an unknown time after death. One way to make estimating PMI less subjective would be to have calibrated molecular markers that are easy to sample and are not altered by environmental variabilities.

Bacterial communities called microbiomes have been frequently in the news. The influence of these microbes encompass living creatures and the environment. Not surprisingly, research has focused on the influence of microbiomes on humans. For example, changes in gut microbiome seem to affect human health. Intriguingly, microbiomes may also be a key to determining time of death. The National Institute of Justice (NIJ) has funded several projects focused on the forensic applications of microbiomes. One focus involves the necrobiome, the community of organisms found on or around decomposing remains. These microbes could be used as an indicator of PMI when investigating human remains. Recent research published in PLOS ONE examined the bacterial communities found in human ears and noses after death and how they changed over time. The researchers were interested in developing an algorithm using the data they collected to estimate of time of death.

Sample Collection

Samples were collected from the ears and noses of four cadavers every 2–3 days over several months at the Anthropological Research Facility (ARF) at the University of Tennessee at Knoxville, colloquially known as The Body Farm. In addition, single samples were taken from 17 other cadavers. All samples had DNA isolated, PCR amplified and microbes identified using 16S next generation metagenome sequencing. The bacterial species were classified from the sequence data, and each sequence was assigned to its genus, family, order, class, phylum and kingdom. For the computer analysis and learning, the days since death were calculated as accumulated degree days (ADD), taking into account time and temperature variations because the cadavers were subject to an uncontrolled outdoor environment.

Microbe Diversity

How does the microbe diversity change over time? Correlation between time and species, genus or phylum was analyzed for ear, nose and both samples using Pearson’s correlation coefficient. In fact, bacterial diversity decreased over time and there was negative correlation with ADD for ear; a smaller positive correlation with nose. These correlations were seen over species, order and phylum.

Model Development

The data were taken, split into two, a training set to fit the data and a testing set used to predict, and used to create a model using a regressor with a choice of hyperparameters. Seven regressors were used in a grid-search algorithm, and resulting the model from each regressor is evaluated to select the model with the best cross-validation score (lowest error) on the training set. After scoring and ranking each model, the model is retrained on the training set and then the testing set applied. The error rate from this test offered an independent validation of the model with an unbiased accuracy estimate. Each model was then compared to a dummy regressor. This dummy or mean model simply memorized the mean ADD found in the training samples.

For the nose data, generated from 69 viable samples, the dummy model outperformed the ten lowest-error models based on curated or uncurated data over various taxa (species, genus, order, phylum or class). However, for the ear data based on 83 samples, the ten ear models from curated or uncurated taxa data outperformed the dummy model. There were 67 samples from the four cadavers that had good sequencing reads from both ear and nose samples. Despite the smaller data set, all of the top ten models for the dual samples surpassed the dummy model by a wide margin.

In addition, Johnson et al. were interested in exploring which combination of dataset and regressor may give the best overall generalization based on our data, eschewing accuracy and using the entire dataset for model selection. Through testing, researchers found that simpler algorithms (e.g., linear regression) performed well when ranked for validation error and phylum taxon worked well in several of the models.

Informative Taxa

While the top-performing models suggest that using all the data sequenced from cadaver ears and noses produced the best predictive algorithm, researchers were interested to know which taxa might be more informative. To do this, three different feature selections, a method to select the best independent variables for predicting a dependent variable, were performed. Each method produced different features, suggesting that the data set as whole was more important than any particular taxon. There were two or three models that had agreement on some phyla with the data set only encompassing 52 phyla. While some phyla may have predictive value, the best models were constructed from the entire data set and not restricted to any one taxon.

Conclusions

Using the full data set of nose and ear sequences, Johnson et al. were able to develop a model through computer learning from an algorithm that can identify time of death plus or minus 55 ADD or about two days in the Tennessee summer. This is a proof-of-concept that integrates 16S RNA sequences from microbes swabbed from the ears and noses of decomposing bodies with a predictive algorithm to determine time-of-death with more accuracy than current methods. However, the sample size is small and only from a single geographic location. Are there differences in the necrobiome in other locations? Are there other sites on the body that could benefit the model? Exploring the value of the necrobiome is still in the early stages, but the results from this research suggest it is a worthwhile pursuit and may give forensic investigators another tool for their arsenal.

Reference
Johnson, H.R., Trinidad, D.D., Guzman, S., Khan, Z., Parziale, J.V., DeBruyn, J.M. and Lents, N.H. (2016) A machine learning approach for using the postmortem skin microbiome to estimate the postmortem interval. PLOS ONE 11, e0167370. doi: 10.1371/journal.pone.0167370

Updated April 11, 2024 to remove broken link.

Bio
Latest Posts

Sara Klink

Technical Writer at Promega Corporation

Sara is a native Wisconsinite who grew up on a fifth-generation dairy farm and decided she wanted to be a scientist at age 12. She was educated at the University of Wisconsin—Parkside, where she earned a B.S. in Biology and a Master’s degree in Molecular Biology before earning her second Master’s degree in Oncology at the University of Wisconsin—Madison. She has worked for Promega Corporation for more than 15 years, first as a Technical Services Scientist, currently as a Technical Writer. Sara enjoys talking about her flock of entertaining chickens and tries not to be too ambitious when planning her spring garden.

Latest posts by Sara Klink (see all)

A One-Two Punch to Knock Out HIV - September 28, 2021
Toxicity Studies in Organoid Models: Developing an Alternative to Animal Testing - June 10, 2021
Herd Immunity: What the Flock Are You Talking About? - May 10, 2021

Cookie	Duration	Description
cookielawinfo-checbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-advertisement	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertisement".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
gdpr_status	6 months 2 days	This cookie is set by the provider Media.net. This cookie is used to check the status whether the user has accepted the cookie consent box. It also helps in not showing the cookie consent box upon re-entry to the website.
lang		This cookie is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
SC_ANALYTICS_GLOBAL_COOKIE	10 years	This cookie is associated with Sitecore content and personalization. This cookie is used to identify the repeat visit from a single user. Sitecore will send a persistent session cookie to the web client.
vuid	2 years	This domain of this cookie is owned by Vimeo. This cookie is used by vimeo to collect tracking information. It sets a unique ID to embed videos to the website.
WMF-Last-Access	1 month 18 hours 24 minutes	This cookie is used to calculate unique devices accessing the website.
_ga	2 years	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_gid	1 day	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.

Cookie	Duration	Description
IDE	1 year 24 days	Used by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
test_cookie	15 minutes	This cookie is set by doubleclick.net. The purpose of the cookie is to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	This cookie is set by Youtube. Used to track the information of the embedded YouTube videos on a website.

Cookie	Duration	Description
BIGipServerwww.promega.com_sitecore		No description
CanCheckOut		No description
CommerceCustomerId		No description
CONSENT	16 years 7 months 15 days 6 hours 22 minutes	No description
cookies.js	session	No description
Country	3 months	No description
CountrySelected	3 months	No description
CustomerId		No description
PreferredLanguage	3 months	No description
PromegaCompno	3 months	No description
PromegaCountry	3 months	No description
RememberMe	6 months	No description
SameSite		No description
sc_ext_contact	2 years	No description
sc_ext_session	session	No description
TS01ae363a		No description
UID	2 years	No description
website#lang		This cookie is used for storing the visitor language preferences. It heps in delivering localised language version.
wp_api	past	No description
wp_api_sec	past	No description
_ga_WHZLGVEZ9X	2 years	No description

Cookie	Duration	Description
YSC	session	This cookies is set by Youtube and is used to track the views of embedded videos.
_gat_UA-62336821-1	1 minute	This is a pattern type cookie set by Google Analytics, where the pattern element on the name contains the unique identity number of the account or website it relates to. It appears to be a variation of the _gat cookie which is used to limit the amount of data recorded by Google on high traffic volume websites.

Promega Connections

Thoughts, tech tips and news about science

Revealing Time of Death: The Microbiome Edition

Sample Collection

Microbe Diversity

Model Development

Informative Taxa

Conclusions

Related Posts

Sara Klink

Latest posts by Sara Klink (see all)

Like this:

Leave a ReplyCancel reply

Sample Collection

Microbe Diversity

Model Development

Informative Taxa

Conclusions

Related Posts

Sara Klink

Latest posts by Sara Klink (see all)

Share this:

Like this:

Leave a ReplyCancel reply