Datasets

Information about the Datasets Used with OdontoSearch 3.0

The two distinct datasets utilized as the core of this research correspond to different U.S. populations, one composed of a military population and another composed of the civilian population. The datasets are referred to as: TSCOHS (Military) and NHANES (Civilian). The data from these studies have been pooled together for use with OdontoSearch 3.0. All datasets are composed of dental records of United States residents. All records pertain to adults. In total the dataset to be used with OdontoSearch 3.0 consists of 57,980 individuals.

For OdontoSearch 3.0, the ages of individuals in the dataset was expanded beyond what was included in previous versions of OdontoSearch. Now, individuals between 14 and 90 years of age are included. Customized searches can be completed based on age, sex, and ancestry.

TSCOHS Data (Military):


The dental health data representing the modern military population was graciously provided by the Tri-Service Center for Oral Health Studies, which is affiliated with the Uniform Services University of the Health Sciences, Bethesda, Maryland. The raw data from this source were originally gathered by the Tri-Service Center for Oral Health Studies as part of an ongoing study observing dental health throughout the active duty and recruit population of the U.S. military. The data was collected in 1994 and 2000 as part of two phases of the Tri-Service Comprehensive Oral Health Survey (TSCOHS). The 1994 data is composed of detailed dental conditions of active duty and recruits from all branches of the service and from different military installations across the continental U.S. The year 2000 phase of TSCOHS considered all branches of the military, but only in regard to recruits. Because the 2000 data only included recruits, the combined TSCOHS dataset is biased towards the recruit population as opposed to active duty.
These data represent the first military oral health study to be conducted on a tri-service level. The study design was created to be comparable to large-scale civilian dental health studies. The TSCOHS utilized electronic data collection, which greatly reduced the chance of data entry errors and expedited analysis. The data was collected from airmen, sailors, and soldiers by clinical examination and with radiographs. Additional information regarding TSCOHS can be found at their website ( http://www.usuhs.mil/tscohs).
As these data were originally collected for dental health assessments, thorough documentation and coding of information was available to an extent that surpassed the detail needed for this research. For example, the raw data had separate codes for teeth missing due to decay and teeth missing for reasons other than decay. Similarly, teeth with sound restorations were differentiated from teeth with faulty restorations. For use with OdontoSearch 3.0, these distinct codes were collapsed into single codes designating, for example, that a tooth was simply missing regardless of the cause.

Sample Size and Demographic Composition of the TSCOHS Data


(N= 19,422)

Age White Black Other
- Male Female Male Female Male Female
17-19 2,116 474 521 192 468 119
20-29 5,789 1,004 1,542 414 936 166
30-39 2,966 314 713 127 353 29
40-49 768 75 152 26 107 10
50-59 30 2 2 0 5 1
60-69 1 0 0 0 0 0
Total 11,670 1,869 2,930 759 1,869 325

 

 

 

 

NHANES Data (Civilian):


A dataset of the modern civilian population was derived from several components of the National Health and Nutrition Examination Survey (NHANES). Dental data from several NHANES studies were consolidated covering the years from 1988 through 2012. The NHANES study is a cross-sectional survey that was conducted by the National Center for Health Statistics and the Centers for Disease Control and Prevention, in collaboration with a large consortium of federal agencies, including the National Institute of Dental Research. The NHANES study is a multifaceted health examination survey that was conducted in various locations across the United States to collect data on the civilian, noninstitutionalized population. Dental health information represents only a single facet of the overall study. For adults, only data for 28 permanent teeth were collected (3rd molars excluded). These data are available to the general public for research purposes via their website (http://www.cdc.gov/nchs/about/major/nhanes/datalink.htm).

As the NHANES dataset contains information on a range of individuals from infants to the elderly, a subset of data was extracted for this research that consisted of individuals over 14 years of age.

As NHANES was conducted in order to examine the dental health of the US civilian population, much more detail was documented than was necessary for this research. For example, dental health studies are only concerned with teeth that are missing due to decay, but based on the information included in this study it was possible to include teeth missing due to any cause. The format of the data contained in NHANES is very similar to the TSCOHS and generally allowed for the codes to be simplified for use with OdontoSearch 3.0.


Sample Size and Demographic Composition of the NHANES Data




(N= 38,558)

Age White Black Other
- Male Female Male Female Male Female
14-19 1,281 1,422 1,295 1,222 1,224 1,227
20-29 1,522 1,767 777 956 633 716
30-39 1,371 1,688 716 887 518 579
40-49 1,229 1,264 634 735 465 501
50-59 1,052 1,122 438 515 310 344
60-90 3,259 3,328 988 1,054 750 769
Total 9,714 10,591 4,848 5,369 3,900 4,136

 

Combined Data (TSCOHS and NHANES):

This dataset is simply a compilation of all of the data from the TSCOHS and the NHANES data. See the corresponding sections for information related to these samples. As this dataset represents a large sample of the contemporary U.S. military and civilian population, it should be very useful for calculating frequency values related to modern forensic cases. The combined data format includes a total sample size of 57,980 individuals.


Sample Size and Demographic Composition of Combined Data TSCOHS and NHANES Data



(N= 57,980)

Age White Black Other
- Male Female Male Female Male Female
14-19 3,397 1,896 1,816 1,414 1,692 1,346
20-29 7,311 2,771 2,319 1,370 1,569 882
30-39 4,337 2,022 1,429 1,014 871 608
40-49 1,997 1,339 786 761 572 511
50-59 1,082 1,124 440 515 315 345
60-90 3,260 3,328 988 1,054 750 769
Total 21,824 12,460 7,778 6,128 5,769 4,461