Genetic differentiation between and within Northern Native American language groups: an argument for the expansion of the Native American CODIS database

The National Research Council recommends that genetic differentiation among subgroups of ethnic samples be lower than 3% of the total genetic differentiation within the ethnic sample to be used for estimating reliable random match probabilities for forensic use. Native American samples in the United...

Bibliographic Details
Main Authors: Jessica A. Weise, Jillian Ng, Robert F. Oldt, Joy Viray, Kelly L. McCulloh, David Glenn Smith, Sreetharan Kanthaswamy
Format: Article
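Language: English
Published: Oxford University Press 2021-07-01
Series: Forensic Sciences Research
Subjects: forensic sciences; population genetics; native americans; north america; languages; short tandem repeats (strs or microsatellites)
Online Access: http://dx.doi.org/10.1080/20961790.2021.1963088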
collection DOAJ
description The National Research Council recommends that genetic differentiation among subgroups of an ethnic sample be lower than 3% of the total genetic differentiation within that sample if the sample is to be used for estimating reliable random match probabilities for forensic use. Native American samples in the United States’ Combined DNA Index System (CODIS) database represent four language families: Algonquian, Na-Dene, Eskimo-Aleut, and Salishan. However, at least 27 Native American language families exist in the US, not including language isolates. Our goal was to ascertain whether genetic differences are correlated with language groupings and, if so, whether additional language families would provide a more accurate representation of current genetic diversity among tribal populations. The 21 short tandem repeat (STR) loci included in the Globalfiler® PCR Amplification Kit were used to characterize six indigenous language families, including three of the four represented in the CODIS database (i.e. Algonquian, Na-Dene, and Eskimo-Aleut), and two language isolates (Miwok and Seri) using major population genetic diversity metrics such as F statistics and Bayesian clustering analysis of genotype frequencies. Most of the genetic variation (97%) was found within language families rather than among them (3%). In contrast, when only the three language families represented in both the CODIS database and the present study were considered, 4% of the genetic variation occurred among the language groups. Bayesian clustering resulted in a maximum posterior probability indicating three genetically distinct groups among the eight language families and isolates: (1) Eskimo, (2) Seri, and (3) all other language groups and isolates, thus confirming genetic subdivision among subgroups of the CODIS Native American database. This genetic structure indicates the need for an increased number of Native American populations based on language affiliation in the CODIS database as well as more robust sample sets for those language families. Supplemental data for this article is available online at https://doi.org/10.1080/20961790.2021.1963088.
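The 3% criterion in the description compares the among-group share of genetic variation with the total. As a rough illustration only, the sketch below computes Nei's G_ST (an F_ST-style statistic) for a single locus from allele frequencies and checks it against that threshold. The function names, the equal weighting of groups, and the two-allele frequencies are all assumptions made for this example; they are not the study's data, loci, or software.

```python
from typing import List, Sequence

NRC_THRESHOLD = 0.03  # 3% among-subgroup share cited in the description


def expected_heterozygosity(freqs: Sequence[float]) -> float:
    """Expected heterozygosity 1 - sum(p_i^2) for one locus."""
    return 1.0 - sum(p * p for p in freqs)


def nei_gst(group_freqs: List[Sequence[float]]) -> float:
    """Nei's G_ST = (H_T - H_S) / H_T for one locus.

    Assumes every group lists the same alleles in the same order and
    that groups are weighted equally (a simplification for the sketch).
    """
    n_groups = len(group_freqs)
    n_alleles = len(group_freqs[0])
    # H_S: mean within-group expected heterozygosity
    h_s = sum(expected_heterozygosity(f) for f in group_freqs) / n_groups
    # H_T: expected heterozygosity of the pooled (mean) allele frequencies
    mean_freqs = [
        sum(f[i] for f in group_freqs) / n_groups for i in range(n_alleles)
    ]
    h_t = expected_heterozygosity(mean_freqs)
    return (h_t - h_s) / h_t if h_t > 0 else 0.0


def exceeds_nrc_threshold(gst: float) -> bool:
    """True when the among-group share is above the 3% guideline."""
    return gst > NRC_THRESHOLD


if __name__ == "__main__":
    # Hypothetical two-allele frequencies for two groups (not real data)
    toy = [(0.6, 0.4), (0.4, 0.6)]
    gst = nei_gst(toy)
    print(f"G_ST = {gst:.3f}, exceeds 3%: {exceeds_nrc_threshold(gst)}")
```

A real analysis would average over all 21 loci, weight by sample size, and typically use an estimator such as Weir and Cockerham's rather than this single-locus form.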
first_indexed 2024-03-12T05:29:00Z
format Article
id doaj.art-c3b205df8128494aaaf0bdbc333776f5
institution Directory Open Access Journal
issn 2096-1790
2471-1411
language English
last_indexed 2024-03-12T05:29:00Z
publishDate 2021-07-01
publisher Oxford University Press
record_format Article
series Forensic Sciences Research
spelling doaj.art-c3b205df8128494aaaf0bdbc333776f5; 2023-09-03T07:06:46Z; eng; Oxford University Press; Forensic Sciences Research; 2096-1790; 2471-1411; 2021-07-01; 10.1080/20961790.2021.1963088
Jessica A. Weise (0): Forensic Science Graduate Program, University of California
Jillian Ng (1): Molecular Anthropology Laboratory, Department of Anthropology, University of California
Robert F. Oldt (2): School of Mathematical and Natural Sciences, Arizona State University
Joy Viray (3): Sacramento County District Attorney’s Crime Laboratory
Kelly L. McCulloh (4): Forensic Science Graduate Program, University of California
David Glenn Smith (5): Molecular Anthropology Laboratory, Department of Anthropology, University of California
Sreetharan Kanthaswamy (6): School of Mathematical and Natural Sciences, Arizona State University
topic forensic sciences
population genetics
native americans
north america
languages
short tandem repeats (strs or microsatellites)
url http://dx.doi.org/10.1080/20961790.2021.1963088