1. 5 .1. .
u. . 1.

 

.Iif:
» U ,

r A \
$73K
i so .u

....
yr?

.
z 1
at? .
1:. ..

 

4 l, “3. .m.
.1156..." .5. 1‘ , , ‘ . L .
4......2. Au . .

, 9.: - Hut .

 

.kmnur , . ,
.mmﬁﬂw.

P1»!
II- tun”.—
'l3.
4:...)

Firm...

 

 

33.!

“-11%

an."

LI.

 

 

J9:

 

his” L.- ... . t.
‘K. U'- V . , y y ‘ ‘ nl‘»!}‘4"‘ I
z ‘ . 2. L \ .531“. ‘ ‘ , ... ‘ , . ﬁfviarPI ﬁght-Q

;._.._£1 :1... ﬁgéaﬁga" m H V ﬁémuﬁwuﬁ“

) .17.

.11.: ...v . a

 

w

 

 

This is to certify that the
thesis entitled

THE CHARACTERIZATION OF VARIANT ALLELES AT THE
13 CODIS STR LOCI FOR USE IN PATERNITY DISPUTE
RESOLUTIONS

presented by

Catherine Therese Allor

has been accepted towards fulﬁllment
of the requirements for the

Master of degree in Criminal Justice with
Science Specialization in Forensic
Science

 

/

W
‘Major Professor's Signature

12/4/06

Date

MSU Is an Aﬂlnmtlve Action/Equal Oppommlly Institution

_ _A.. —.-.-------------~----- —

o r—‘hv n

9 v Vﬂ‘

. C T- ‘

 

LIBRARY
Michigan State
University

 

 

 

PLACE IN RETURN BOX to remove this checkout from your record.
To AVOID FINES return on or before date due.
MAY BE RECALLED with earlier due date if requested.

 

DATE DUE DATE DUE DATE DUE

 

 

 

 

 

 

 

 

 

 

 

 

 

 

6/01 c:/ClRC/DateDue.p65~p.15

THE CHARACTERIZATION OF VARIANT ALLELES AT THE 13 CODIS STR
LOCI FOR USE IN PATERNIT Y DISPUTE RESOLUTIONS

By

Catherine Therese Allor

A THESIS

Submitted to
Michigan State University
in partial fulﬁllment of the requirements
for the degree of

MASTER OF SCIENCE
School of Criminal Justice

2004

ABSTRACT

THE CHARACTERIZATION OF VARIANT ALLELES AT THE 13 CODIS STR
LOCI FOR USE IN PATERNIT Y DISPUTE RESOLUTIONS

By

Catherine Therese Allor

This study investigated rare, genetic anomalies known as variant alleles, and their
role in resolving paternity disputes. Paternity analysis typically involves the comparison
of DNA proﬁles from a mother, child and alleged father, and conclusions are drawn
based on the genetic evidence. In the past, most variant alleles could not be utilized in
these analyses, leading to weaker genetic support for the conclusions. A total of 32,671
DNA proﬁles were examined for variant alleles. A selection of affected samples were
quantiﬁed, ampliﬁed and subjected to gel electrophoresis to conﬁrm the presence of
variant alleles. A total of 85 variant alleles at 12 of the 13 CODIS loci were conﬁrmed in
757 samples. The affected samples were sorted by racial group in order to calculate
allele frequencies. Twenty-eight of the variant alleles were observed in 5 or more
samples, regardless of racial group, and were added to the Orchid GeneScreen allele
frequency database for use in paternity calculations. Paternity analyses were performed
on two cases to demonstrate that the use of variant allele data greatly increases the
strength of the genetic evidence, providing further support for the paternity conclusions.
It is expected that the regular use of these alleles may lead to a reduction in laboratory
expenditures, minimize the need for additional testing and decrease the tum-around time
for reporting results. In addition, 50 of the variant alleles were listed on the STRBase

website as a reference for others in the paternity and forensic science communities.

To my family. Your love and support mean the world to me.

ACKNOWLEDGEMENTS

I would like to thank several people for helping to make this project a success.
First, I would like to acknowledge my advisor, Dr. David Foran, for his guidance and
assistance in editing this paper. I could not have done this without your help. I would
also like to recognize Dr. Marco Scarpetta, Manager of Paternity and Associate
Laboratory Director at Orchid GeneScreen as the creative genius and my mentor during
this project. Without your direction, I might still be looking for thesis topic ideas. Dr.
John Butler of the National Institute of Standards and Technology is owed a great deal of
thanks for performing some additional tests on several of my samples. Your
contributions were greatly appreciated. The laboratory staff at Orchid GeneScreen is
owed a great deal of thanks for assisting with some of the preliminary data collection.
Your efforts made a seemingly endless amount of work feel less overwhelming. Lastly,
Dr. Jay Siege], my professor and advisor is owed a tremendous amount of thanks for all
of his encouragement and endless support throughout this process. It has been both a
pleasure and an honor to have been one of your students, and I am extremely proud

knowing that I have learned from the best!

TABLE OF CONTENTS

LIST OF TABLES .............................................................................. vi

LIST OF FIGURES ............................................................................. viii
LIST OF ABBREVIATIONS .................................................................. x

INTRODUCTION .............................................................................. 1

MATERIALS AND METHODS .............................................................. 10
RESULTS ......................................................................................... 27
DISCUSSION .................................................................................... 47
APPENDIX A — Original Orchid GeneScreen Allele Frequency Database . . . . . . ... 74
APPENDIX B -— Modiﬁed Orchid GeneScreen Allele Frequency Database ............. 79
APPENDIX C — Electropherograms For 14 Novel Variant Alleles ....................... 83
APPENDIX D — Formulas for Paternity Indices ............................................. 89
BIBLIOGRAPHY ................................................................................. 91

Table 1:

Table 2:

Table 3:

Table 4:

Table 5:

Table 6:

Table 7:

Table 8:

Table 9:

Table 10:

Table 1 1:

Table 12:

Table 13:

Table 14:

Table 15:

Table 16:

Table 17:

Table 18:

Table 19:

Table 20:

Table 21:

LIST OF TABLES

Examples of Paternity Inclusions and Exclusions .................................. 4
Information Recorded for Samples with Suspected Variant Alleles ............ 12
Concentrations and Positions of Quantiﬁcation K562 DNA Standards. . . . . 13
Volumes of Ampliﬁcation Kit Components Added Per Sample ................ 15
Thermal Cycler Temperatures and Times .......................................... 15
Sample Loading Buffer Solution Components and Volumes .................... 16
Loading Volumes for Samples, Controls and Ladder ............................. l7
Electrophoresis Parameters for the ABI Prism® 377 DNA Sequence ....... 17
117 Suspected Variant Alleles ....................................................... 27
Conﬁrmed Variant Alleles .......................................................... 28
Adjusted Variant Allele Designations ............................................. 31
Suspected Variant Alleles That Were Not Conﬁrmed ........................... 32
Conﬁrmed FGA Variants — SGM PlusTM Analysis .............................. 34
Extrapolated FGA Variant Allele Designations .................................. 34
XI Microvariant Alleles ............................................................ 36
X2 Microvariant Alleles ............................................................ 36
X3 Microvariant Alleles ............................................................ 37
Variant Alleles Below the Ladder .................................................. 37
Variant Alleles Above the Ladder .................................................. 38
STRBase Categorization of Variant Alleles ....................................... 39
Variant Allele Observations and Frequencies ..................................... 41

vi

Table 22:

Table 23:

Table 24:

Table 25:

Table 26:

Table 27:

Table 28:

Table 29:

LIST OF TABLES (cont’d)

28 Variant Alleles Incorporated into Orchid GeneScreen Database ........... 43
28 Adjusted Variant Allele Frequencies and Observations ..................... 46
14 Novel Variant Alleles ............................................................ 53
CPI Calculations — Most Common Variant Allele ............................... 67
CPI Calculations - Least Common Variant Allele ............................... 69
Original Orchid GeneScreen Allele Frequency Database ....................... 75
Modiﬁed Orchid GeneScreen Allele Frequency Database ...................... 8O
Formulas for Paternity Indices ................................................... -... 9O

vii

Figure 1:
Figure 2:

Figure 3:

Figure 4:
Figure 5:
Figure 6:
Figure 7:
Figure 8:

Figure 9:

Figure 10:

Figure 1]:
Figure 12:
Figure 13:
Figure 14:
Figure 15:
Figure 16:
Figure 17:
Figure 18:
Figure 19:
Figure 20:

Figure 21:

LIST OF FIGURES

Electropherogram of a Locus Containing an Off—Ladder Allele ............... 7
Electropherogram showing Pull-Up Peaks caused by Amelogenin Marker.. 11
Electropherogram showing Microvariant Allele Calculations at the FGA
Locus ................................................................................... 19
Example of an X] Microvariant at the D78820 Locus .......................... 48
Missing 75bp Size Standard Peak ................................................... 55
Low Size Standard RFUs ............................................................ 57
Migration Anomalies in the Size Standards ....................................... 59
Allele Migration Anomaly at the D1885] Locus .............................. i. .. 61
Off—Ladder Pull-Up Peak at the D881179 Locus ................................. 63
Incomplete Adenylation at the D7882O Locus ................................... 64
15.1 Microvariant at the D3Sl358 Locus .......................................... 84
Below Ladder 14.3 Microvariant at the FGA Locus ............................ 84
Below Ladder 15.3 Microvariant at the FGA Locus ............................ 84
21.1 Microvariant at the FGA Locus .............................................. 85
Above Ladder 33.1 Microvariant at the FGA Locus ............................ 85
Above Ladder 34.1 Microvariant at the FGA Locus ............................ 85
Above Ladder 41.2 Microvariant at the FGA Locus ............................ 86
13.3 Microvariant at the D1885] Locus .......................................... 86
16.1 Microvariant at the D1885] Locus .......................................... 86
Above Ladder 28.] Microvariant at the D1885] Locus ........................ 87
12.3 Microvariant at the D588l8 Locus .......................................... 87

viii

LIST OF FIGURES (cont’d)

Figure 22: Above Ladder Variant Allele 18 at the D58818 Locus .......................... 87
Figure 23: Below Ladder 5.2 Microvariant at the D7882O Locus ........................... 88
Figure 24: 7.3 Microvariant at the TPOX Locus .............................................. 88

AABB
AF

BP/bp

CPI

DNA

PCR

PI

STR

LIST OF ABBREVIATIONS

— American Association of Blood Banks

- Alleged Father

- Base Pair

- Child

- Cumulative Paternity Index

- Deoxyribonucleic Acid

- Mother

- Polymerase Chain Reaction

- Paternity Index

- Relative Fluorescence Unit

- Short Tandem Repeat

INTRODUCTION

DNA analysis is currently at the forefront of human identity testing, utilized by
both the forensic and parentage testing communities. Current methods of DNA testing
make it possible to distinguish one person from all other humans, living or dead, based on
their genetic makeup. While forensic DNA analysis is typically performed for matters of
a criminal nature, parentage testing involves the examination of DNA to determine
familial relationships. The majority of parentage cases are paternity disputes, which
involve the testing of a mother, child and alleged father to determine the likelihood that
the tested man is the biological father of the child. Other parentage cases involve the
establishment of maternity, where an alleged mother and child are tested, typically
without the biological father. Maternity testing is commonly performed for adoption
agencies that want to verify the relationship between the mother and the child to be
adopted. A third form of parentage identiﬁcation, sibship testing, is useful in determining
whether two or more individuals have the same biological mother or father.

DNA proﬁles are generated for each individual involved in a parentage testing
case. These proﬁles contain several pieces of genetic information useful in establishing
identity. The more genetic information that can be gathered from the individuals
involved in a parentage testing case, the stronger the conclusions become. Sometimes,
one or more pieces of information are missing from a DNA proﬁle, and the strength of
the parentage conclusions diminishes. A lack of data in a DNA proﬁle can be caused by
several factors, including genetic variations not recognized by the software that generates

the proﬁles. This project was an investigation into a speciﬁc type of rare genetic

anomaly found in certain individuals’ DNA that can affect the results of parentage

testing.

DNA Analysis — Orchid GeneScreen Paternity Laboratory

Orchid GeneScreen in East Lansing, Michigan is a high throughput parentage-
testing laboratory that currently tests approximately 135,000 samples from 45,000 cases
annually using DNA analysis. Although the laboratory performs paternity, maternity and
sibship testing on a regular basis, approximately 96% of the samples are tested for
paternity, and that nomenclature will be used throughout this paper. Nearly 65% of the
paternity testing performed by Orchid GeneScreen is court—ordered for the purposes of
providing ﬁnancial support to children, with the remaining 35% performed for private
accounts.

A standard paternity case involves 3 samples (one each for the mother, child and
alleged father). Laboratory analysis involves the comparison of the child’s DNA proﬁle
to those of the mother and alleged father. Every individual inherits two copies of their
chromosomes, one from each biological parent, and speciﬁc chromosomal regions known
as loci are examined in paternity analysis. These loci are comprised of short tandem
repeats (STRs), 2 to 7 base pair (bp) repeats of DNA that vary in number among
individuals. For example, the sequence ‘AGAGAG’ contains three copies of the 2bp
repeat unit, AG. STR loci are often highly polymorphic, and a multiple number of alleles
existing at each locus, facilitating the differentiation of individuals.

The STR loci commonly analyzed by Orchid GeneScreen contain repeat

sequences comprised of 4bp. The repeat pattern can be simple, compound or complex.

A simple repeat pattern consists of a single STR unit repeated a variable number of times

(n), such as [AGAT]n. A compound repeat is comprised of two or more adjacent STR

units such as TCTA[TCTG]3[TCTA],,. Complex repeat patterns contain variable DNA

sequences amidst blocks of several different STR units, such as

[TCTA]n[TCTG]n{ [TCTA]3TA[TCTA]3TCA[TCTA]2TCCATA } [TCTA]nTATCTA

(Butler, 2001, Urquhart et al., 1994).

The number of repeat units determines the size of each allele, which is measured
using standards added to each sample prior to analysis. The nomenclature, or designation
for each allele is then determined based on the number of repeats present. For example, a
person may be typed as a ‘7/8’, having inherited 7 repeat units from one parent and 8
from the other. Alleles are assigned repeat numbers based on the comparison of their size
to the size of previously deﬁned alleles found in an ‘allelic ladder’. These ladders
contain the most common alleles existing for each locus. The comparisons made
between the DNA proﬁles of the child and alleged father result in a classiﬁcation of
‘inclusion’ or ‘exclusion’ for each locus. A paternal inclusion occurs at a locus when the
alleged father and the child share the obligate paternal allele, the allele that must be
contributed by the biological father. A paternal exclusion occurs at that locus if the

alleged father could not have contributed the obligate paternal allele to the child (Table

l).

Table 1: Examples of Paternity Inclusions and Exclusions
The numbers indicate the allele designation for the mother, child and alleged father for each locus.
Heterozygous loci are indicated by two different allele designations, while homozygous loci have two
copies of a single allele designation. A paternal inclusion occurs at a locus when the alleged father shares a
common allele with the child that the mother did not contribute to the child, the obligate paternal allele. A
paternal exclusion occurs at a locus when the alleged father could not have contributed the obligate paternal
allele to the child.

12 Inclusion
Exclusion

Inclusion
Exclusion
Inclusion
Exclusion

 

An inclusion at a given locus can occur by chance, even if the tested man is not
the biological father. Therefore, several different loci must be tested in order to reduce
the possibility of a random match between unrelated individuals. Orchid GeneScreen
regularly tests samples using between 9 — 13 different CODIS STR loci. Exclusions at a
minimum of 2 loci are required by the American Association of Blood Banks (AABB) to
conclude that the tested man is not the biological father. There are occasions when the
biological father may differ from the child at one or two loci due to mutations that can
occur in the sperm. At Orchid GeneScreen, single exclusions occur at a per-case ratio of
1:50, with double exclusions expected to occur at a 1:2500 ratio. For this reason, Orchid
requires exclusions at three loci before rendering the conclusion that the tested man could
not have fathered the child.

The ladder alleles for each locus have established frequencies that represent how
often the allele is seen within a given racial group (Caucasian, African American,
Hispanic or Other). The allele frequencies are utilized to calculate a paternity index (PI)

value for each locus. A P] is a likelihood ratio deﬁned as the probability that an event

will occur under certain conditions divided by the probability that the same event will
occur under a different, mutually exclusive set of conditions (Traver, 1998). More
speciﬁcally, the PI ratio is the likelihood that a non-excluded alleged father from one
racial group is the biological father of a child divided by the likelihood that a man
randomly selected from the same racial group is the biological father. The PIs for all loci
are then multiplied together to calculate the cumulative paternity index (CPI). The CPI is
a numerical value that indicates how strongly the genetic data support either the
hypothesis that the alleged father is the biological father or the alternative hypothesis that
another man is the biological father (Traver, 1998).

Theoretically speaking, CPI values can range from 0 to inﬁnity. CPI values less
than ] indicate that the alleged father may not be the biological father, while a CPI value
of 0 signiﬁes that the alleged father cannot be the biological father and has been
excluded. When the CPI value is 1 or greater, the genetic data support the hypothesis that
the tested man could have fathered the child. As the CPI value increases, the genetic data
more strongly support this hypothesis (Traver, 1998).

In addition to the CPI value, the probability of paternity is included with all
paternity reports. This value is a measure of the strength of one’s belief in the hypothesis
that the tested man in the father (Traver, 1998). This probability is not only based on the
genetic evidence (e. g. the CPI value), but also on the prior probability of paternity. This
prior probability refers to the strength of one’s belief that the tested man is the father
based solely on non-genetic evidence, such as the mother’s assertion that the correct man

was tested (Traver, 1998). Orchid GeneScreen uses the commonly utilized prior

probability value of P = 0.5. As the CPI value increases, the probability of paternity

increases proportionally.

Variant Alleles

The DNA proﬁles of the majority of individuals involved in a paternity dispute
contain alleles found within the ladders. However, some people have rare variant alleles
that do not correspond with those in the ladder. Variant alleles differ from the ladder
alleles by one or more hp or one or more STR units. This variation is caused by
mutations involving the insertion or deletion of nucleotides within the repeat sequence
(Butler, 2001).

Some variants lie beyond the smallest or largest alleles found in the ladder.
Crouse et al. (1999) proposed that the general nomenclature for these variants include a
‘>’ or ‘<’ sign in reference to the nearest allele. For example, if the largest ladder allele
has 30 repeat units, a variant containing more than 30 repeats would be described as a
‘>30’ allele.

Other variants are found amongst the alleles in the ladder, and differ from them
by 1, 2 or 3bp. These are termed “microvariants” due to the small size deviation from the
ladder alleles. There are several common microvariants, some of which are included in
the ladder. The general nomenclature for microvariants contains the ladder allele
designation that has the full number of repeats followed by a decimal value indicating the
number of additional bp present (Crouse et al., 1999). For example, an allele that is 2bp
larger than allele 20 would be designated as 20.2. The highly polymorphic loci,

including FGA, D2181] and D1885] contain the most microvariant alleles. These loci

contain larger sized compound and complex repeat patterns, with more locations for
mutations to occur as compared to loci that have simple repeat patterns (Butler, 2001).
Samples containing variant alleles are revealed during the analysis of DNA
proﬁles. Graphical representations of the alleles present in the DNA sample, known as
electropherograms, are generated using software programs after the sample has been
tested. These programs label the alleles with the repeat number based on the sizes and
designations of the ladder alleles. When the software is unable to recognize an allele at a

locus, the allele is labeled “off-ladder” (Figure l).

 

Figure l: Electropherogram of a Locus Containing an Off-Ladder Allele

Each peak represents a single allele. The off-ladder allele is designated with the “OL Allele?” label. This
off-ladder allele has a measured size of 204.27bp. Allele l] is referred to as the “sister” allele, and has a
measured size of 219.70bp. Although the actual bp size of the allele is a whole number, the analysis
software estimates the size to 2 decimal points.

Alleles that are not recognized by the analysis software do not have established
frequencies and cannot be used to calculate the CPI. As a result, all variant alleles not
included in the ladder are of no value in establishing paternity.

Not all off-ladder alleles are caused by the presence of variant alleles. Several
factors may lead to the presence of off-ladder alleles including an insufﬁcient volume of
sample loaded onto the gel, using excessive amounts of DNA for testing (which can
cause extra, off-ladder peaks to be present in a locus), and weak signal intensity of the

size standards used to assign allele designations. Therefore, certain steps must be taken

in order to conﬁrm the presence of a variant allele. Butler (2001) proposed that the most
common method of conﬁrming a variant is the re-ampliﬁcation of the sample in question,
and the reanalysis of the DNA proﬁle. If the off—ladder allele is present in the DNA
proﬁle after re-analysis and no other causative factors can be identiﬁed, the allele can be

conﬁrmed as variant.

Loss of Data for Paternity Calculations

If an alleged father cannot be excluded as the biological father of a child, the
value of the CPI determines how well the genetic data support the conclusion of
paternity. The more P] values used to calculate the CPI, the larger the CPI value
becomes and the greater the likelihood of correctly identifying the biological father.
However, the CPI value decreases when data from any loci are missing, providing
weaker support for the hypothesis of paternity. Orchid GeneScreen requires data from a
minimum of 6 loci with paternal inclusions and a minimum CPI value before a report of
inclusion can be generated. Minimum CPI values range from 100 to 10,000 depending
on the particular account to which a paternity case belongs. Cases that do not meet these
criteria require the testing of additional loci before a paternity report can be issued.
While supplementary testing is very costly in terms of resources, it also delays the

reporting of results, ultimately leading to an increase in laboratory tum-around time.

The Investigation
In recent years, the number of samples submitted to Orchid GeneScreen for

paternity analysis has steadily increased, and variant alleles have been observed more

frequently. This investigation was designed as a means to learn more about them and
how often they occur. Samples containing suspected variants were re-ampliﬁed and re-
analyzed to conﬁrm their existence. Frequencies were calculated for conﬁrmed variants
found in each racial group. The ultimate goal of this investigation was to measure the
extent to which the CPI values would increase in cases where the allele shared by the
child and the alleged father was variant.

An additional aspect of this investigation involved the submission of conﬁrmed
variant alleles to the STRBase Internet website database
(http://www.cstl.nist.gov/biotech/strbase/). An important function of this website is to
provide information about variant alleles to members of the forensic science, paternity
and general biology communities. By submitting these alleles, they are not only
beneﬁcial to Orchid GeneScreen, but also to any individual hoping to obtain information

about them.

MATERIALS AND METHODS

Reviewing the Orchid GeneScreen DNA Proﬁle Database

A total of 32,671 DNA proﬁles from 409 different AmpFESTR® Proﬁler PlusTM
and COﬁlerTM (Applied Biosystems) gel runs were reviewed for variant alleles. A
maximum of 90 DNA proﬁles can be found in each gel run. Using ABI Prism®
Genotyper® software (Applied Biosystems), the elecropherograms (referred to herein as
“original” electropherograms or DNA proﬁles) from each gel run were reviewed after
applying the “Kazaam (20% ﬁlter)” macro. This macro re-assigns allele designation
labels to all loci so that any off-ladder alleles are clearly marked as “OL Allele?” within
the electropherograms (Figure 1 above). This step was necessary as the labels from loci
containing off-ladder alleles are manually removed from the electropherograms prior to
being saved in the database (Proﬁler PlusTM User Manual, 1997).

Data from samples containing off-ladder alleles were recorded based on the

Orchid GeneScreen criteria for acceptable DNA proﬁles:

1) All 1] of the size standard peaks (GeneScan-500TM ROX, Applied
Biosystems), ranging from 75 — 400bp must be present in each sample and
have peak heights of at least 100 relative fluorescence units (RFUs) to yield
accurate base pair sizing of the alleles.

2) The sample in question must have allele peak heights 2150 RFUs. Alleles
with peak heights <600 RFUs must be scrutinized for signal leakage from

adjacent sample lanes.

10

3) Samples with high RFU values (>4000) must be checked for pull up, which
appears as a peak of similar bp size in the dye color above and/or below the
locus that contains the peak(s) with high RFU values (see example in Figure
2). These pull up peaks may be labeled as oﬁlladder alleles by the software
and should not be mistaken for true alleles.

4) Two peaks within a locus are considered a pair, or sister alleles, only if the

smaller peak’s RFU value is at least 50% of the larger peak’s RFU value.

4301101310P 43 Blue (43)1101310P

 

4301101310P 43 Green (43)]101310P

-- .11.
amelomgenin -i£

4301101310P 43 Yellow (43)]101310P

\1

E

  

IELEI
Figure 2: Electropherogram showing Pull-Up Peaks caused by Amelogenin Marker
The arrows indicate the pull-up peaks at the D3Sl358 locus in the blue dye region (top panel) and D58818
locus in the yellow dye region (bottom panel). These are mused by the amelogenin marker, the leftmost
marker in the green dye region (center panel). Pull-up peaks are sometimes labeled as oﬁ-ladder alleles

andmay falsely resembletrueallele. PeakstotherightofamelogeninarefmmothermarkersintheDNA
proﬁle.

11

Samples that met these criteria were classiﬁed as having “suspected” or “potential”

variant alleles and several pieces of information were recorded for each (Table 2).

Table 2: Information Recorded for Samples with Suspected Variant Alleles
The ampliﬁcation kit used, gel run number, sample barcode number and lane assignment were essential in
order to locate the original electropherograms in the database. The sample barcode number also indicated
where the sample could be located in the storage unit. The number of samples tested per gel was used to
determine the total number of DNA proﬁles reviewed. The locus containing the suspected variant, the
sizes of the suspected variant and the sister allele, as well as the sister allele designation were useful in
approximating the suspected variant allele designation.

 

Identification Categories

Ampliﬁcation Kit (Proﬁler PlusTM or COﬁlerTM)

 

Gel Run #
Sample Barcode #
Lane Assignment #

 

 

 

# Samples Tested on Gel

 

Locus Containing Suspected Variant Allele

 

bp Size of Suspected Variant Allele

 

Sister Allele Designation

 

 

 

Sister Allele’s bp Size

 

Approximation of Allele Designations and SaImple Selection

Samples containing potential variant alleles were sorted and grouped together per
affected locus and size. Allele designations for those found within the ladder were
approximated by comparing their sizes with the mean size of the ladder alleles listed in
the Proﬁler PlusTM or COﬁlerTM User Manuals (1997). Designations for suspected variant
alleles found outside of the ladder were extrapolated based on their sizes as compared to
those of the smallest or largest ladder alleles.

In general, one representative sample from each variant allele grouping was re-

ampliﬁed and re-analyzed. Two or more samples were re-analyzed for variant allele

12

groupings with a broader range of bp sizes to determine if the grouping contained only
one allele, or two. The samples selected contained suspected variants with sizes near
either end of the range. For example, the observed size range for the 44.2 microvariant in
FGA was 326.40bp — 326.79bp. The two samples tested contained off-ladder alleles with

sizes of 326.44bp and 326.79bp.

DNA Quantitation

DNA quantitation was performed on DNA samples that were previously
organically extracted using in-house protocols (Orchid GeneScreen SOP Manual, 2001).
A single probe Hamilton MICROLAB® 2200 Robot Liquid Handler and an Eclipse v.

4.1 software application (both Hamilton Co.) were used to transfer 80uL of TE (lOmM

Tris-HCL, 0.1mM NagEDTA, pH 8) to the wells of a 96-well microplate. A robot then

added 20uL of each sample to the microplate. Twenty rnicroliters of varying
concentrations of K562 DNA standard was hand-added to the ﬁrst 12 wells of the

microplate (Table 3).

Table 3: Concentrations and Positions of Quantitation K562 DNA Standards
Each of the K562 DNA standards was added to two wells, and the measured average of each standard
concentration was used to determine the DNA concentration of each sample.

1 and 2 0.0 onl
3 and 4 0.5
5 and 6 2.5

7 and 8 5.0

9 and 10 7.5
11 and 12 10.0

 

13

A solution containing 100uL TE and 0.5uL PicoGreen® dsDNA quantitation reagent
(Molecular Probes) was made for each DNA sample and standard, and lOOuL of this
solution was added to each.

The microplate was placed inside an Fmax Microplate Quantiﬁer (Molecular
Devices) and the samples were allowed to incubate for 4 minutes. An Fmax Quantiﬁer is
a ﬂuorometer with an excitation maximum set at 485nm and an emission maximum set at
538nm. After excitation, a SoftMAX Pro v. 1.3.1 software program (Molecular Devices)
records the amount of light emitted by each sample. A SoftMAX Pro program created a
standard curve based on the measured concentration of each K562 DNA standard and
determined the DNA concentration of each unknown sample accordingly. The
concentration data were then exported from a SoftMax Pro program to a Laboratory
Information Management System (LIMS) software program v. 1.5 (Blue Sabre Systems).
LIMS was then used to calculate the amount of TE required for a DNA concentration of

approximately 0.25 — 0.30ng/uL, and the samples were hand diluted accordingly.

DNA Ampliﬁcation and Polymerase Chain Reaction (PCR)
A Hamilton robot and an Eclipse software ampliﬁcation program were used to

transfer 6uL of each sample into the corresponding wells of a 96-well microplate. In

addition, 6uL each of NANOpure® dHZO or AmprSTR® Control DNA 9947A were

also transferred to the microplate. The deO served as a negative control, while the

9947A served as a positive control.

14

An ampliﬁcation master mix of DNA primers, deoxynucleotide triphosphates
(dNTPs) and DNA polymerase was made for each sample or control (Table 4). Nine

microliters of the master mix were added to each for a total PCR volume of lSuL.

Table 4: Volumes of Ampliﬁcation Kit Components Added Per Sample

DNA primers, reaction mix containing dNTPs and DNA polymerase were combined to form the
ampliﬁcation master mix. The DNA primers varied depending on whether the Proﬁler PlusTM or
COﬁlerTM ampliﬁcation kit was used.

 

\laster .\li\ (,‘omponent Volume per sample fill.»

 

 

 

AmpFESTR® Proﬁler PlusTM/COﬁlerTM Primer Set 3.3
AmpFESTR® PCR Reaction Mix 6.3
AmpliTaq Gold® DNA Polymerase 0.3

 

 

 

The samples were then placed into a PTC-100TM Programmable Thermal Controller (MJ

Research, Inc.) for hot start PCR using the cycling parameters listed in Table 5.

Table 5: Thermal Cycler Temperatures and Times
During PCR, the samples underwent an initial incubation of 11 minutes at 95° prior to 28 cycles of
denaturation, annealing and extension. The ﬁnal extension step lasted 45 minutes at 60° before the ﬁnal

 

 

 

 

soak at 25°.
PCR Step 'l'emperalure 4 (‘1 Time (llllll.|
Initial Incubation 95 1 l
28 cycles: 1) Denature 94 l
2) Anneal 59 l
3) Extend 72 1
Final Extension 60 45
Final Soak 25 Hold

 

 

 

 

 

Gel Electrophoresis
At the completion of ampliﬁcation, 4uL of each sample were transferred from the

ampliﬁcation microplate to the corresponding wells of a 96-well loading microplate.

15

Four microliters of AmpFESTR® Profiler PlusTM or COﬁlerTM Allelic Ladder were added
to an empty well in the loading plate. A loading buffer solution was then made for the
samples, negative and positive controls and ladder. This solution contained the ROX
sizing standard, formamide loading solution and Hi-DiTM Formamide (Table 6). Each
sample or control was combined with 3.5uL of the loading buffer solution, and 4uL of

the solution was added to the ladder.

Table 6: Sample Loading Buffer Solution Components and Volumes

A loading buffer solution comprised of ROX, formamide loading solution and Hi-DiTM Formamide was
added to each sample, control or ladder prior to electrophoresis.

 

Loading Buffer (‘omponent Volume l’er Sample till;l

 

 

GeneScan-500TM ROX 0.55
Formamide Loading Solution 1.75
Hi-DiTM Formamide 1.75

 

 

 

 

The loading microplate was placed on a 95°C heat block for 3 minutes to denature the
samples, and then immediately placed on ice for 4 minutes.

A Long Ran ger® Singel® (BioWhittaker Molecular Applications) pack was
mixed and poured in between two 36cm well-to-read plates according to manufacturer’s
instructions. The gel was allowed to polymerize for a minimum of 2 hours at room
temperature prior to loading. Electrophoresis was performed using an ABI Prism® 377
DNA Sequencer and an ABI Prism® 377-96 Collection software v. 2.5 application

(Applied Biosystems). The running buffer used was 0.5X TBE (47mM Tris, 47mM
H3803, lmM NazEDTA, pH 8). Half of the DNA samples were loaded into their

respective lanes using a 96-lane Sharks Tooth Comb before the control reagents and

16

ladders were loaded. The loading volumes for the samples, controls and ladder are listed

in Table 7.

Table 7: Loading Volumes for Samples, Controls and Ladder
A total sample volume of 0.8uL was loaded in each lane. 1.5uL of allelic ladder was loaded in each of the
4 designated ladder lanes, while luL of the negative and positive controls was loaded in the designated
control lanes.

 

 

 

l)\ \ Source Volume l’er Lane rill»
Sample 0.8
Negative/Positive Controls I 1.0
Allelic Ladder | 1.5

 

 

 

Electrophoresis was performed for 1 minute prior to loading the remaining DNA samples
into their respective lanes. This was done to slightly stagger the DNA banding patterns
of the samples, which makes visualization of the lane assignments easier during post-
electrophoresis analysis. The gel run continued for 2.5 hours using the settings listed in
Table 8. The remaining samples were tested with two additional Proﬁler PlusTM gel runs

and one COﬁlerTM run.

Table 8: Electrophoresis Parameters for the ABI Prism® 377 DNA Sequencer

These are the optimal electrophoresis settings described in the Proﬁler PlusTM and COﬁlerTM User
Manuals (1997).

is Vo k 3.0
is Current mA 60.0

' Power 200.0
Laser Power m 40.0
Gel T 51.0

 

l7

Data Analysis

At the completion of electrophoresis, the resultant data were ﬁrst analyzed using a
GeneScan® software program. Following the guidelines in the Proﬁler PlusTM User
Manual (1997), the “Auto-Track” function was used to place a tracking line through each
sample’s DNA banding pattern. The “Extract” function was utilized to determine the
allele sizes before the data were imported into a Genotyper® software application. The
“Check G8500” macro was used to analyze the ROX peaks in the samples ampliﬁed
using the Proﬁler PlusTM kit, while the “Check G8350” macro was used for samples
ampliﬁed using the COﬁlerTM kit. Allele designations were assigned to the peaks in each
locus using the “Kazaam (20% Filter)” macro.

The sample DNA proﬁles were reviewed for the presence of off-ladder alleles at
the affected loci; any found were classiﬁed as “conﬁrmed” variant alleles. The barcode
numbers of samples not containing off-ladder alleles were recorded in order to review
their original electropherograms. This was done to determine reasons why these alleles
were off-ladder originally, but were recognized as one of the ladder alleles after re-
analysis. Both peaks in an affected locus were manually labeled with their bp sizes.

Allele designations were calculated for each variant found within the size range of
the ladder to conﬁrm whether or not the approximated designations were correct. Gill et
al. (2001) described three calculations used to compare the relative size difference (5)
between the sample alleles (S) and the ladder alleles (L) run under the same

electrophoretic conditions:

51=SY'LY 52:30L'Lx c=l51"52l

18

6] represents the size difference between non-variant sister allele Y, (8y) and ladder allele
Y (Ly). 82 signiﬁes the size difference between the off-ladder variant allele (801) and
ladder allele X (Lx), the Smaller allele adjacent to the variant. The e value is the relative

size shift between alleles in the sample and the ladder alleles, and indicates how many

additional bp are present in each variant (Figure 3).

 

TI'ITI'I'I'I'I'l‘l'l'l'l'l'l

236 238 240 242 244 246 248 250 252 254 256 258 260 262
FGA

' Pro Plus...erSamplel lBluc ProﬁlerPlus LADDER

  

 

 

 

off-ladder allele?
257.51

51 = 825 - L25 = 244.34 - 244.46 = -0.12bp

52 = Son. . L2,, = 257.51 - 256.64 = +0.87bp

c = |al - sz| = |-0.12 - 0.87| = 0.99bp

From Butler (2001)

Figure 3: Electropherogram showing Microvariant Allele Calculations at the FGA Locus

The sample in the bottom panel is compared to the ladder shown in the top panel. Peaks are labeled with
the allele designations and the hp size. The 51 calculation shows the size difference between sample allele
25 and ladder allele 25. The 82 calculation shows the size difference between the off-ladder sample allele
and ladder allele 28. The c calculation shows the absolute value of the difference between 51 and 52, and
this indicates how many additional bp are present in the off-ladder allele.

The c value in Figure 3 is 0.99bp, indicating that the relative peak shift between the off-
ladder allele and allele 25 in the sample is approximately 1bp. The off-ladder allele in
this sample was therefore designated as a 28.] microvariant.

Butler (2001) stated that allele designations must be extrapolated for variant
alleles found outside the expected ladder range, as the above calculations are intended
only for variants within the ladder. Therefore, allele designations for the beyond ladder
variants were extrapolated using their sizes as compared to the smallest or largest ladder
alleles.

Some suspected variant allele designations were adjusted after the calculations
and extrapolations were performed, as several errors were made when the designations
were initially assigned. These errors occurred because the designations were not
calculated using the formulas from Figure 3, but were instead approximated by the author
by comparing the sizes from the original run with the mean size of the ladder alleles
listed in the Proﬁler PlusTM or COﬁlerTM User Manuals (1997). As a result, some
samples were grouped with allele designations that were inaccurate. The adjustments
that were made involved combining samples from two groups of variant alleles (e. g.
combining 21.3 samples with the 21.2 samples), dividing one group into two groups (e. g.
dividing the 16.2 samples into 16.] and 16.2 samples), and assigning a different variant
allele designation to a group (e. g. changing the 24.3 samples into 24.2 samples).

Additional details on these adjustments can be found in the discussion.

20

Off—site SGM PlusTM Analysis

Sixteen samples containing above and below ladder variant alleles at the FGA
locus were sent to Dr. John Butler at the National Institute of Standards and Technology
(NIST) laboratory for SGM PlusTM analysis. This SGM kit contains additional alleles in
the FGA ladder not included in the Proﬁler PlusTM kit. The goal of this testing was to see
if they were in the kit and to demonstrate whether or not the variant allele designations
were accurately extrapolated. Only samples containing variant alleles that were within
4bp of the SGM ladder alleles were submitted for testing to avoid further extrapolations
of the designations. The samples were analyzed via capillary electrophoresis using an
ABI Prism® 3100 Genetic Analyzer (Applied Biosystems) according to N IST’s standard

operating procedures.

Classiﬁcation of Conﬁrmed Variant Alleles

The conﬁrmed variant alleles were sorted into 5 different categories based on
their sizes relative to the smaller, adjacent allele (X) with a full number of repeats. For
microvariant alleles below and above the ladder, X was not an allele included in the
ladder, but represented the smaller, adjacent, extrapolated allele. The ﬁrst category, X.1
microvariants, included alleles within the ladder that were lbp larger than the smaller,
adjacent allele. X.2 microvariants included alleles that were 2bp larger than the smaller
allele. The X.3 microvariants were 3bp larger than the smaller allele. Categories 4 and 5
were comprised of all variant alleles (including relevant X.1, X2 and X3 microvariants)

that were smaller or larger than the ladder alleles.

21

Submission to the STRBase Website

The conﬁrmed variant alleles were compared to those already listed on the
STRBase website to determine which could be submitted as previously unreported
alleles. It was relevant to note whether or not the alleles already listed on the STRBase
website had been tested using an ABI 377 because the way allele sizes are measured can
vary slightly depending on the instrument platform used for electrophoresis. These
variations are caused by differences in the type and concentration of the gel used and the
electrophoresis conditions (Proﬁler PlusTM User Manual, 1997). For example, the
STRBase listing for the 33.1 microvariant in D2181] indicates a size of 221.35bp when
electrophoresed using an ABI 310. In contrast, one of the 33.1 microvariants re-analyzed
for this project had a size of 224.73bp, appearing to be more than 3bp larger than the 33.1
microvariant tested on the AB] 310. The alleles were sorted into 3 categories:

1) Variant alleles not already listed on the STRBase website AND not included
in any of the allelic ladders used by Orchid GeneScreen.

2) Variant alleles already listed on the STRBase website that were
electrophoresed using an instrument other than an ABI 377, AND were not
included in any of the allelic ladders used by Orchid GeneScreen.

3) Variant alleles already listed on the STRBase website that were
electrophoresed using an ABI 377, OR were included in any of the allelic
ladders used by Orchid GeneScreen.

All variant alleles classiﬁed under categories 1 and 2 were submitted to the STRBase

website.

22

Racial Data and Variant Allele Frequencies

Orchid GeneScreen collects self-described racial data from both the mother (M)
and alleged father (AF) prior to testing, while the child’s (C) race can only be assigned
after paternity is established. This data was necessary as allele frequencies vary among
racial groups, and the categorization of the alleles by group inﬂuences the calculated
frequency values. Using a LIMS software application, each sample containing a
conﬁrmed variant allele was assigned to one of 4 racial groups: African American,
Caucasian, Hispanic and Other. The “Other” category included anyone who could not be
grouped with the other three or did not specify their race on the chain of custody form
that accompanied the samples to the lab. Racial data for individuals with conﬁrmed
variant alleles were recorded using a LIMS software application. LMS was then used to
sort the 32,671 DNA proﬁles reviewed for this project to yield the following numbers of

individuals (N):

0 Caucasian N = 16,257
0 African-American N = 14,160
0 Hispanic N = 1,895

o Other N = 359

Racial data for all children were designated using the 9 different classiﬁcations as

follows:

23

1. If M and AF were the same race, then AF race was used for C, regardless
of AF inclusion or exclusion in paternity.

2. If M and AF were different races and AF was included in paternity, then
AF race was used for C.

3. If M and AF were different races and AF was excluded in paternity, then
M race was used for C.

4. If M was not tested and AF race was known, then AF race was used for C
regardless of AF inclusion or exclusion in paternity.

5. If M race was known and AF was not tested, then M race was used for C.

6. If M race was known, AF race was Other and AF was excluded in
paternity, then M race was used for C.

7. If M race was known, AF race was Other and AF was included in
paternity, then C race was Other.

8. If M race was Other, AF race was known and AF was excluded in
paternity, then C race was Other.

9. If both M and AF races were Other, then C race was Other.

For each racial group, observed variant allele frequencies were calculated by counting the
number of occurrences and dividing by 2N, the approximated total number of alleles (n)

found in the above samples.

24

Variant Alleles and the Orchid GeneScreen Allele Frequency Database

The variant allele occurrences and frequencies were added to the Orchid
GeneScreen allele frequency database prior to their use in CPI calculations. The
laboratory directors decided that conﬁrmed variants found in 5 or more samples,
regardless of race, would be included in the database. In addition, only those alleles
within 12bp of the smallest and largest ladder alleles were chosen in order to minimize
the use of those with extrapolated designations. These extrapolated designations may or
may not be completely accurate as they were simply estimated based on the sizes of the
variant alleles and the directors wanted to avoid adding potentially erroneous alleles to
the database. The directors also decided that the number of variant allele observations,
and not the complete DNA proﬁles from affected individuals would be added to the
database. Thus, the addition of the alleles did not increase the number of individuals (N’)
in the Orchid database, but rather increased the total allele occurrences (n’) per locus.

The number of individuals reviewed in this investigation was roughly twice the
number of individuals in the Orchid database. The numbers of allele observations were
therefore approximately halved prior to adding to the Orchid database so that the
observed allele frequencies would not change. These adjusted values represented the
number of times each would have occurred had they been found in the Orchid database
samples. Because the observed allele frequencies were determined by dividing the
number of observations by 2N, the modiﬁed number of observations was calculated by
multiplying each frequency by 2N’. For example, if a variant allele was found 7 times in
African Americans with a frequency of 0.00024718, the extrapolated number of

observations amongst 7419 African Americans in the database was calculated as 3.67,

25

rounded off to 4 occurrences. The calculated observations were then added to the n’
values for each locus in the database. Frequencies for all alleles, including variants, were
re-calculated by dividing the number of occurrences by the new 11’ values.

One of the goals in calculating CPI values is to use allele frequencies that are
scientiﬁcally conservative. That is, they must not overestimate the strength of an
inference that the tested man is the biological father. According to the National Research
Council (1996), allele frequencies can be very inaccurate if the allele is so rare that it is
only observed one or a few times in a database. Therefore, conservative frequency values
must be used so that the power of the tests is not compromised. These values are used in
paternity calculations for alleles found 5 times or less in a racial group. This value was
calculated using 5 allele observations and dividing by the total number of alleles included
in the database, and was determined for the loci tested in each group using 5/2N’. The
original database is included in Appendix 1, and the modiﬁed database with the 5/2N’

values is found in Appendix 2.

26

RESULTS

Approximation of Suspected Variant Allele Designations

A total of 32,671 DNA proﬁles were reviewed for the presence of variant alleles.
Of these, 892 samples were found to contain 117 prospective variants at 12 of the 13
CODIS loci. The allele designations for these variants were approximated (see Materials

and Methods), and are included in Table 9.

Table 9: 117 Suspected Variant Alleles
These potential variants were found at 12 of the 13 CODIS loci. Allele designations were approximated by
comparing their bp sizes to the sizes of the alleles found in the ladders.

Locus

 

Suspected Variant \llele

 

 

 

 

 

 

 

 

 

 

 

 

D3Sl358 9, 9.1, 12.3, 13.1, 13.3, 14.1, 14.3, 15.1, 15.3, 16.2,
17.1,17.2, 18.2, 20, 20.1
vWA 13.3, 16.1, 18.3
FGA 14.3, 15.3, 16.1, 16.2, 19.3, 20.1, 20.3, 21.2, 21.3,
22.1, 22.2, 22.3, 23.3, 24.1, 24.2, 24.3, 25.2, 25.3,
31.2, 32.2, 33.2, 34.1, 34.2, 41.2, 42.2, 43.1, 43.2,
44.2, 45.2, 45.3, 46.2, 46.3, 47.2, 47.3, 48.3, 49.3
D881179 6.2, 7.3, 9.2, 12.3, 14.3
D2181] 24.3, 25.3, 27.1, 29.1, 29.3, 30.3, 31.1, 32.1, 33.1,
34.1, 35.1, 36.2
D18S51 7, 11.2, 12.2, 12.3, 13.3, 14.1, 15.2, 16.2, 17.2,
17.3, 18.1, 18.2, 20.2, 20.3, 21.2, 21.3, 27.1, 28.1
D58818 11.1, 12.3, 18
DI3S317 5, 6, 7.1, 10.3, 13.1
D78820 5.2, 6.3, 7.3, 8.1, 8.3, 9.1, 9.3, 10.1, 10.3, 11.1,
113,121, 13.1
TH01 4, 8.3, 13.3
TPOX 7.3, 14
CSFlPO 10.2, 16

 

 

27

 

Gel Electrophoresis Data
A total of 134 samples, which encompassed all observed variant allele sizes, were
re-tested on 3 Proﬁler PlusTM gels and 1 COﬁlerTM gel. Ninety-eight of these samples
contained 85 different conﬁrmed variant alleles (explanations for the other 36 samples
below). Of these 98 samples:
0 66 contained 1 unique variant each for a total of 66 variants;
0 4 contained 2 unique variants each, for a total of 8 variants;
- 22 contained 11 unique variants, as two samples were tested for each of these
variants; and
0 6 contained 5 variants, none of which were unique, as they were included with the
other sample groupings mentioned above.
All variants were observed at heterozygous loci. Fifty-two of the alleles fell amongst the
ladder alleles, while the remaining 33 were smaller or larger than the ladder alleles. Once
a variant was conﬁrmed, all samples with an allele of the same size were grouped with
that variant, although they were not re-tested. Therefore, a total of 757 samples

contained the 85 conﬁrmed alleles (Table 10).

Table 10: Conﬁrmed Variant Alleles
A total of 85 different variant alleles were conﬁrmed at 12 CODIS loci. The sizes are from the DNA
proﬁles of the re-analyzed samples. Some of the variant alleles are listed more than once, as more than one
sample containing the variant allele was tested. Alleles in bold fell outside the allelic ladder and have
extrapolated allele designations.

 
   

Locus Variant \llele Size Illlll

 

D3Sl358 9 102.45
15.1 126.31

16.2 132.58

17.1 135.74

20.1 147.95

vWA 18.3 188.02

 

 

 

 

 

28

 

Table 10 (cont’d)

 

 

 

 

 

FGA 14.3 207.79
15.3 21 1.88
16.1 212.91
16.2 214.02
19.3 227.49
20.1 229.62
20.3 231.61
20.3 231.71
21.1 233.70
22.3 239.93
22.3 239.89
23.3 244.05
24.1 246.21
24.3 248.31
25.1 250.34
25.3 252.19
31.2 274.64
31.2 274.60
32.2 278.49
33.1 281.74
34.1 285.70
34.2 286.32
41.2 314.37
42.2 318.48
43.2 322.56
43.2 322.59
44.2 326.65
44.2 326.63
45.2 330.90
45.2 330.73
45.2 330.82
46.2 334.89
46.2 334.88
47.2 339.05
47.2 339.05
47.2 339.30
48.2 343.45
49.2 347.53

D881 179 7 123.15
D2181] 24.3 189.99
25.3 194.09

27.1 200.19

29.] 208.37

29.3 210.55

29.3 210.54

30.3 214.60

31 . 1 216.59

3 l .1 216.60

32.1 220.70

33.1 224.73

33.1 224.83

33.1 224.79

34.1 228.89

35.1 233.01

 

29

 

Table 10 (cont’d)

D2181] 35.1 232.95

 

 

 

 

 

 

 

 

@ont’d) 36.1 237.06
018851 7 266.53
1 1.2 294.09

12.2 288.06

13.3 293.18

15.2 299.92

16.] 303.01

16.2 303.85

17.2 308.06

17.3 309.05

18.1 31 1.10

20.2 320.17

21.2 324.37

21.2 324.37

27 347.35

28.1 351.52

D58818 12.3 158.72
18 179.32

Dl38317 5 195.09
6 199.16

7.1 204.27

D78820 5.2 257.78
6.3 262.56

8.1 268.33

8.3 ' 270.34

9.1 272.26

9.1 272.25

9.3 274.15

9.3 274.17

10.1 276.10

10.3 278.01

1 1 .1 280.02

1 1.3 281.89

12.1 283.75

12.1 283.85

13.1 287.66

TH01 4 165.61
8.3 184.63

13.3 204.89

TPOX 7.3 225.67
14 251.00

CSFlPO 10.2 298.86
16 320.64

 

 

 

 

 

30

The calculation or extrapolation of the conﬁrmed allele designations (see
Materials and Methods) led to minor adjustments of 29 inaccurate designations (Table 9
above). Eighteen of these were combined together as 9 alleles. The suspected 16.2
microvariants at the D1885] locus became conﬁrmed 16.] and 16.2 microvariants. The

remaining 10 changed by a matter of l — 2bp after re-analysis (Table 11).

Table 1]: Adjusted Variant Allele Designations

Re-testing of samples led to several adjustments of the suspected allele designations. Two values in the
suspected variant allele column indicate designations that were combined as one conﬁrmed variant. Two
values in the conﬁrmed variant allele column indicate designations that resulted from the division of a
single suspected variant grouping. Single values in both columns indicate the designation changed after
being calculated or extrapolated.

 

 

 

 

 

 

Locus Suspected Variant ('onl’irmerl
.\|lelels1 \ :lritmt .\llelelsl
D3Sl358 9, 9.1 9
20, 20.1 20.1
FGA 21.2 21.1
24.2 24.1
25.2 25.1
33.2 33.1
43.1, 43.2 43.2
45.2, 45.3 45.2
46.2, 46.3 46.2
47.2, 47.3 47.2
48.3 48.2
49.3 49.2
D8Sll79 6.2 7
D2181] 36.2 36.1
D1885] 12.2, 12.3 12.2
16.2 16.1,16.2
18.2 18.1
20.2, 20.3 20.2
21.2,21.3 21.2
27.1 27

 

 

 

 

A total of 24 suspected variant alleles were not off-ladder after re-ampliﬁcation
and re-analysis because they were alleles that were then recognized by the Genotyper®
software (Table 12). These 24 were found in the 36 samples that were re-tested, and 135

of the samples originally recorded. Although only these 36 samples were re-tested, the

31

original DNA proﬁles for all 135 were reviewed to verify that none contained true variant
alleles. It was determined that even if all of these samples had been re-tested, none
would have contained true variant alleles. Detailed explanations for why these alleles

were originally off-ladder are found in the Discussion.

Table 12: Suspected Variant Alleles That Were Not Confirmed

A total of 24 suspected variant alleles were not conﬁrmed after re-am liﬁcation and re-analysis. The
observed sizes (or ran 6 of sizes if >1 tested) are from their original D A proﬁles. The common allele
column refers to the alle e that was called when the sam les were re-analyzed. Common allele bp sizes are
mean values from the Proﬁler PlusTM User Manual. The number of samples containing these suspected

alleles ranged from 1 to 35, and testing was performed between 2 and 5 times for each suspected allele.
Suspected allele was pull up peak from D58818 locus (see Discussion).

 

 

 

 

 

 

 

 

 

Locus Suspected Observed bp Common Common Total 'l'ilnes
Variant Range/Size Allele Allele Size Found Re-testetl
.-\llele (bp)
D3Sl358 12.3 1 17.68 — 117.78 13 118.24 5 3
13.1 118.37 13 118.24 1 1
13.3 121.63 — 121.96 14 122.27 35 3
14.1 122.36 — 122.53 14 122.27 5 1
14.3 125.45 — 125.95 15 126.34 31 4
15.3 129.97 — 130.22 16 130.45 13 2
17.2 136.70 17 134.59 1 1
18.2 139.97 - 140.46 18 138.75 2 2
vWA 13.3 168.36 14 168.88 1 1
16.1 177.37 16 176.76 1 1
FGA 21 .3 236.32 22 236.48 1 1
22.1 237.49 22 236.48 1 1
22.2 238.64 23 240.60 1 2
24. 1 245.79 24 244.75 1 1
D881179 7.3 126.65 8 127.06 2 2
9.2 132.43 9 131.17 1 1
1232| 147.86 13 148.58 1 1
14.3 155.78 - - 1 1
D1885] 14.1 294.48 14 293.90 1 1
18.1 310.48 18 309.80 1 1
D58818 11.1 152.43 11 151.80 1 1
D13S3l7 10.3 219.21 11 219.54 1 1
13.1 227.87 — 227.99 13 227.63 20 1
D78820 7.3 266.73 - 266.76 8 267.18 3 3

 

 

 

 

 

 

 

32

 

Among the 135 samples that did not contain variants were 4 that were originally
grouped with microvariants that were conﬁrmed using other samples. Initially, a total of
14 samples were grouped with the 23.3 variant at FGA while 8 were grouped with the 9.3
microvariant at D78820. After re-testing 2 samples containing 23.3 and 3 containing 9.3,
1 of each did not contain off-ladder alleles. All of the original DNA proﬁles for the 22
samples grouped with these alleles were reviewed to determine if any problems had
occurred when the samples were initially tested. The two samples, as well as an
additional 2 grouped with the 9.3 microvariant had experienced anomalies during
electrophoresis, and were erroneously included in this study. As a result, a total of 13
samples remained grouped with the 23.3 microvariant, while 5 were grouped with the 9.3

microvariant. Further explanation on these anomalies is found in the Discussion.

Off-site SGM PlusTM Analysis

Electropherograms for 11 of the 16 samples submitted for SGM PlusTM analysis
were obtained from NIST. Eight of these contained alleles found in the SGM PlusTM
FGA ladder. These 8 are considered to be variants for the purposes of this study, as they
are not found in the FGA ladder of the Proﬁler PlusTM kit. The other 3 samples
contained microvariants not found in the SGM PlusTM FGA ladder, and their allele
designations were determined at NIST (Table 13). All of these allele designations
matched those approximated using the Proﬁler PlusTM data, indicating that they were

extrapolated correctly.

33

Table 13: Conﬁrmed FGA Variants - SGM PlusTM Analysis

A total of 11 samples tested at NIST using the SGM PlusTM kit yielded variant alleles at the FGA locus.
These alleles ranged from 31.2 to 48.2 repeats. The bp sizes ranged from 272.88bp and 343.3lbp and were

measured using an AB] 3100 Genetic Analyzer. 3 Allele is found in the SGM PlusTM FGA ladder.

Variant \llele Variant \lleleSizerllpl

 

31.2“ 272.88
32.2a 277.02
33.1 280.28
34.1 284.35
34.2 285.12
42.25' 319.50
43.2“ 323.79
44.2a 327.92
45.23 331.93
47.23 339.78
43,2a 343.31

 

 

 

 

The remaining 5 samples submitted to NIST did not yield useable data under the
PCR conditions used, and results were not received. These 5 included 2 variants that
were below allele 18, the smallest in the Proﬁler PlusTM FGA ladder, and 3 variants that
were above allele 30, the largest in the Proﬁler PlusTM FGA ladder (Table 14). Of these,
only the 46.2 microvariant is included in the SGM PlusTM ladder. Without SGM PlusTM

veriﬁcation, it was assumed that the allele designations were extrapolated correctly.

Table 14: Extrapolated FGA Variant Allele Designations

These 5 variant allele designations were extrapolated using the sizes from the Proﬁler PlusTM gel runs.
They ranged from 16.1 repeats to 49.2 repeats, with hp sizes ranging from 212.91 — 347.53. The 46.2

microvariant is included twice, and size data from both runs are included. a Allele is found in the SGM

PlusTM FGA ladder.

 

 

 

16.] 212.91
16.2 214.02
41.2 314.37
46.2“ 334.88
46.23 334.89
49.2 347.53

 

 

34

Classiﬁcation of Conﬁrmed Variant Alleles

The 85 variant alleles were sorted into 5 different categories based on their sizes:

X.1 microvariants, X.2 microvariants, X.3 microvariants, variant alleles below the ladder,

and variant alleles above the ladder (TableslS — 19). The latter two categories included

their own X. 1, X2 and X.3 microvariants. The totals for each category were:

Twenty-two X.1 microvariants in 182 samples;
Nine X.2 microvariants in 97 samples;
Twenty—one X.3 microvariants in 134 samples;
Twelve below ladder variants in 99 samples; and

Twenty-one above ladder variants in 245 samples.

35

Table 15: X.1 Microvariant Alleles
A total of 22 variant alleles in 182 samples were categorized as X.1 microvariants. The number in
parentheses in the Locus column indicates the total number of X.1 microvariants observed at that locus.
The size range and average size of each microvariant were determined using the allele sizes from both the
original DNA proﬁles and the DNA proﬁles generated after being re-tested. The number of samples

. . . . a .
conlalnlng each mlcrovanant range from 1 to 76. Allele has a published repeat structure.

Locus _\licro\:ll‘lant ()1)\t‘l‘\t‘(1.‘l/L‘ \\ 2.81m 4‘1 Found ()l‘iuillal

 
 

1 13ml 15‘ will Range ll)|)1 ll)|)l Reference

 

 

 

 

 

 

 

 

 

 

 

 

 

 

l27.25 — 127.30 1 -
135.69 — 135.85 3 SGM PlusTM
FGA 20.1 229.56 — 229.62 229.59 l SGM PlusTM
(4) 21.1 233.70 — 233.82 233.76 1 -
24.1 246.02 — 246.21 246.] l 5 SGM PlusTM
25.1 250.24 — 250.34 250.29 1 SGM PlusTM
D2181] 27.13 200.09 — 200.19 200.15 5 STRBase
(8) 29.1 208.37 — 208.44 208.40 I SGM PlusTM
31.1 216.59 — 2l6.6l 2l6.60 2 SGM PlusTM
32.1 220.68 - 220.70 220.69 4 SGM PlusTM
33.l 224.69 — 224.88 224.80 76 Proﬁler PlusTM
34.1 228.84 — 229.0l 228.90 20 Proﬁle: plusm
35.1 232.89 — 233.05 232.96 1 1 STRBase
36.1 237.01 — 237.06 237.03 1 STRBase
D1885] 16.1 — 303.01 l - ‘
(2) 18.1 311.01 —3ll.10 311.04 3 ProﬁlerPlusTM
D7S820 8.1 268.32 — 268.36 268.34 2 STRBase
(6) 9.1 272.14 — 272.46 272.28 11 STRBase
10.1 275.93 — 276.20 276.07 17 STRBase
l 1.1 279.85 — 280.08 279.97 9 STRBase
12.1 283.74 — 283.85 283.77 3 STRBase
13.1 287.66 — 287.78 287.73 4 STRBase

 

Table 16: X.2 Microvariant Alleles
A total of 9 variant alleles in 97 samples were categorized as X.2 microvariants. The number in
parentheses in the Locus column indicates the total number of X.2 microvariants observed at that locus.
The size range and average size of each microvariant were determined using the allele sizes from both the
original DNA proﬁles and the DNA proﬁles generated after re-analysis. The number of samples containing

. . a .
each mlcrovanant range from 1 to 41. Allele has a publlshed repeat structure.

   

   

Locus Microvariant Observed Size Avg. Size # Found Original Reference

    

 

 

      
       

     

 

 

(Total Found) Range lbp) (bp)
D3Sl358 (1) 16.2 132.58 — 132.74 132.67 4 Budowle er al. (1997)
D18S51 l 1.2 284.09 — 284.24 284.17 5 STRBase
(7) 122,, 287.94 — 288.28 288.11 8 SGM PlusTM
15.2‘ 299.75 — 299.92 299.85 4] Barber and Parkin (1996)
1923 303.85 * 303.94 303.89 1 SGM PlusT'M
17.2 307.86 — 308.06 307.96 2 Gill er al. (1996)
20.2 320.05 7 320.30 320.16 11 SGM PlusTM
21.2 324.03 — 324.44 324.23 24 SGM PlusTM
CSFIPO (1) 10.2 — 298.86 1 STRBase

 

 

 

 

 

 

 

36

Table 17: X.3 Microvariant Alleles

A total of 21 variant alleles in 134 samples were categorized as X.3 microvariants. The number in
parentheses in the Locus column indicates the total number of X.3 microvariants observed at that locus.
The size range and average size of each microvariant were determined using the allele sizes from both the
original DNA proﬁles and the DNA proﬁles generated after re-analysis. The number of samples containing

. . a .
each m1crovanant range from 1 to 29. Allele has a publlshed repeat structure.

 

 

 

 

 

 

 

 

Locus \licroulriant ()1).\t‘1‘\ ed Si/e .\\'2. Size #1 Found Original Reference
(1.11.11 Found) Runue 11)]1) (1)1)!
vWA (1) 18.3 - 188.02 1 SGM PlusTM
FGA 19.3 227.47 — 227.53 227.50 2 SGM PlusTM
(6) 20.3 231.59 - 231.71 231.65 3 SGM PlusTM
22.3 239.81 — 240.12 239.93 8 Gill et al. (1996)
23.3 243.91 — 244.27 244.09 13 SGM PlusTM
24.3 248.17 - 248.35 248.24 7 SGM PlusTM
25.3 252.11 — 252.57 252.27 7 SGM PlusTM
021511 24.3 189.93 - 190.07 190.02 29 Proﬁler PlusTM
(4) 25.3 194.05 — 194.11 194.08 2 Proﬁler PlusTM
29.3 210.43 - 210.62 210.54 15 Proﬁler PlusTM
30.3 214.60 — 214.68 214.65 4 Proﬁler PlusTM
D1885] 13.3 293.03 - 293.18 293.11 4 -
(2) 17.3 308.97 — 309.05 309.00 2 SGM PlusTM
D5S818 (1) 12.3 158.72 - 158.76 158.74 2 - '
D78820 6.3 262.44 — 262.56 262.51 7 Proﬁler PlusTM
(5) 8.3 270.28 - 270.34 270.31 1 STRBase
9.3 274.10 — 274.20 274.14 5 STRBase
10.3 277.94 - 278.15 278.03 16 STRBase
11.3 281.89 — 281.98 281.92 2 STRBase
TH01 (1) 8.321 184.59 - 184.72 184.64 3 Brinkmann et al. (1996)
W (1 ) 7.3 225.65 - 225.67 225.66 1 -

 

 

 

 

 

 

 

microvariant range from 1 to 50.

 

Table 18: Variant Alleles Below the Ladder

A total of 12 variant alleles in 99 samples were smaller than alleles in the ladder. The number in
parentheses in the Locus column indicates the total number of these variants observed at that locus. The
size range and average size of each variant were determined using the allele sizes from both the original
DNA proﬁles and the DNA proﬁles agenerated after re-analysis. The number f samples containing each

Allele has a published repeat structure.
PlusTM ladder.

Allele is found in SGM

 

 

 

 

 

 

 

Locus Valrialllt 01)er ed Size Me. Site it Found Original Reference
(101.11 Found) \llele Range (hp) (11p)
0381358 (1) 9 102.36 — 102.60 102.46 50 Proﬁler PlusTM
FGA 14.3 207.79 — 207.87 207.83 2 -
(4) 1533 211.87-211.92 211.89 3 -
16.1 212.91 — 213.10 213.02 21 Grifﬁths et al. (1998)
16.23 213.96 - 214.20 214.09 8 Proﬁler Plusm
0881179 (1) 73 122.98 — 123.15 123.05 2 Grifﬁths et al. (1998)
018851 (1) 7b 266.45 - 266.53 266.49 2 SGM PlusTM (Ladder)
0138317 5 194.99 — 195.09 195.05 2 Proﬁler Plus“
(3) 6 199.10 — 199.16 199.12 2 STRBase
_ 7.1 204.27 — 204.34 204.32 3 STRBase
D7S820 (1 ) 5.2 257.77 - 257.78 257.77 1 —
T1101 (1) 43’ b 165.61 — 165.66 165.63 3 Grifﬁths et al. (1998)

 

 

 

 

 

 

 

 

37

Table 19: Variant Alleles Above the Ladder
A total of 21 variant alleles in 245 samples were larger than the alleles in the ladder. The number in
parentheses in the Locus column indicates the total number of above ladder variants observed at that locus.
The size range and average size of each variant were determined using the allele sizes from both the
original DNA proﬁles and the DNA proﬁles generated after re-analysis. The nu ber of samples containing

each microvariant range from I to 41. a Allele has a published repeat structure.
PlusTM ladder.

Allele is found in SGM

 

 

 

 

 

 

 

Locus Valriunt Observed Size .\vu. Size fr‘ Found ()rigillall Reference
< 1111.11 Found) .\llele Range (hp) (111))
D3Sl358 (1 ) 20.1 147.86 — 148.09 147.92 40 STRBase
FGA 31.23» b 274.35 — 274.71 274.55 41 Grifﬁths et al. (1998)
('4) 32 23. b 278.35 — 278.59 278.46 1 1 Grifﬁths er al. (1998)
3‘“ 281.70 — 281.79 281.74 2 -
3“ 285.64 - 285.70 285.67 2 -
34.23 286.32 - 286.42 286.36 2 Barber er al. (1996)
41.2 314.31 —314.41 314.35 5 -
4229 b 318.28 — 318.60 318.37 21 Grifﬁths et al. (1998)
4328. b 322.32 — 322.65 322.47 31 Grifﬁths et al. (1998)
442a. b 326.40 — 326.79 326.55 28 Grifﬁths er al. (1998)
45.22). b 330.53 — 330.94 330.77 22 Grifﬁths er al. (1998)
462a, b 334.64 — 335.09 334.86 18 Barber 8! at (1996)
47221. b 338.85 — 339.49 339.16 8 Grifﬁths et al. (1998)
482a, b 343.11 — 343.45 343.29 3 Grifﬁths et al. (1998)
49.2 347.53 - 347.54 347.53 1 SGM Plusm
018851 273’ b 347.33 — 347.35 347.34 3 Barber and Parkin (1996)
(2) 2&1 351.48 — 351.52 351.50 1 -
058818 (1) 18 179.32 — 179.39 179.35 2 -
T1101 (1) 1333' b 204.89 — 204.93 204.92 2 Gene et al. (1996)
TPOX (1) 142‘ 250.84 — 251.00 250.92 1 Huang et al. (1995)
CSFlPO (1) 16“ 320.56 — 320.64 320.60 1 Margolis-Nunno er al. (2001)

 

 

 

 

 

 

 

 

Submission to STRBase Website

The 85 conﬁrmed variant alleles were sorted into three categories before being added
to the STRBase database. Twenty-four variant alleles at 8 loci were grouped under category
1, which included those not already listed on the STRBase website and not included in any of
the allelic ladders used by Orchid GeneScreen. Twenty-six variant alleles at 8 loci were
grouped under category 2, which included those already listed on the STRBase website that
were electrophoresed using an instrument other than an AB] 377, and that were not included

in any of the allelic ladders used by Orchid GeneScreen. The remaining 35 variants at 9 loci

38

were grouped under category 3, which included those already listed on the STRBase website
that were electrophoresed using an ABI 377, or that were included in any of the allelic
ladders used by Orchid GeneScreen (Table 20). A11 50 of the category 1 and category 2

variant alleles were submitted and are now listed on the STRBase website.

Table 20: STRBase Categorization of Variant Alleles

Twenty-four of the variant alleles were included in category 1, 26 alleles were included in category 2 and
35 alleles were included in category 3.

  
   

   

Locus Category 1 Category 2 (.klttugtil'} 3

   

   

 

 

 

 

 

 

 

 

 

 

 

 

 

D3S1358 9
20.1 16.2
vWA - - 18.3
FGA 14.3 19.3 16.1
15.3 23.3 22.3
16.2 24.1 24.3
20.1 25.1 25.3
20.3 31.2
21.1 32.2
33.1 42.2
34.1 43.2
34.2 44.2
41.2 45.2
49.2 46.2
47.2
48.2
D8Sl 179 7 - -
D21S11 25.3 24.3 29.1
30.3 27.1 31.1
29.3 34.1
32.1 35.1
33.1
36.1
018851 13.3 11.2 7
16.1 18.1 12.2
17.2 20.2 15.2
17.3 16.2
28.1 21.2
27
D58818 12.3 - -
18
D13S3l7 - 6 5
7.1
D78820 5.2 8.3 6.3
9.3 8.1
10.1 9.1
10.3 11.1
11.3 13.1
12.1
TH01 - 8.3 4
13.3
TPOX 7.3 - 14
CSFlPO - 10.2 -
16

 

 

 

 

 

 

39

Racial Group Data andviariant Allele Frequencies

The samples containing conﬁrmed variants were sorted by racial group prior to
calculating the allele frequencies. African Americans had the highest number of samples
containing variants, with a total of 511. This comprised 67.5% of the 757 samples
containing variant alleles and 1.6% of the 32,671 proﬁles reviewed. One hundred forty-
eight Caucasian samples contained variant alleles, representing 19.6% of the 757
containing variants and 0.45% of the total proﬁles reviewed. The 63 Hispanic samples
comprised 8.3% of the 757 containing variants and 0.19% of the total proﬁles reviewed.
Thirty-four samples from the Other racial category contained variants, representing 4.5%
of the 757 containing variants and 0.10% of the total proﬁles reviewed. For each group,
observed variant allele frequencies were calculated and are listed with the number of

occurrences in Table 21.

40

Locus

Table 21: Variant Allele Observations and Frequencies
The total number of variant allele observations per locus is in bold face. The number of observations per
racial group is included with the corresponding observed allele frequency in parentheses (given as a
percentage). Frequencies were calculated based on the number of observations in each group and the total
number of allele occurrences at each locus (approximated as 2N). The totals for all variant allele
observations are included at the bottom of each racial group column.

Variant
Allele

Total
Found

Caucasian
N = 16.257

African
American
N = 14.161)

Hispanic
.V = 1895

Other
N = 359

 

 

 

 

 

 

 

 

 

 

 

0381358 9 50 2 (0.006151 ) 46 (0.162429) - 2 (0.278552)
15.1 1 - 1 (0.003531) - -
16.2 4 - 4 (0.014124) - -
17.1 3 3 (0.009227) - - -
20.1 40 28 (0.086117) 4 (0.014124) 5 (0.131926) 3 (0.417827)
vWA 18.3 1 1 (0.003076) - - -
FGA 14.3 2 2 (0.006151) - - -
15.3 3 2 (0.006151) - - 1 (0.139276)
16.1 21 - 21 (0.074153) - -
16.2 8 - 8 (0.028249) — -
19.3 2 2 (0.006151) - - -
20.1 1 1 (0.003076) - - -
20.3 3 - 2 (0.007062) 1 (0.026385) -
21.1 1 1 (0.003076) - - -
22.3 8 - 6 (0.021186) 2 (0.052770) -
23.3 13 - 12 (0.042373) 1 (0.026385) -
24.1 5 - 5 (0.017655) - -
24.3 7 1 (0.003076) 6 (0.021186) — -
25.1 1 - - 1 (0.026385) -
25.3 7 - 7 (0.024718) - -
31.2 41 37 (0.113797) - - 4 (0.557103)
32.2 11 1 (0.003076) 10 (0.035311) - -
33.1 2 - 2 (0.007062) — -
34.1 2 - 2 (0.007062) - -
34.2 2 - 2 (0.007062) - -
41.2 s - 3 (0.010593) 1 (0.026385) 1 (0.139276)
42.2 21 - 20 (0.070621) 1 (0.026385) -
43.2 31 1 (0.003076) 30 (0.105932) - -
44.2 28 2 (0.006151) 26 (0.091808) - -
45.2 22 - 18 (0.063559) 4 (0.105541) -
46.2 18 - 15 (0.052966) 2 (0.052770) 1 (0.139276)
47.2 8 1 (0.003076) 7 (0.024718) - -
48.2 3 - 3 (0.010593) - -
49.2 1 - 1 (0.003531 ) - -
0881179 7 2 - 2 (0.007062) - -
021811 24.3 29 - 23 (0.081215) 2 (0.052770) 4 (0.557103)
25.3 2 - 2 (0.007062) - -
27.1 5 - 5 (0.017655) - -
29.1 1 1 (0.003076) — - -
29.3 15 8 (0.024605) 7 (0.024718) - -
30.3 4 - 3 (0.010593) 1 (0.026385) -

 

 

 

Locus

Variant

Table 21 (cont’d)

Caucasian

African

Hispanic

()tller

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Allele N = 16.257 American .\' = 1895 .\' = 359
.V = 14.160
D2181] 31.1 - - 2 (0.052770) -
(cont’d) 32.1 2 (0.006151) 2 (0.007062) - -
33.1 6 (0.018454) 60 (0.211864) 4 (0.105541) 6 (0.835655)
34.1 3 (0.009227) 16 (0.056497) 1 (0.026385) -
35.1 11 - 11 (0.038842) - -
36.1 1 - 1 (0.003531) - -
D1885] 7 2 - - 1 (0.026385) 1 (0.139276)
1 1.2 5 - 5 (0.017655) - -
12.2 8 - 8 (0.028249) - -
13.3 4 - 4 (0.014124) - -
15.2 41 2 (0.006151) 36 (0.1271 19) 3 (0.079156) -
16.1 1 - - - 1 (0.139276)
16.2 1 - 1 (0.003531) - -
17.2 2 - - - 2 (0.278552)
17.3 2 - - 2 (0.052770) -
18.1 3 3 (0.009227) — - -
20.2 11 - 10 (0.035311) - 1 (0.139276)
21.2 24 1 (0.003076) 22 (0.077684) - 1 (0.139276)
27 3 1 (0.003076) - 2 (0.052770) -
28.1 1 - - 1 (0.026385) -
D58818 12.3 2 - 2 (0.007062) - -
18 2 - 2 (0.007062) - -
D13S317 5 2 1 (0.003076) 1 (0.003531) - -
6 2 2 (0.006151) - - -
7.1 3 - - 3 (0.079156) -
D78820 5.2 1 - 1 (0.003531) - -
6.3 7 7 (0.021529) - - -
8.1 2 - - 2 (0.052770) -
8.3 l - 1 (0.003531) - -
9.1 11 6 (0.018454) 2 (0.007062) - 3 (0.417827)
9.3 5 - 3 (0.010593) 1 (0.026385) 1 (0.139276)
10.1 17 3 (0.009227) 12 (0.042373) 2 (0.052770) -
10.3 16 1 (0.003076) - 15 (0.395778) -
11.1 9 8 (0.024605) 1 (0.003531) - -
11.3 2 - 2 (0.007062) - -
12.1 3 3 (0.009227) - - -
13.1 4 1 (0.003076) 2 (0.007062) 1 (0.026385) -
TH01 4 3 - 2 (0.007062) 1 (0.026385) -
8.3 3 3 (0.009227) - - -
13.3 2 2 (0.006151) - — -
TPOX 7.3 l - - 1 (0.026385) -
14 l - 1 (0.00353 D - -
CSFIPO 10.2 1 - - - 1 (0.139276)
16 1 - - - 1&139276)
TOTALS - 757 148 511 63 34

 

 

Variant Alleles Added to Orchid GeneScreen Allele Frequency Database

A total of 28 variant alleles were found in ﬁve or more samples, regardless of
racial group, and were selected for inclusion in the Orchid GeneScreen allele frequency
database (Appendix A). These were observed in a total of 521 samples, comprising
68.8% of the 757 samples containing variant alleles. Not surprisingly, each of these
alleles was found at the Proﬁler PlusTM loci, as the laboratory uses the COﬁlerTM kit less
frequently (Table 22). Variant alleles found in fewer than 5 samples were considered too
rare for inclusion in the database, as it was unlikely these variant alleles would be

encountered in DNA proﬁles on a regular basis.

Table 22: 28 Variant Alleles Incorporated into Orchid GeneScreen Database
These 28 variant alleles were selected for inclusion in the Orchid GeneScreen allele frequency database.
The size range of each variant was determined using the allele sizes from both the original DNA proﬁles
and the DNA proﬁles generated after re—analysis. The number of samples containing each variant allele

ranged from 5 to 76.

 

 

 

 

 

Locus \ ariant \llele Size Range (11p) 1111.11 i‘llllllti

D3Sl358 9 102.36 - 102.60 50
20.1 147.86 - 148.09 40

FGA 16.1 212.91-213.10 21
16.2 213.96 - 214.20 8

22.3 239.81 — 240.12 8

23.3 243.91 - 244.27 13

24.1 246.02 — 246.21 5

24.3 248.17 — 248.35 7

25.3 252.11 — 252.57 7

31.2 274.35 - 274.71 41

32.2 278.35 — 278.59 11

D2181] 24.3 189.93 - 190.07 29
27.1 200.09 —- 200.19 5

29.3 210.43 — 210.62 15

33.1 224.69 - 224.88 76

34.1 228.84 - 229.01 20

35.1 232.89 - 233.05 11

D1855] 1 1.2 284.09 - 284.24 5
12.2 287.94 — 288.28 8

15.2 299.75 - 299.92 41

20.2 320.05 — 320.30 1 1

21.2 324.03 — 324.44 24

D78820 6.3 262.44 — 262.56 7
9.1 272.14 — 272.46 11

9.3 274.10 - 274.20 5

10.1 275.93 - 276.20 17

10.3 277.94 — 278.15 16

1 1.1 279.85 - 280.08 9

 

 

 

 

 

 

43

Seven FGA variants were not added to the database, even though they were
observed in 5 or more samples. These seven included 41.2, 42.2, 43.2, 44.2, 45.2, 46.2
and 47.2. All but the 41.2 and 46.2 microvariants were conﬁrmed using the SGM PlusTM
kit and the 41.2 microvariant is the only one not found in its ladder. Each of these alleles
was more than 44bp larger than allele 30, the largest in the Proﬁler PlusTM FGA ladder,
and only those within 12bp of the highest and lowest ladder alleles were added to the
database. This was done because as the variants became larger than allele 30, their
observed size ranges expanded, and the chance for extrapolation errors increased. If
necessary, future samples containing these 7 FGA variants can be tested using the SGM
PlusTM kit, as Orchid GeneScreen now uses it on a regular basis for additional loci. ‘

Due to the ambiguity of the Other racial group, only data from the African
American, Caucasian and Hispanic racial groups were added to the database. The variant
allele observations were roughly halved, as the number of individuals in the Orchid
database was nearly half the number of individuals reviewed for this investigation. These
values were rounded to the nearest whole number, while values that were <0.5 were
rounded to 1 (Table 23).

The allele frequencies were re-calculated using the adjusted observations and are
included in Table 23. The minor disparities that were evident between the original
variant allele frequencies (Table 21, above) and the re-calculated frequencies were
attributed to rounding error. For instance, microvariant allele 29.3 at the D21811 locus
was observed 7 times in the African Americans reviewed for this project, with an

observed allele frequency of 0.00024718. The calculated number of observations for

African Americans in the database was 4, with a re-calculated allele frequency of
0.00026023 and a rounding error of 0.00001305.

Thirty-seven of these variant alleles were observed 5 times or fewer in
Caucasians, African Americans and Hispanics, and their frequencies were smaller than
the conservative 5/2N’ values (Table 23). Therefore, these 5/2N’ values, and not the
observed frequencies would be used for CPI calculations. The modiﬁed Orchid database,

which includes the 5/2N’ values is found in Appendix B.

45

Table 23: 28 Adjusted Variant Allele Frequencies and Observations
The total number of recalculated variant allele observations per locus is in bold face. The number
of occurrences per racial group is included, with the corresponding allele frequency in parentheses (given
as a percentage). N’ indicates the number of individuals in the database from each race that had genotypes
for a given locus. The 11’ value indicates the total number of alleles observed at the locus. The 5/2N’ value
(given as a percentage) indicates the minimum allele frequency that can be used for CPI calculations.

a Frequency is smaller than the 5/2N’ value.

 

 

 

 

 

 

 

 

 

 

 

0381358 Total Caucasian African American Hispanic
Variants Found N’ = 7636 N’ = 7602 N’ = 690
n’ = 15274 n’ = 15244 n’ = 1383
5/2N’ = 0.0327 5/2N’ = 0.0329 5/2N’ = 0.362
9 26 1 (0.00654707) 3 25 (0.163999)a 0.0 a
20.1 17 8&085112) 2(0.0l31199) 2(2144613)
FGA Total Caucasian African American Hispanic
Variants Found N’ = 7674 N’ = 7419 N’ = 939
n’ = 15368 n’ = 14947 n’ = 1879
5/2N’ = 0.0326 5/2N’ = 0.0337 5/2N’ = 0.266
16.1 11 0.0 11 (00735934; 0.0
16.2 4 0.0 4 (0.0267612) a 0.0 a
22.3 4 0.0 3 (0.0200709) 1 (00532198) 3
23.3 7 0.0 6 (00401418)a 1 (0.0532198)
24.1 3 0.0 3 (0.0200709) a 0.0 '
24.3 4 1 (000650703) 8 3 (0.0200709) a 0.0
25.3 4 0.0 4 (0.0267612) 00
31,2 17 17 (0.1106195)a 0.0 a 0.0
32.2 6 l (0&0650703) 5 (0.0334515) 0.0
D2181] Total Caucasian African American Hispanic
Variants Found N’ = 7730 N’ = 7652 N’ = 683
n’ = 15467 11’ = 15371 n’ = 1369
5/2N’= 0.0323 5/2N’= 0.0327 5/2N’ = 0.366
24.3 13 0.0 ]2 (00780692)! 1 (0.073046) 3
27.1 3 0.0 a 3 (0.019517) 0.0
33.1 36 3 (0.019396) 8 32 (0.203134) 1 (0.073046)
34.1 11 1 (0.0064654) 910058552) 1 10.07.3046)"
35.1 _ 6 0.0 6 (1)-039035) 0.0
D18S51 Total Caucasian African American Hispanic
Variants Found N’ = 7628 N’ = 7463 N’ = 647
n’ = 15258 n’ = 14974 n’ = 1296
5/2N’ = 0.0328 5/2N’ = 0.0335 5/2N’ = 0.386
11.2 3 0.0 3 (0.0200347) “‘ 0.0
15.2 21 I (0.0065539) 19 (0.1268866) 1 (0.0771605)
20-2 5 0-0 a 5 (0.0333912) 00
21.2 13 1 (0.0065539) 12 (0.0801389) 0.0
D7 S820 Total Caucasian African American Hispanic
Variants Found N’ = 7685 N’ = 7612 N’ = 939
n’ = 15376 n’ = 15235 n’ = 1887
5/2N’ = 0.0325 5/2N’ = 0.0328 5/2N’ = 0.266
6.3 3 3 (0.01951 1) a 0-0 a 0.0
9.1 4 3 (0.01951 1) a 1 (0.0065638)a 0.0 a
9.3 3 0.0 a 2 (0.013128) 1 (0.0529942) a
10.1 8 1 (0.0065036) a 6 (0.039383) 1 (0.0529942)
10.3 8 1 (0.0065036) 0.0 a 7 (0.3709592)
11.1 5 4 (0026015) 3 1 (0.0065638) 0.0

 

46

DISCUSSION

Overview

The goal of this investigation was to gather information on rare, variant alleles
that differ from those found in the allelic ladder by lbp or more. The use of these alleles
in paternity calculations may decrease the need for additional testing, and provide further
support that the correct man has been identiﬁed as the biological father. Eighty-ﬁve
different variant alleles at 12 of the 13 CODIS loci were conﬁrmed during the course of
this investigation. Twenty-eight of these were observed a minimum of 5 times, and were
added to the Orchid GeneScreen allele frequency database. They represent 68.8% of all
samples containing a variant, and approximately 1.6% of the 32,671 DNA proﬁles
reviewed for this investigation. This indicates that an estimated 2,160 samples out of the
135,000 tested annually will contain variant alleles that can now be used in CPI

calculations.

X.1 Microvariant Alleles

A total of 22 microvariant alleles from 182 samples were categorized as X.1
microvariants, which were lbp larger than the smaller, adjacent allele. All of these were
found within the Proﬁler PlusTM ladder. These represented 24% of samples containing
variant alleles, and 0.6% of all DNA proﬁles reviewed for this investigation. The X.1
microvariants were found at the D38 1358, FGA, D21S11, D18S51 and D7S820 loci
(Table 15, above). Currently, none of the allelic ladders used by Orchid GeneScreen

contain X.1 microvariants.

47

Sequence analysis is necessary to determine the type of mutation present and
whether it occurred within the repeat region or in the sequences that flank the repeat.
Previous research has identiﬁed a mutational mechanism resulting in some X.1
microvariants. According to Egyed et al. (2000), X.1 microvariants at the D78820 locus

can be caused by variability in a poly-T region located in the 3’ flanking region, 13 bases

downstream of the core [GATA]n repeat. The D78820 ladder alleles contain 9T8 in this

region, while X.1 microvariants have a T insertion (Figure 4).

 

 

 

Allele Designation 5’ Flanking Repeat 3’ Flanking Region
Region Sequence
9 - - [GATA]9 - - - (T)9ATCT - _
9.1 - - [GATA]9 - - - (T)10ATCT - —

 

From Egyed er al. (2000)
Figure 4: Example of an X.1 Microvariant at the D78820 Locus
Sequence structure of allele 9 and microvariant allele 9.1 at the D78820 locus. Both the 9 and 9.1 alleles
contain the same repeat sequence. The additional bp arises from the insertion of a single T in the 3’
ﬂanking region of the 9.1 microvariant. Allele 9 contains 9T8, while microvariant allele 9.1 contains lOTs.
The dark blocks of different size represent non-repeated DNA sequences.

X.2 Microvariant Alleles

A total of 9 alleles from 97 samples were categorized as X.2 microvariants, which
were 2bp larger than the smaller, adjacent allele. All of these were sized amongst the
alleles in Proﬁler PlusTM or COﬁlerTM ladders. They represented 13% of samples
containing variant alleles, and 0.3% of all DNA proﬁles reviewed for this investigation.
Three different loci contained X.2 microvariants within their ladder ranges: D381358,
D1885] and CSFlPO (Table 16, above). The ladders for the highly polymorphic FGA,
D21811 and D18851 loci contain several X.2 microvariants. All 9 of these microvariants

have been previously reported. Seven were found at the D1885] locus, which has a

48

 

simple tetranucleotide repeat motif of [AGAA]n. Barber and Parkin (1996) observed X.2

microvariants at this locus with a deletion of 2bp, AG, from the 3’-ﬁanking region.

X.3 Microvariant Alleles

Twenty-one different X.3 microvariants from 133 samples were sized amongst
the Proﬁler PlusTM or COﬁlerTM ladder alleles. These were 3bp larger than the smaller,
adjacent allele. They represented 18% of the samples containing conﬁrmed variants, and
0.4% amongst all DNA proﬁles reviewed. Eight different loci contained X.3
microvariants: vWA, FGA, D2181], D1885], D58818, D78820, TH01 and TPOX
(Table 17, above). One of the most common X.3 microvariants, 9.3 at the T1101 locus, is
found in the TH01 ladder in both the COﬁlerTM and SGM PlusTM ampliﬁcation kits. No
other ladders used by Orchid GeneScreen contain X.3 microvariants.

Previous research has identiﬁed the mechanisms behind the additional nucleotides
found in some X.3 microvariants. Similar to the X.1 microvariants at D78820, the X.3
microvariants at this locus can be caused by variability in a poly-T region 13 bases
downstream from the core GATA repeat. According to Egyed et al. (2000), the D78820
ladder alleles contain 9T8 in this region, while X.3 microvariants contain only 8T8,
indicating a deletion of a single T nucleotide. Similarly, several of the X.3 microvariants
at TH01 have been observed with a deletion of a single “A” nucleotide. Brinkmann et al.

(1996) determined that one repeat sequence for the 9.3 microvariant is

[AATG]5ATG[AATG]3, as compared with allele 9, which has a repeat sequence of

[AATG]9.

49

Variant Alleles Below the Ladder

Twelve different variants that sized below the smallest ladder alleles were found
in 99 samples, representing 13% of those samples containing conﬁrmed variants, and
0.3% amongst all DNA proﬁles reviewed. These variants included X.1, X.2 and X.3
microvariants as well as alleles with a full number of repeats, and ranged from 2bp —
13bp below the smallest alleles in the ladder. Seven different loci contained these below
ladder alleles: D3Sl358, FGA, D881179, D18851, D138317, D78820 and TH01 (Table
18,above)

Because the X.1, X.2 and X.3 microvariants are included in this category,
multiple mechanisms exist for the sequence variation. For example, all of the below

ladder alleles at the FGA locus were microvariants. The repeat structure for alleles at
FGA is ['1'1'1'C]3'l'1'I'I"1'I‘CT [C'I'I'l’]n CTCC [TTCC]2. According to Butler (2001),

the X.2 microvariants at FGA, both below and above the ladder can be caused by the
deletion of 2bp, CT, from the TI’CT sequence adjacent to the core [C'I'I'I‘]n repeat. This
is evident in the repeat structure of the 16.2 microvariant:

['1'1'1‘C13 'I'I‘TT 'I'I‘ [CTIT]9 CT CC [TI‘CC]2.
Allele 4 at the TH01 locus and allele 7 at D881179 comprised 2 of the 6 variants that
contained complete repeat units with sizes smaller than those of the ladder alleles. The
repeat sequence of variant allele 4 is [AATG]4, and is one STR unit smaller than allele 5
in the COﬁlerTM ladder for TH01 (Grifﬁths et al., 1998), although it is found in the SGM
PlusTM ladder. Similarly, variant allele 7 has a repeat sequence of [TCTA]7, and is one

STR unit smaller than allele 8 in the D881179 ladder (Grifﬁths et al., 1998).

50

Variant Alleles Above the Ladder

Twenty-one different variant alleles with sizes larger than the ladder alleles were
found in 245 samples. These comprised the largest category of observed variants,
representing 32% of samples containing conﬁrmed variants, and 0.7% of all DNA
proﬁles reviewed. Similar to the below ladder variants, these included X.1, X.2 and X.3
microvariants, as well as alleles with complete STR units, and they ranged in size from
4bp — 78bp larger than the ladder alleles. The D3Sl358, FGA, D18851, D58818, TH01,
TPOX and CSFlPO loci contained these above ladder variants (Table 19, above).

Similar to those found below the FGA ladder, the above ladder X.2 microvariants

can be caused by a 2bp deletion from the region adjacent to the core [C'l'l'l']n repeat

sequence of FGA (Butler, 2001). The published repeat sequences for FGA ladder alleles,
ranging from 42.2 — 48.2 repeats, also contain additional repeated sequences not included

in smaller FGA ladder alleles. For example, the 42.2 allele has a repeat structure of
[I I 1 C14 1 I I 1 W [C I I 118 [CTGT14 [CTTT113 [CTTC13 [CTTT13 CT CC [1T CC14

(Grifﬁths et al., 1998). The sequences in bold face indicate the additional nucleotide
repeat units. The cause of these sequences has not been established, although it is
possible that several single-nucleotide mutations within the core C'l'I'I‘ sequence may
have occurred (e. g. T to G and T to C). Consequently, the larger microvariant alleles at
FGA may be the result of multiple mutations, rather than a single insertion or deletion of
one or more nucleotides.

The 13.3 microvariant at the TH01 locus also has a unique repeat structure, which

also may have arose from multiple mutations. While smaller alleles with complete repeat

units have a core structure of [AATG]n, the 13.3 microvariant contains an additional

51

sequence of AACG amid the core repeat unit [AATG] [AACG] [AATG]3 ATG [AATG]3

(Grifﬁths et al., 1998). Similar to the larger alleles at FGA, it is possible that a T to C
mutation in the AATG sequence is the cause for this additional sequence, indicating that
the 13.3 microvariant may contain more than one mutation. It is unclear why this
additional sequence is present in this allele, and not in the smaller TH01 alleles, including
the 9.3 microvariant. Just like 9.3, the partial repeat in 13.3 arises from a deletion of a
single ‘A’ nucleotide in one of the AATG repeats (Gene et al., 1996, Grifﬁths et al.,
1998). At both FGA and TH01, it is evident that as the alleles become larger in size,
more possibilities for sequence variation exist.

An irregularity was observed in the one sample containing the 28.1 microvariant
at the D1885] locus that was not observed in any other variant allele from this
investigation. When reviewing the electropherograms generated after re-analysis, the
Genotyper® software did not label this allele off-ladder. In fact, this allele did not have a
label, and only the sister allele peak was labeled with its designation. The reason for this
was that the size of microvariant allele 28.1 was larger than the programmed size range
for the D1885] locus in the Genotyper® software. The sample containing this
microvariant was re-tested twice with the same result (Appendix C). It is likely that the
size range for alleles at D1885] is larger than originally believed, and may need to be

broadened in order to accurately assign designations to these larger alleles.

52

Novel Variant Alleles

Fourteen of the variant alleles at 6 different loci were not previously reported by
other researchers, and are presumed to be novel alleles. These alleles encompass each of
the 5 different variant allele categories. All are considered to be extremely rare, as they
were found in only 1 — 5 samples each. Currently, these alleles are being sequenced to
conﬁrm their designations and to determine where within the repeat structure the
mutation occurred. These 14 alleles are summarized in Table 24, and their

electropherograms are included in Appendix C.

Table 24: 14 Novel Variant Alleles .
These 14 variants have not been previously reponed and are considered to be novel alleles. The number in
parentheses indicates the total number of novel variants observed at each locus. The number of samples
containing each variant is included to indicate the relative rarity of each.

 

 

 

 

 

 

 

 

D3Sl358 (1) 15.1 1
FGA(6) 14.3 2
15.3 3

21.1 1

33.1 2

34.1 2

41.2 5

D18551(3) 13.3 4
16.1 1

28.1 1

D58818(2) 12.3 2
l8 2

D7S820 ( 1) 5.2 1
TPOX“) 7.3 l

 

 

Miscalled Suspected Variant Alleles
While 85% of the samples contained variant alleles, the remaining 15% did not.
The 135 samples comprising these 15% were recorded in error (Table 12, above). After

re-reviewing the original DNA proﬁles for these samples, it was evident that several

53

ampliﬁcation or electrophoresis anomalies were overlooked during the initial data
collection. While the author should have detected the majority of these problems before
re-testing, others would not have been apparent until after re-analysis. Each of the 135
samples was placed into four different anomaly categories:

0 Missing or Low RFU Size Standards;

0 DNA Fragment Migration Anomalies;

o Pull-up; and

o Incomplete Adenylation.

Missing or Low RF U Size Standards

Missing or low RFU size standards represented the largest problem observed in
these DNA proﬁles, which affected 89 samples. The GeneScan® software is
programmed to recognize size standards at 150RFU8 or higher (GeneScan® Reference
Guide, 1997). Standards with peak heights below this value (but above SORFUs) were
classiﬁed as low RFU standards. In addition, the Genotyper® software is programmed to
remove labels from standard peaks when their heights are below 50RFU8 (Genotyper®
User Manual, 1998). These were classiﬁed as missing standards. The intensity of the
light emitted by the fluorescently labeled standards may be too low if an insufﬁcient
sample volume is loaded onto the gel (Proﬁler PlusTM User Manual, 1997, Butler, 2001).

When the size standard RFU values are below these settings, some alleles may be
correctly labeled with their designations, while others may be called off-ladder.
Genotyper® uses the two standard peaks to the left of the allele and the two to the right to

assign allele designations. For example, allele 13 at D3Sl358 is sized using both the

54

7 pr and IOObp peaks to the left of the allele and the l39bp and 150bp peaks to the right.
If one of these standards has low RFUs or is missing, allele 13 may be labeled off-ladder,

falsely resembling a variant allele (Figure 5).

    
 

 

 

 

 

 

 

 

 

 

100 150 200 250 300 350 400

1104570? 8681ue (86)]104570P
400
l 200

11045701’ 866ml (86)]104570P
800
00
00
200

[E

11045701’ 86Yellow (86)]104570P
400
200

531

1104570P 86Red (86)]104570P
80
/ 60
4O
. i 2°

‘ u.— I : L—
EE] arm)
mm
160.00

Figure 5: Missing 750p Size Standard Peak

The suspected 12.3 microvariant at the D3Sl358 locus (top panel) was eaused by a missing 750p size
standard (bottom panel), indicated by the arrow. Although a peak is present where the 75bp peak should
be, the height of the peak was below 50 RFUs, and was not labeled by the Genotyper® software.
Therefore, accurate sizing of this allele was not possible. The size of this off-ladder peak was 117.680p,
while allele 16 was correctly sized at 130.19bp. Additional peaks are from markers in the DNA proﬁle thm
were correctly assigned allele designations.

55

Samples with missing size standards included those where the peak was not
visibly evident in the electropherogram and those with a visible peak whose height was
below 50 RFUs. Eighty-three of these samples were missing only the 75bp peak, two of
these were missing the 75bp, 100bp and 139bp peaks and the remaining two were
missing only the 139bp peak. This anomaly only affected alleles at the D3Sl358 and
D881 179 loci. At D3Sl358, missing standards caused all of the suspected 12.3, 15.3,
17.2 and 18.2 microvariants. In addition, missing standards caused all but two of the
suspected 13.3 microvariants and all but one of the suspected 14.3 microvariants at
D3Sl358 (causes for miscalls in the other samples containing these suspected
microvariants discussed below). This anomaly also caused all of the suspected 7.3' and
9.2 microvariants at the D1885] locus.

Two samples contained low RFU size standards and included suspected
microvariants 13.3 at vWA and 18.1 at D1885]. Because all of the standards for these 2
samples were labeled with their bp sizes, they were overlooked as the cause of the off-

ladder alleles in the DNA proﬁles (Figure 6).

56

' 1 1 l
100 150 200 250 300 350 400

   
   

 

 

 

 

 

 

 

 

   

 

 

 

 

 

 

1341041730 P 13 Blue (13)1104730 P
900
600
1 300
[:5] 01 Allele 2
168.36
19
188.76
1311047130 P 13 Green (13)1104730 P
2000
1500
1000
j 500
a *
13'1104730 P 13 Yellow (13)‘1104730 P
800
600
400
200
in B
12 IE IE
13°1104730 P ‘13 Red (13)1104730 P
150
100
J . . 50
75.00 _ 139.00 200.00 [300.00] [340.00]
100.00 150.00 m

l 60.00

Figure 6: Low Size Standard RF Us

The 75bp — 250bp size standard peak heights in this sample (bottom panel) were all below 150RFU8.
Below this value, the Genotyper® software does not always accurately assign designations. As the arrow
indicates, the off-ladder allele at the vWA locus (top panel) was caused by the insufﬁcient RFUs values of
the size standards. As a result, the off-ladder allele was presumed to be a 13.3 microvariant allele, when it
actually was an allele 14. Peaks in the middle two panels are from other markers.

57

DNA Fragment Migration Anomalies

A total of 44 different samples experienced migration anomalies when they were
originally electrophoresed, and the measured sizes of the alleles were incorrect. In 8 of
the samples, these anomalies were detected in the size standards (the others are discussed
below). During electrophoresis, a laser scans one speciﬁc region of the gel for DNA
fragments. Based on settings such as temperature and voltage, the fragments are
expected to migrate through this scan region within a speciﬁc time range. This time
range varies depending on the size of the fragments and is programmed in the
GeneScan® software. If the size standards do not migrate within their expected time
range, the software cannot assign their sizes accurately, leading to allele miscalls '
(GeneScan® Reference Guide, 1997).

Inaccurate sizing of the 75bp size standard resulted in one 14.3 and two 13.3
miscalls at D38 1358. In addition, two samples containing the suspected 7.3
microvariants at D78820 were miscalled due to inaccurate sizing of both the 250bp and
300bp size standards (Figure 7). Alleles in 3 samples that were grouped with the 9.3
microvariant at D7S820 (which was conﬁrmed using other samples) were labeled off-

ladder because of inaccurate sizing of the 300bp size standard peak.

58

1000
500

 

 

 

150
‘100
50

Figure 7: Migration Anomalies in the Size Standards

The incorrect sizes of the 250bp and 3000p size standards, labeled as 249.32bp and 299.20bp (bottom
panel), led to inaccurate sizing of an allele at D78820 (top panel). This allele was labeled off-ladder
because of an increased migration of the size standards, which caused them to appear smaller than they
actually were. The off-ladder allele was thought to be the 7.3 microvariant. but was actually an allele 8.

Migration of DNA fragments during electrophoresis is regulated by several
factors, including electrophoresis temperature and current, pH and conductivity of the TE
buffer and polyacrylamide gel consistency (Shewale et al., 2000). Any of these factors
may cause the DNA to migrate faster or slower through the gel than the expected
migration times. While the exact cause for these anomalies in the size standards is
uncertain, one of the most likely causes is a lack of uniformity in the gel. Minute
contaminants or bubbles within the gel may retard the migration rate of the DNA, while
initiating electrophoresis before the gel has completely polymerized may increase the
migration rate of the DNA.

Migration anomalies were assumed to be responsible for the off—ladder alleles in
36 samples. As recommended by the Genotyper® User’s Manual (1998), each allele

category in the Genotyper® software is 1-0.SObp wide. The size of each allele must

59

therefore be within 10.50bp of the reference allele size programmed in the software
before an allele designation can be assigned. Peaks that do not meet this criterion are
labeled off-ladder. When alleles migrate faster or slower through the gel than expected,
their measured sizes may erroneously fall outside of the actual allele categories. The
purported causes of these anomalies are the same as those that affected the size standard
migration rates (see above). Allele migration anomalies cannot typically be detected
without re-testing, as other visible problems with the DNA proﬁles may not be present.

An increase or decrease in allele migration affected 6 different loci. All of the
X.1 microvariants in this category had delayed migration rates, appearing to be larger
than they actually were. (Figure 8, below). These were:

0 13.1 and 14.1 at D3Sl358;

6.1 at vWA;
o 22.1 and 24.1 at FGA;
o 14.1atD18851;
0 11.1 at D58818; and
o 13.1atD13S317.
Similarly, all of the suspected X.3 microvariants in this category experienced increased

migration rates, and appeared to be smaller than they actually were. These were 21.3 and

23.3 at FGA, and 10.3 at D13S3l7.

6O

l I I
100 150 200 250 300 350 400

58° 1075265 58 Blue 1075265

1500
1000
500

 

 

 

 

   

 

 

 

 

58- 1075265 58 Green 1075255
\ 1500
1000
500
58- 1075265 58 Yellow 1075265
1500
1000
500
11
58- 1075265 58 Red 1075265
400
200

 

 

 

300.00 [3710.80] 400.00

'10000'

 

 

Figure 8: Allele Migration Anomaly at the D1885] Locus

The arrow indicates the suspected 14.1 microvariant at D1885] (second panel from top) that was a result of
delayed migration of allele 14. The migration of the allele was likely slowed by minute contaminants in the
gel, leading to the presence of the off-ladder allele. Peaks to the left of D1885] in the top three panels are
from other markers, while the size standards are shows in the bottom panel.

61

Pull- Up Peaks

Pull-up was a factor in one of the samples recorded as having a suspected variant
allele. It results from a failure of the GeneScan® software to accurately resolve the
ﬂuorescent dyes used to label the DNA fragments prior to electrophoresis. While each of
the dye colors emits its maximum ﬂuorescence at a different wavelength, a substantial
overlap occurs between several of the dyes, and the software is used to subtract out this
overlap. When the sample is overloaded and allele peak heights exceed the detection
limits for the instrument, the software may not be able to accurately subtract out the
overlapping dyes, and several loci may be affected. As a result, an allele peak from one
dye color can be “pulled-up” into the dye color(s) directly above or below the affeCted
locus (Proﬁler PlusTM User Manual, 1997, Butler, 2001).

The off-ladder allele in the sample’s original DNA proﬁle was the suspected 14.3
microvariant at D881179, which was paired with an allele 1]. All of the loci had peak
heights averaging over 2500 RFUs each, indicating an excess of DNA. Allele 12 at
D58818 had a peak height of over 3000 RFUs, and the signal overload from this allele
resulted in a pull-up peak at D881179 (Figure 9). After re-analysis, the only peak at
D881179 was allele 1]. While pull-up peaks in a DNA proﬁle are typically easy to

identify, the pull-up peak in this particular DNA proﬁle was overlooked.

62

744091403 P 74 Green (74)1091403 P

 

 

4000
1‘(//’ 3000
2000
1000
_ A -- A - - a
11 Ill it! II!
13912
OLMkk?
15578
74-1091403 P 74 Yellow (74)1091403 P

D58818 3000
2000
1 1000

 

 

 

In"
El

 

7401091403 P 74 Red (74)1091403 P
/ 800
600
400
11 -.1_11 1-- 1 1 11 20°

Figure 9: Off-Ladder Pull-Up Peak at the D881179 Locus

The top arrow indicates the pull-up peak at D881 179 (top panel) caused by allele 12 at the 058818 locus
(middle panel), the leftmost locus in the yellow dye color. The sample was highly concentrated as
evidenced by the high RFU values in the scale to the right of the allele peaks. The size of the pull-up peak
(155.78bp) was very close to the size of allele 12 (155.87bp). The pull-up peak from allele 12 is also
evident in the size standards (bottom panel), as indicated by the arrow. A smaller pull-up peak (labeled
139.00bp) at the D58818 locus was caused by allele 1 1 in 0881 179, which had a size of 139.12bp. After
re-running. the only peak present at D881 179 was allele ll. Peaks to the right of the alleles in the top and
middle panels are from other markers.

Incomplete Adenylation

The ﬁnal cause for miscalled alleles in the original DNA proﬁles was incomplete
adenylation. During PCR, a single nucleotide is added to the 3’ end of the DNA target
sequence. This nucleotide is typically adenosine (A) and PCR products with this
additional nucleotide are in the “+A” form. The computer software assigns designations
to peaks in the “+A” form. The ﬁnal step of PCR gives the DNA polymerase extra time

to complete this process. When an excess of DNA is present in the sample, the 45

63

minutes allotted for this extension may not be sufﬁcient. As a result, some of the DNA
will be in the “+A” form, and the remaining portion of DNA will be without the
additional adenosine, or “-A” form. Alleles that containing both “+A” and “-A”
sequences are evident as split peaks, where one peak is lbp shorter than the “+A” target
sequence. Due to the closeness in size of these two peaks, they may not be completely
resolved, and the smaller peak may be labeled off-ladder (Proﬁler PlusTM User Manual,
1997).

A split peak at the D7S820 locus appeared to be a suspected 7.3 microvariant
paired with an allele 8. This sample contained a high DNA concentration, and the peaks
at this locus had heights of approximately 2000RFU8. The microvariant was lbp Smaller
than its sister allele. After re-analysis, only allele 8 was present at the locus. It was
evident that the suspected 7.3 microvariant was not a true allele, and instead was a peak

in the “—A” form. The target sequence, allele 8, was in the “+A” form (Figure 10).

-A/ \+A

     

0L Allele ?

Figure 10: Incomplete Adenylation at the D78820 Locus

The suspected 7.3 microvariant (left arrow) was caused by incomplete adenylation of allele 8 (right arrow),
the target sequence. This resulted in a single allele with split peaks, where the smaller peak was in the
“-A” form, and the larger peak was in the “+A” form. The suspected 7.3 allele had a size of 266.73bp,
nearly lbp smaller than allele 8, which had a size of 267.72.

Racial Group Variant Allele m

During calculation of the variant allele frequencies, it became evident that African
Americans contained the highest number of different variant alleles. Of the 85 conﬁrmed
variants, 26 were observed only in African Americans, while an additional 21 were
observed predominantly in the group (i.e. at least 2 or more additional observations over
other racial groups). In contrast, only 13 variants were observed exclusively in
Caucasians, with 4 observed most often in this group. Seven of the variants were
observed only in Hispanics, and 1 occurred predominantly in this group (Table 21,
above). More Caucasian samples (16,257) were reviewed for this investigation than
African American samples (14,160), and intuitively one would expect a greater number
of variants in the former group. However, people of African descent have existed for a
longer period of time when compared with other groups, and are known to show more

variability in their DNA (Campbell, 1996).

Orchid GeneScreen Allele Frequency Database

The frequencies of the alleles already in the Orchid database decreased slightly
with the addition of the 28 variant alleles. In addition, two of the 28 variants, 31.2 and
32.2 at FGA, are ladder alleles in the SGM PlusTM kit. These alleles are not found in the
SGM database and the manual suggests a conservative frequency value of 1.3% be used
in forensic and paternity calculations. In contrast, the 31.2 and 32.2 alleles were found in
41 and 11 samples, respectively, in this investigation (not surprising, as the number of
Caucasians reviewed for this study was nearly 82 times greater than in the SGM

database, and nearly 73 times greater for African Americans). The adjusted frequency

65

value for the 31.2 microvariant in Caucasians was 0.1 106195%, which was larger than
0.0326%, the FGA 5/2N’ value for Caucasians in the Orchid database, but smaller than
1.3%. In contrast, both observed frequencies for the 32.2 microvariant (0.00650703% for
Caucasians and 0.0334515% for African Americans) were less than the 5/2N’ values in
the Orchid database (0.0326% for Caucasians and 0.0337% for African Americans), and
smaller than 1.3%. Therefore, the Orchid frequencies will be used for these alleles, and

will lead to higher P] values than if 1.3% were used.

Utilization of Variant Alleles for Patemitv Index Calculations

Recently, protocols were put into place by the Orchid laboratory directors for the
detection and utilization of the 28 variant alleles added to the database. The laboratory’s
LIMS software was reprogrammed to recognize these variants for use in paternity
calculations. When reviewing DNA proﬁles containing off-ladder alleles, laboratory
technicians refer to a list of the variants with their observed size ranges to determine
whether or not the allele is variant. If the size of the off-ladder allele falls within the
corresponding size range, it is considered variant and the peak is manually labeled with
its designation and size. Data from this locus can then be used in CPI calculations.
However, if the size of the off-ladder allele is very close to the observed size range but
does not fall within the range, the labels from the peaks are manually removed and data
from that locus are not used. The laboratory supervisors also review the DNA proﬁles
and have the ﬁnal say whether or not to use the variant allele data.

The genetic results of two paternity cases tested by Orchid GeneScreen are

included below to demonstrate the increase in the CPI value when using loci with variant

66

alleles (Tables 25 and 26). Calculations used to determine the P] value for each locus are
included in Appendix D. The ﬁrst example in Table 25 uses frequency data from the
33.1 microvariant at D2181], which was found in more samples (76) than any other
variant during this investigation. This demonstrates the expected increase in CPI when

one of the more “common” microvariants is the obligate paternal allele.

Table 25 : CPI Calculations - Most Common Variant Allele

Allele designations are listed for the mother, child and alleged father for each of the loci with data for a

case where the alleged father was included. P] values were determined using calculations from Traver,

1998 (Appendix D). The values in both P] columns were calculated using the African American allele
frequencies listed in the Orchid GeneScreen allele frequency database. The second Pl column also includes

variant data from the D2181] locus in bold face. The CPI values for both columns were calculated by
multiplying the P] values together. The calculated probability of paternity is included at the bottom of each

of these columns. ‘

D13S3I7 l 13 12 ll 12 1.17 1.17
D1885] 13 17 1 13 12,19 8.03 8.03
D7S820 12 8 12 8 3.21 3.21
D8Sll79 10 15 14 15 14 2.99 2.99

FGA 25 25 28 8.12 8.12

vWA 1 15 1 17 15 17 2.46 2.46

D2181] 30 31 33.1 31 33.1 - 240.17
CPI: CPI: 1

Probability of Probability of
= 99.94% = 99.99%

 

The addition of the D2181] 33.] PI value of 240.17 increased the CPI value from 1,801
to 432,601. This indicates a probability of 1 in 432,601 of selecting a random, unrelated
man from the same racial group as the biological father. While the CPI value of 1,801
was sufﬁcient for reporting an inclusion for this case, another case may have required a
higher CPI value, and additional testing would have been necessary prior to reporting the

CPI. Because variant alleles are relatively rare when compared to the ladder alleles, their

67

frequencies are small. The smaller these frequencies are, the larger the CPI value
becomes. The dramatic increase in the CPI value in the case above clearly demonstrates
that the use of loci containing even the most common variant allele will only provide
added support for the conclusion of paternity when an inclusion is indicated.

In Table 25, the probability of paternity was calculated to be 99.94% before the
inclusion of the variant allele, and 99.99% after the inclusion. Orchid GeneScreen
guarantees a minimum probability of paternity of 99.0% with indication of parentage, 80
probability values higher than this only increase the conﬁdence in the paternity
conclusions. The 0.05% increase in the probability of paternity further strengthens the
hypothesis that the correct man has been identiﬁed as the biological father.

While the CPI using the most common variant allele increased by a factor of over
200, the CPI value for a case containing one of the “rarest” variant alleles is even more
compelling, as shown in Table 26. The 9.3 microvariant at the D7S820 locus was found

in only 5 samples, the minimum number required for inclusion in the database.

68

Table 26: CPI Calculations - Least Common Variant Allele
Allele designations are listed for the mother, child and alleged father for each of the loci with data for a
case where the alleged father was included. P] values were determined using calculations from Traver,
1998 (Appendix D). The values in both P] columns were calculated using the African American allele
frequencies listed in the Orchid GeneScreen allele frequency database. The second Pl column also includes
variant data from the D78820 locus in bold face. The CPI values for both columns were calculated by
multiplying the P] values together. The calculated probability of paternity is included at the bottom of each

of these columns. a PI calculated using 5/2N’ allele frequency (0.0328%) instead of observed allele
frequency (0.013128%)

D13S3l7
D1885]
D2181]

D3Sl358
D58818

D881 179
FGA
vWA

D7 8820

12
15 18
28 29

15
9 12
10 16

23

15

9,12

8 12
17,18
28 30
15 16
9 12
15 16
19 22
15

9.3, 12

19.23
3.06
2.69
1.52
1.34
5.15
7.57
4.69

19.23
3.06
2.69
1.52
1.34
5.15
7.57
4.69 .

1524’

CPI = CPI = 89
Probability of Probability of
= 99.99% = 99.99%

 

The addition of the D78820 9.3 P] value of 1524 increased the CPI value from 58,949 to
89,838,459. This indicates a probability of 1 in 89,838,459 of selecting a random,
unrelated man from the same racial group as the biological father. The 5/2N’ frequency
for the D7S820 locus was used to calculate the P], as the observed frequency for 9.3 was
sufﬁciently small. If the observed frequency had been used, the P] value would have
more than doubled to 3809, and the CPI value would have increased to 224,516,759.
This indicates the results of using a more conservative value when the allele is
considerably rare in the racial group. The probability of paternity was calculated to be
99.99% both before and after the addition of the D78820 locus. While there was no
change at two decimal places, the CPI alone indicates that the strength of the genetic

evidence increased because of the addition of the D78820 PI value.

69

In both cases, the obligate paternal allele was a variant. This is not a requirement
in order to use data from a locus containing a variant allele in CPI calculations. Data
from any locus containing one of the 28 variant alleles, whether obligate or not, can now
be used in CPI calculations. The use of these variant alleles in CPI calculations ensures

that data are not discarded solely because the analysis software does not recognize them.

Reductions in Orchid GeneScreen Lmttorv Expenditures

The decline in additional testing required for paternity cases containing variant
alleles is expected to cause a decrease in laboratory expenditures. Currently, the exact
role these variants will play in the reduction of laboratory costs is unknown. Nearly 12%
of the samples tested annually require additional loci before paternity conclusions can be
made. The 28 variant alleles added to the Orchid database are found in approximately
1.6% of samples, or 2,160 samples tested yearly. It is probable that the retest rate could
decrease to 10.4% with the use of variant allele data. A decrease in the number of
samples requiring additional testing should lead to the purchase of fewer ampliﬁcation
kits annually.

Other factors that affect laboratory costs are more difﬁcult to pinpoint. These
include the number of combined man-hours needed to complete the testing on a case and
whether or not the case belongs to a public or private account. It is likely that the cost
saving beneﬁts of utilizing variant allele data will not be known until after they have been

used for at least one year.

70

Suggestions for Future Research

Although this project uncovered valuable information about variant alleles and
their usefulness in paternity dispute resolutions, several avenues are still in need of
pursuit. For instance, plans are being discussed to establish a more accurate size range
for each of the 28 variants added to the database. Currently, the observed size range is
used to determine whether or not a variant allele is present. However, there have been
circumstances where the size of an off-ladder allele is very close to the observed variant
size range, but does not fall within the range. Recently, a sample contained an off-ladder
allele with a size of 200.07bp at the D2181] locus. This allele was 0.02bp smaller than
the size range of 200.09 — 200.19bp for the 27.1 microvariant, and was therefore not
considered a variant. It is possible that this off-ladder allele was a 27.1 microvariant, but
the laboratory protocols dictate that variant alleles must size within the observed range.

In order to more accurately deﬁne the size ranges for these alleles, the laboratory
directors have proposed that each sample containing one of these variants be re-ampliﬁed
between 5 — 10 times and electrophoresed on the same number of gels. According to the
Proﬁler PlusTM User Manual (1997), allele sizes may differ between gel runs on the same
instrument due to variation in gel concentration and thickness, run temperature, and the
distance between the wells and the laser-scanning region. In addition, the slight
procedural and reagent variations that may affect several different gels can result in
greater size variation than would occur with samples analyzed on the same gel. While
the size of the variants found in these samples may measure within the observed ranges, it

is anticipated that some will have sizes beyond the range. In this way, variant alleles in

71

future samples that would have sized outside of the current observed range will be

recognized as variant, and their data will not be discarded.

Conclusions

This investigation uncovered many rare alleles that are found in individuals of
different racial descent, which can play a large role in resolving paternity disputes. The
author speculates that the use of these additional alleles may someday beneﬁt those
involved in forensic DNA testing. Perhaps many of these will be included in allelic
ladders used for DNA analysis in the future, or technological advances will allow these
alleles to be readily recognized. It is important that scientists continue to research these
and other rare alleles, as an increase in the number of known alleles will only improve the

discriminatory power of human identity testing.

72

APPENDICES

73

APPENDIX A

Appendix A contains the alleles with their corresponding frequencies found in the Orchid

GeneScreen Allele Frequency Database (prior to the addition of the 28 variant alleles).

74

Table 27 : Original Orchid GeneScreen Allele Frequency Database
Allele frequencies are given as percentages and were calculated prior to the addition of the variant allele
data. The N values indicate the number of individuals of a given racial group that had genotypes at that
locus. For each racial group, frequencies were calculated based on the total number of alleles present in

 

 

 

 

each locus.
D3Sl358 Caucasian African Hispanic
N’ = 7636 American N’ = 690
N’ = 7602
11 0.0851 0.0329 0.2174
12 0.0655 0.454 0. 145
13 0.308 0.77 0.507
14 12.40 9.05 7.50
15 26.90 29.20 36.60
15.2 0.0 0.0987 0.0
16 24.30 33.0 28.60
17 20.00 20.70 15.70
18 14.60 6.30 9.93
19 1.25 0.48 0.797
20 0.0131 0.0 0.0725
vWA Caucasian African Hispanic
N’ =7837 American N’ = 703
N’ = 7854
1 1 0.0128 0.739 0.213
12 0.0447 0.115 0.142
13 0.0957 1.69 0.498
14 9.65 6.32 6.26
15 9.90 21.30 10.20
16 21.80 26.0 31.50
17 27.0 20.30 26.40
18 21.40 13.90 16.60
19 8.47 6.97 6.69
20 1.55 2.23 1.42
21 0.109 0.427 0.0
22 0.0128 0.07 0.0

 

75

Table 27 (cont’d)

 

 

 

 

FGA Caucasian African Hispanic
N’ = 7674 American N’ = 939
N’ = 7419
15 0.0 0.0 0.0532
17 0.0847 0.148 0.161
17.2 0.0 0.0809 0.0
18 1.62 0.748 0.959
18.2 0.0 1.24 0.0
19 6.36 6.65 8.15
19.2 0.0065 0.283 0.107
20 14.70 6.20 10.5
20.2 0.15 0.135 0.0
21 17.70 11.20 12.10
21.2 0.319 0.148 0.0
22 17.70 18.0 14.30
22.2 1.0 0.202 0.213
23 14.10 17.8 14.70
23.2 0.437 0.101 0.373
24 13.40 16.8 15.40
24.2 0.1 1 1 0.0135 0.0532
25 8.50 9.89 13.2
25.2 0.013 0.0212 0.0
26 3.0 5.06 6.50
26.2 0.013 0.0135 0.0
27 0.593 3.19 2.29
28 0.156 1.25 0.639
29 0.0261 0.499 0.107
29.2 0.0 0.0135 0.0
30 0.013 0.175 0.107
30.2 0.0 0.135 0.0
31 0.0065 0.472 0.0532
D881179 Caucasian African Hispanic
N’ = 7731 American N’ = 679
N’ = 7796
8 1.69 0.237 0.663
9 1.27 0.487 0.957
10 9.15 2.32 8.62
l 1 7.24 4.45 6.11
12 14.70 11.90 12.20
13 32.60 20.30 29.50
14 19.80 33.50 25.30
15 10.60 19.40 13.20
16 2.67 6.06 3.09
17 0.362 1.19 0.368
18 0.0194 0.122 0.0736
19 0.0 0.0128 0.0

 

Table 27 (cont’d)

 

 

 

 

D2181] Caucasian African Hispanic
N’ = 7730 American N’ = 683
N’ = 7652
24.2 0.0776 0.0 0.0
25 0.0129 0.0327 0.0
25.2 0.0776 0.0131 0.0
26 0.207 0.124 0.146
26.2 0.0194 0.0 0.0
27 3.32 5.60 1.98
28 15.80 24.30 10.70
28.2 0.0194 0.0065 0.0
29 21.20 18.90 21.40
29.2 0.104 0.0588 0.366
30 25.80 18.70 25.60
30.2 3.53 1.92 3.0
31 7.37 8.09 8.27
31.2 8.96 5.20 11.70
32 1.42 1.720 0.952
32.2 8.54 6.94 10.60
33 0.207 0.457 0.146
33.2 2.90 3.08 4.25
34 0.0065 0.588 0.293
34.2 0.349 0.314 0.0732
35 0.0194 3.14 0.366
35.2 0.0388 0.0261 0.0
36 0.0065 0.608 0.0732
36.2 0.0065 0.0 0.0
37 0.0 0.137 0.0732
38 0.0 0.0523 0.0
D1885] Caucasian African Hispanic
N’ = 7628 American N’ = 647
N’ = 7463
8 0.0 0.0067 0.0
9 0.078700 0.127 0.0
10 0.904600 0.194 0.773
10.2 0.0 0.161 0.0773
11 1.120900 0.496 1.16
12 14.486100 6.25 12.20
13 12.624500 4.76 11.10
13.2 0.013100 0.496 0.0773
14 16.439400 6.58 17.0
14.2 0.0 0.369 0.0
15 14.407400 16.60 14.10
16 12.814600 18.10 11.80
17 11.602000 16.40 17.10
18 7.734700 11.60 6.65
19 4.404800 9.13 3.09
19.2 0.0 0.0402 0.0
20 1.730500 5.37 1.93
21 0.976700 2.22 1.16
22 0.458800 0.864 1.31
23 0.144200 0.214 0.309
24 0.052400 0.0335 0. 155
25 0.0 0.0201 0.0773
26 0.006600 0.0 0.0

 

Table 27 (cont’d)

 

 

 

 

 

 

D58818 Caucasian African Hispanic
N’ = 7829 American N’ = 705
N’ = 7845
3 0.0 0.0064 0.0
6 0.0064 0.0 0.0
7 0.204 0.331 4.82
8 0.326 5.55 1.42
9 3.55 1.90 4.75
10 5.84 5.98 4.89
1 1 36.90 23.70 40.60
12 36.20 35 .30 30.50
13 15.90 25.10 12.30
14 0.926 1.75 0.638
15 0.102 0.293 0.0
16 0.0 0.102 0.0
17 0.0 0.0191 0.0
0138317 Caucasian African Hispanic
N’: 7814 American N’ = 703
N’ = 7833
7 0.0256 0.0128 0.0
8 12.0 2.60 7.89
9 7.54 2.18 16.80
10 6.18 2.73 7.89
l 1 31.10 29.40 22.0
12 28.30 42.90 26.60
13 10.40 15.20 12.70
14 4.43 4.86 5.97
15 0.141 0.102 0.142
16 0.0 0.0191 0.0
D78820 Caucasian African Hispanic
N’ = 7685 American N’ = 939
N’= 7612
6 0.013 0.105 0.0
7 2.08 0.769 1.38
8 15.60 20.40 12.80
9 16.50 11.40 8.63
10 26.80 33.90 26.60
1 l 20.40 20.40 28.0
12 14.50 10.80 18.40
13 3.33 2.02 3.73
14 0.709 0.21 0.479
15 0.0195 0.0 0.0
16 0.0065 0.0 0.0

 

APPENDIX B

Appendix B contains the alleles with their corresponding frequencies found in the Orchid

GeneScreen Allele Frequency Database (after addition of the 28 variant alleles).

79

Table 28: Modified Orchid GeneScreen Allele Frequency Database
Only loci containing variant alleles are shown. Allele frequencies are given as percentages, with variant
allele data in bold face. The N‘ values indicate the number of people of a given racial group that had
genotypes for a given locus. The n’ values indicate the total number of alleles (including variants) present
in the database. Allele frequencies were re-calculated by dividing the number of observations by n’. The
conservative 5/2N’ values vary as N’ differed for each locus.

 

 

 

 

D381358 Caucasian African Hispanic
N’ = 7636 American N’ = 690
n’ = 15274 N’ = 7602 n’ = 1383
5/2N’ = 0.0327 n’ = 15244 5/2N’ = 0.362
5/2N’ = 0.0329
9 0.00654707 0.163999 0.0
1 1 0.0850889 0.0328137 0.216928
12 0.0654914 0.4528087 0.144685
13 0.3079597 0.7679795 0.5059
14 12.3983763 9.026253 7.483731
15 26.8964777 29. 1233797 36.520607
15.2 0.0 0.09844] 0.0
16 24.2968 181 32.9134086 28.53796]
17 19.9973812 20.6456835 15.665944
18 14.5980883 6.2834689 9.90846
19 1 .2498363 0.4787405 0.79527]
20 0.0130983 0.0 0.072343
20.] 0.085112 0.0131199 0.144613
FGA Caucasian African Hispanic
N’ = 7674 American N’ = 939
n’ =15368 N’ = 7419 n’ =1879
5/2N’ = 0.0326 n‘ = 14947 5/2N’ = 0.266
5/2N’ = 0.0337
15 0.0 0.0 0.0531717
16.] 0.0 0.0735934 0.0
16.2 0.0 0.0267612 0.0
17 0.0845898 0.1469207 0.1609143
17 .2 0.0 0.0803] 0.0
18 1.6178917 0.7425453 0.9584896
18.2 0.0 1.2309574 0.0
19 6.3517231 6.6015053 8.1456626
19.2 0.00649154 0.2809362 0.1069431
20 14.6808693 6.1547869 10.49441 19
20.2 0.1498048 0.1340155 0.0
21 17.676965] 1 1.1 183247 12.0935604
21.2 0.3185849 0.1469207 0.0
22 17.676965] 17.8687362 14.2923896
22.2 0.9986986 0.2005269 0.2128866
22.3 0.0 0.0200709 0.0532198
23 14.0816502 17.6701947 14.6921767
23.2 0.4364313 0.1002635 0.3728015
23.3 0.0 0.0401418 0.0532198
24 13.3825612 16.677487] 15.3918042
24.] 0.0 0.0200709 0.0
24.2 0.1108555 0.0134016 0.0531717
24.3 0.00650703 0.0200709 0.0
25 8.4889381 9.8178778 13.192975
25.2 0.0129831 0.0210454 0.0
25.3 0.0 0.0267612 0.0
26 2.9960958 5.0231003 6.4965407
26.2 0.0129831 0.0134016 0.0
27 0.5922283 3.1667371 2.2887813
28 0.155797 1.2408845 0.6386599

 

80

Table 28 (cont’d)

 

 

 

 

FGA Caucasian African Hispanic
(cont’d) N’ = 7674 American N’ = 939
n’=15368 N’=7419 n’=1879
5/2N’ = 0.0326 n’ = 14947 5/2N’ = 0.266
5/2N’ = 0.0337
29 0.026066 0.495361 1 0.1069431
29.2 0.0 0.0134016 0.0
30 0.0129831 0.1737238 0.1069431
30.2 0.0 0.1340155 0.0
31 0.00649154 0.468558 0.0531717
31.2 0.1106195 0.0 0.0
32.2 0.00650703 0.0334515 0.0
D2181] Caucasian African Hispanic
N’ = 7730 American N’ = 683
n’ = 15467 N’ = 7652 n’ = 1369
5/2N’ = 0.0323 n’ = 15371 5/2N’ = 0.366
5/2N’ = 0.0327
24.2 0.077565 0.0 0.0
24.3 0.0 0.078069 0.073046
25 0.012894 0.032557 0.0
25.2 0.077565 0.013043 0.0
26 0.206906 0.12346 0.14568
26.2 0.019391 0.0 0.0
27 3.318497 5.57559 1.975661
27.1 0.0 0.019517 0.0
28 15.792849 24. 19408 10.676552
28.2 0.019391 0.0064717 0.0
29 21.190405 18.817618 21.353104
29.2 0.103953 0.05 8544 0.365198
29.3 0.025862 0.026023 0.0
30 25.788324 18.618489 25.543901
30.2 3.528402 1.91 1631 2.993426
31 7.366665 8.054737 8.251877
31.2 8.955945 5.177334 1 1.674361
32 1.419357 1.712503 0.949914
32.2 8.536135 6.90975 10.576771
33 0.206906 0.455008 0.14568
33.1 0.019396 0.208184 0.073046
33.2 2.898688 3.066575 4.240687
34 0.006497] 0.585437 0.292358
34.1 0.0064654 0.058552 0.073046
34.2 0.348842 0.312631 0.07304
35 0.019391 3.126313 0.365198
35.1 0.0 0.039035 0.0
35.2 0.038782 0.025986 0.0
36 0.0064971 0.60535 0.07304
36.2 0.0064971 0.0 0.0
37 0.0 0.136403 0.07304
38 0.0 0.052072 0.0

 

Table 28 (cont’d)

 

 

 

 

D1885] Caucasian African Hispanic
N’ = 7628 American N’ = 647
n’ = 15258 N’ = 7463 n’ = 1296
5/2N’ = 0.0328 n’ = 14974 5/2N’ = 0.386
5/2N’ = 0.0335
8 0.0 0.00667852 0.0
9 0.07869 0. 1265929 0.0
10 0.904481 0.1933781 0.7718071
10.2 0.0 0.1604839 0.0771807
11 1.120753 0.49441 1.1582099
11.2 0.0 0.0200347 0.0
12 14.484201 6.2299653 12.1811728
12.2 0.0 0.026713 0.0
13 12.622845 4.7447416 1 1.0828704
13.2 0.013098 0.49441 0.0771807
14 16.437245 6.5589074 16.9737654
14.2 0.0 0.3678171 0.0
15 14.40551 1 16.5467878 14.0782407
15.2 0.0065539 0.1268866 0.0771605
16 12.81292 18.0419794 11.7817901
17 11.600479 16.3474289 17.0736111
18 7.733686 1 1.5628155 6.6397377
19 4.404223 9.1007333 3.0852315
19.2 0.0 0.040071 1 0.0
20 1.730273 5.3527862 1.9270216
20.2 0.0 0.0333912 0.0
21 0.976572 2.2128837 1 . 1582099
21.2 0.0065539 0.0801389 0.0
22 0.45874 0.8612304 1.3079784
23 0.144181 0.213314 0.3085231
24 0.052393 0.0333926 0.1547608
25 0.0 0.0200356 0.0771807
26 0.0065991 0.0 0.0
D7S820 Caucasian African Hispanic
N’ = 7685 American N’ = 939
n’ =15376 N’ = 7612 n’ = 1887
5/2N’ = 0.0325 n’ = 15235 5/2N’ = 0.266
5/2N’ = 0.0328
6 0.012995 0.104924 0.0
6.3 0.019511 0.0 0.0
7 2.079188 0.768445 1.3734181
8 15.593913 20.385271 12.7389507
9 16.493561 1 1.391769 8.5888394
9.1 0.019511 0.0065638 0.0
9.3 0.0 0.013128 0.0529942
10 26.789542 33.875523 26.473132
10.1 0.0065036 0.039383 0.0529942
10.3 0.0065036 0.0 0.3709592
1 1 20.39204 20.385271 27.866454?
11.1 0.026015 0.0065638 0.0
12 14.494342 10.792202 18.3122417
13 3.328701 2.018542 3.7122099
14 0.708723 0.209848 0.4767154
15 0.019492 0.0 0.0
16 0.0064975 0.0 0.0

 

82

APPENDIX C

Appendix C contains electropherograms for the 14 novel variant alleles observed during

this investigation.

83

1500
1000
500

 

Figure 11: 15.1 Microvariant at the D3Sl358 Locus

As indicated by the arrow, this 15.1 microvariant at the D3Sl358 locus was the larger sized peak in a split
peak that also contained allele 15. It had a size of 127.30bp with a height of approximately 1500RFUs.
While not completely resolved due to the closeness in size to its sister allele, the 15.1 microvariant allele
was not a result of incomplete adenylation. This novel allele was lbp larger than allele 15, which has an
average size of 126.34bp when analyzed on an ABI 377, and allele 15 in this sample had a size of
126.40bp. Peaks to the right of D3Sl358 are from other markers.

4000

\ 3000

2000
[[5 0L Allele ?
IE 20779 SE
m

1 000
Figure 12: Below Ladder 14.3 Microvariant at the FGA Locus
As indicated by the arrow, this 14.3 microvariant at the FGA locus had a size of 207.79bp with a height of
approximately 2500RFUs. This novel allele was 13bp smaller than allele 18, which has an average size of
220.00bp when analyzed on an ABI 377. The microvariant was paired with a 21.2 allele, which had a size
of 234.85bp and a height of approximately 2500RFUs. Peaks to the left of FGA are from other markers.

 

1000
500

 

 

 

Figure 13: Below Ladder 15.3 Microvariant at the F GA Locus

The arrow indicates the 15.3 microvariant at the FGA locus, which had a size of 211.88bp and a height of
approximately 1100RFUs. This novel allele was 9bp smaller than allele 18, which has an average size of
220.00bp when analyzed on an ABI 377. The microvariant was paired with an allele 24, which had a size
of 245.11bp and a height of approximately l200RFUs. Peaks to the left of FGA are from other markers.

84

‘(//r 3000
2000
1000

 

Figure 14: 21.1 Microvariant at the FGA Locus

The arrow indicates the 21.1 microvariant at the FGA locus, which had a size of 233.70bp and a height of
approximately 2500RFUs. This novel allele was lbp larger than allele 21, which has an average size of
232.38bp when analyzed on an AB] 377. The microvariant was paired with an allele 19, which had a size
of 224.36bp and a height of approximately 3000RFUs. Peaks to the left of FGA are from other markers.

2000
1500‘
1000
500

   

Figure 15: Above Ladder 33.1 Microvariant at the FGA Locus

As indicated by the arrow, this 33.1 microvariant at the FGA locus had a size of 281.74bp with a height of
approximately 1000RFUs. This novel allele was 13bp larger than allele 30, which has an average size of
268.62bp when analyzed on an ABI 377. The microvariant was paired with an allele 20, which had a size
of 228.56bp and a height of approximately 1200RFUs. Peaks to the left of FGA are from other markers.

4000
2000

 

Figure 16: Above Ladder 34.1 Microvariant at the FGA Locus

The arrow points to the 34.1 microvariant allele at the FGA locus, which had a size of 285.70bp and a
height of approximately 2000RFUs. This novel allele was 17bp larger than allele 30, which has an average
size of 268.62bp when analyzed on an ABI 377. The microvariant was paired with an allele 22, which had
a size of 236.88bp and a height of approximately 2500RFUs. Peaks to the left of FGA are from other
markers.

85

4000
2000

257.06

Figure 17: Above Ladder 41.2 Microvariant at the F GA Locus

The arrow indicates the 41.2 microvariant at the FGA locus, which had a size of 314.37bp and a height of
approximately 2000RFUs. This novel allele was 46bp larger than allele 30, which has an average size of
268.62bp when analyzed on an ABI 377. The microvariant was paired with an allele 27, which had a size
of 257.06bp and a height of approximately 3000RFUs. Peaks to the left of FGA are from other markers.

 

2000
1500
1000
500

 

 

8.

Figure 18: 13.3 Microvariant at the D1885] Locus

The arrow points to microvariant allele 13.3 at the D1885] locus, which had a size of 293.18bp and a
height of approximately lOOORFUs. This novel allele was lbp smaller than allele 16, which has an average
size of 301 .81bp when analyzed on an ABI 377. It was paired with an allele 12, which had a size of
286.09bp and a height of approximately 800 RFUs. Peaks to the left of D1885] are from other markers.

 

4ooo
‘(// 3000
2000

1000

l

I; FE] .522; IE
m
sam

Figure 19: 16.1 Microvariant at the D1885] Locus

The arrow points to microvariant allele 16.1 at the D1885] locus, which had a size of 303.01bp and a
height of approximately 3000RFUs. This novel allele was lbp larger than allele 16, which has an average
size of 301.81bp when analyzed on an ABI 377. It was paired with an allele 13, which had a size of
290.22bp and a height of approximately 3000 RFUs. Peaks to the left of D1885] are from other markers.

86

4000

 

l/
2000
~ _ '4 A I .l
a IE [B
297.96 m

Figure 20: Above Ladder 28.] Microvariant at the D1885] Locus

The arrow indicates the 28.1 microvariant at the D1885] locus, which had a size of 351.52bp and a height

of approximately 2500RFUs. This novel allele was 9bp larger than allele 26, which has an average size of
342.71bp when analyzed on an ABI 377. It was paired with an allele 15, which had a size of 297.96bp and
a height of approximately 5000 RFUs. Peaks to the left of D1885] are from other markers.

1500
1000
500

 

 

Figure 21: 12.3 Microvariant at the D58818 Locus

As indicated by the arrow, this 12.3 microvariant at the D58818 locus had a size of 158.76bp and a height
of approximately 1500RFUs. This novel allele was 3bp larger than allele 12, which has an average size of
155.72bp when analyzed on an AB] 377. It was paired with an allele 12, which had a size of 155.87bp and
an approximate height of 1700RFUs in this sample. Peaks to the right of D58818 are from other markers.

4000
3000
2000

"E—z] ii - - - 1000
[E

 

 

Figure 22: Above Ladder Variant Allele 18 at the D58818 Locus

The arrow points to variant allele 18 at the D58818 locus, which had a size of 179.32bp and a height of
approximately 2500RFUs. This novel allele was 8bp larger than allele 16, which has an average size of
171.32bp when analyzed on an ABI 377. It was paired with an allele 13, which had a size of 159.64bp and
an approximate height of 2500RFUs. Peaks to the right of D58818 are from other markers.

87

2000
1500
1000
500

 

 

 

Figure 23: Below Ladder 5.2 Microvariant at the D78820 Locus

The arrow points to the 5.2 microvariant at the D7S820 locus, which had a size of 257.78bp and a height of
approximately 1500RFUs. This novel allele was 2bp smaller than allele 6, which has an average size of
259.42bp when analyzed on an ABI 377. It was paired with an allele 8, which had a size of 267.45bp and
an approximate height of 1500RFUs. Peaks to the left of D7S820 are from other markers.

2000

1500
1000

500

 

El

 

Figure 24: 7.3 Microvariant at the TPOX Locus
As indicated by the arrow, the 7.3 microvariant at the TPOX locus had a size of 225.67bp and a height of

approximately 1000RFUs. This novel allele was 3bp larger than allele 7, which has an average size of
222.40bp when analyzed on an ABI 377. It was paired with an allele 12, which had a size of 242.9]bp and
an approximate height of 1500RFUs. Peaks to the left and right of TPOX are from other markers.

88

APPENDIX D

Appendix D contains the formulas used to calculate the Paternity Index (PI) values for
each locus.

89

Table 29: Formulas for Paternity Indices
The genotypes for the Mother (M), Child (C) and Alleged Father (AF) are given using letters A-D, which
represent different allele designations in a single locus. Allele frequencies for alleles A and B are
represented by a and b, respectively. Single letters in the ﬁrst three columns indicate homozygous loci,
while double letters represent heterozygous loci. Dashes in the M Genotype column indicate the PI value
was determined using only the AFs allele frequencies. Data from Traver (1998).

ED
BC
BC
BC

B
B
B

AB
AB
AB
AB
AB
AB

AB
AB
AB
AB
AB
AB
AB
AB
AB

 

90

1/2a
1/2a
1/2a
lla
1/2a
1/2a
l/a
a+b
1 a+b
1 a+b
1/2a
1/2a
1/a
1/23
1/a
l/4a
a+b 4ab
1/2a
l/2a
l/a

BIBLIOGRAPHY

91

BIBLIOGRAPHY

Applied Biosystems Corporation. (1997). “ABI PRISM® GeneScan® Reference Guide:
Chemistry Reference for the ABI PRISM® 377 and ABITM 373 DNA
Sequencers.” Foster City, CA.

Applied Biosystems Corporation. (1998). “ABI PRISM® Genotyper® 2.5 User’s
Manual.” Foster City, CA.

Applied Biosystems Corporation. (1997). “AmpFl’.STR® COﬁlerTM PCR Amplification
Kit: User's Manual.” Foster City, CA.

Applied Biosystems Corporation. (1997). “AmpF£STR® Proﬁler PlusTM PCR
Ampliﬁcation Kit: User's Manual.” Foster City, CA.

Applied Biosystems Corporation. (2001). “AmpFl’.STR® SGM PlusTM PCR
Amplification Kit: User's Manual.” Foster City, CA.

Barber, M.D., McKeown, BJ. and Parkin, B.H. (1996). “Structural Variation in the
Alleles of a Short Tandem Repeat System at the Human Alpha Fibrinogen
Locus.” International Journal of Legal Medicine. 108: 180 — 185.

Barber, MD. and Parkin, B.H. (1996). “Sequence Analysis and Allelic Designation of
the Two Short Tandem Repeat Loci D1885] and D881179.” International
Journal of Legal Medicine. 109: 62 — 65.

Brinkmann, B., Sajantila, A., Goedde, H.W., Matsumoto, H., Nishi, K. and Wiegand, P.
(1996). “Population Genetic Comparisons Among Eight Populations Using
Allele Frequency and Sequence Data from Three Microsatellite Loci.” Europgan
Journal of Human Genetics. 4: 175 — 182.

Budowle, B., Nhari, L.T., Moretti, T.R., Kanoyangwa, S.B., Masuka, E., Defenbaugh,
DA. and Smerick, J .B. (1997). “Zimbabwe Black Population Data on the Six
Short Tandem Repeat Loci — CSFlPO, TPOX, TH01, D38 1358, VWA, and
FGA.” Forensic Science Intemationil. 90(3): 215 - 22].

Butler, J.M. (2001). Forensic DNA Typing. San Diego: Academic Press.

Campbell, NA. (1996). Biology. Menlo Park: The Benjamin/Cummings
Publishing Company, Inc.

92

Crouse, C.A., Rogers, 8., Amiott, 13., Gibson, 8. and Masibay, A. (1999). “Analysis and
Interpretation of Short Tandem Repeat Microvariants and Three-Banded Allele

Patterns Using Multiple Allele Detection Systems.” Journal of Forensic Sciences.
44: 87 — 94.

Egyed, B., Furedi, 8., Angya], M., Boutrand, L., Vandenberghe, A., Woller, J. and Padar,
Z. (2000). “Analysis of Eight STR Loci in Two Hungarian Populations.”
Forensic Science International. 113(1 - 3): 25 — 27.

Gene, M., Huguet, E., Moreno, P., Sanchez, 0, Carracedo, A. and Corbella, J. (1996).
“Population Study of the STRs HUMTHO] (Including a New Variant) and
HUMVWA31A in Catalonia (Northeast Spain)” International Journal of Legal
Medicine. 108: 318 - 320.

Gill, P., Urquhart, A., Millican, 13.8., Oldroyd, N.J., Watson, 8., Sparkes, R. and
Kimpton, GP. (1996). “A New Method of STR Interpretation Using Inferential

Logic — Development of a Criminal Intelligence Database.” International Joumg
of Legal Medicine. 109: 14 — 22.

Grifﬁths, R.A.L., Barber, M.D., Johnson, P.E., Gillbard, 8.M., Haywood, M.D., Smith,
C.D., Arnold, J ., Burke, T., Urquhart, A. and Gill, P. (1998). “New Reference
Allelic Ladders to Improve Allelic Designation in a Multiplex STR System.”
International Journal of Legal Medicine. 111(5): 267 — 272.

Huang, N.B., Schumm, J .W. and Budowle, B. (1995). “Chinese Population Data on
Three Tetrameric Short Tandem Repeat Loci — HUMTHO] , TPOX, and CSF 1P0
— Derived Using Multiplex PCR and Manual Typing.” Forensic Science
International. 71: 131 — 136.

Margolis-Nunno, H., Brenner, L., Cascardi, J. and Kobilinsky, L. (2001). “A New
Allele of the Short Tandem Repeat (STR) Locus, CSFlPO.” Journal of Forensic
Sciences. 46: 1480 — 1483.

National Research Council. (1996). The Evaluation of Forensic DNA Evidence. The
National Academy Press.

Orchid Gene Screen. (2001). “Standard Operating Procedures for Paternity
Laboratory Testing.” East Lansing, MI.

Shewale, J .G., Richey, S.L. and Sinha, SK. (2000). “Detection and Correction of a
Migration Anomaly on a 310 Genetic Analyzer.” Journal of Forensic Sciences.
45(6): 1339 -— 1342.

STRBase Internet Website. (1997). http://www.cstl.nist.gov/biotech/strbase/. NIST
Biotechnology Division.

93

Traver, M. Appendices 5 (“Numeric Statements of the Strength of the Genetic
Evidence”) and 6 (“Formulas for Paternity Index and RMNE for Simple
Codominant Systems”). (1998). _P_arentage Testing Accreditation Requirements
Mimi, Third Edition. American Association of Blood Banks.

94

General References

Bar, W., Brinkman, B., Budowle, B., Carracedo, A., Gill, P., Lincoln, P., Mayr, W. and
Olaisen, B. (1997). “DNA Recommendations — Further Report of the DNA
Commission of the ISFH Regarding the Use of Short Tandem Repeat
Systems.” International Journal of Legal Medicine. 110: 175 — 176.

Brinkmann, B., Meyer, E. and Junge, A. (1996). “Complex Mutational Events at the
HumD21811 Locus.” Human Genetics. 98: 60 — 64.

Brito, R.M., Ribeiro, T., Viriato, L., Vieira-Silva, C., Espinheira, R., Pinto-Ribeiro, I. and
Geada, H. (2000). “Sequence Variation of New Alleles at the Short Tandem
Repeat D198253 Locus.” Journal of Forensic Sciences. 45: 932 — 934.

Budowle, B., Moretti, T.R., Baumstark, A.L., Defenbaugh, DA. and Keys, KM. (1999).
“Population Data on the Thirteen CODIS Core Short Tandem Repeat Loci in
African Americans, US. Caucasians, Hispanics, Bahamians, Jamaicans, and .
Trinidadians.” Jouml of Forensic Sciences. 44: 1277 — 1286.

Espinheira, R., Geada, H., Ribeiro, T. and Reys, L. (1996). “STR Analysis — HUMTHO]
and HUMFES/FPS for Forensic Application. Advances in Forensic
Haemogenetics. 6: 528.

Henke, J. and Henke, L. (1999). “Mutation Rate in Human Microsatellites.” American
Journal of Human Genetics. 64(5): 1473 -- 1474.

Junge, A. and Madea, B. (1998). “Validation Studies and Characterization of Variant
Alleles at the Short Tandem Repeat Locus D128391.” International Journal of
Legal Medicine. 112: 67 — 69.

Melvin, J .R., Jr., Kateley, J.R., Jr., Oaks, M.K., Simson, L.R., Jr. and Maldonado, W.E.
(1998). “Paternity Testing.” Forensic Science HmdbcmiciVolume H. Ed. R.
Saferstein. Englewood Cliffs: Prentice Hall.

Mizuno, N., Sekiguchi, K., Sato, H. and Kasai, K. (2003). “Variant Alleles on the Penta
E Locus in the PowerPlex® 16 Kit.” Journal of Forensic Sciences. 48: l — 4.

Momhinweg, E., Luckenbach, C., Fimmers, R. and Ritter, H. (1998). “D381358:
Sequence Analysis and Gene Frequency in a German Population.” Forensic
Science International. 95(2): 173 - 178.

Rudin, N. and Inman, K. (2002). An Introduction to Forensic DNA Analysis. Boca
Raton: CRC Press.

95

Ruitberg, C.M., Reeder, DJ. and Butler, J .M. (2001). “STRBase: A Short Tandem
Repeat DNA Database for the Human Identity Testing Community.” Nucleic
Acids Research. 29: 320 — 322.

Schanﬁeld, M8. (2000). “Parentage Testing.” Encyclopedia of Forensic Sciences.
504 — 515.

Schanﬁeld, M8. (2000). “Polymerase Chain Reaction — Short Tandem Repeats.”
Encyclopedia of Forensic Sciences. 526 — 535.

Sprecher, C.J., Puers, C., Lins, A.M. and Schumm, J.W. (1996). “General Approach to
Analysis of Polymorphic Short Tandem Repeat Loci.” BioTechniques. 20:
266 - 276.

Urquhart, A., Oldroyd, N.J., Kimpton, GP. and Gill, P. (1995). Highly Discriminating
Heptaplex Short Tandem Repeat PCR System for Forensic Identiﬁcation.
BioTechniques. 18: 116 — 121.

Zhou, H.G., Sato, K., Nishimaki, Y., Fang, L. and Hasekura, H. (1997). The .
HumD21811 System of Short Tandem Repeat DNA Polymorphisms in Japanese
and Chinese. Forensic Science Intemationﬂ. 86(1 — 2): 109 — 188.

 

96

 

 

(lllll‘lﬂlllllllllﬂlﬂlllllﬂlljlll