Y-DNA project help

From ISOGG Wiki
Jump to: navigation, search
Ideogram of the Y chromosome

Newcomers of Y chromosome DNA tests and in Y-DNA projects (Haplogroups, Surname, Geographical) are often overwhelmed by the many technical terms used for testing Y-DNA listed here and in the Genetics Glossary.

Contents

Y chromosome

schematic how SNP mutations create haplogroups (clades) in the Y-tree
The Y chromosome (Y-DNA) is a DNA structure found in the nucleus of a male cell. Humans have 23 pairs of Chromosomes, 22 pair of autosomes and one pair of sex chromosomes, XX for females and XY for males. The Y chromosome is passed on without recombination by a father to his sons.

SNP

A SNP (Single-nucleotide polymorphism) happens when a single place in the genome sequence is altered during the cell formation process and this mutation persists in the progeny. A person has many inherited SNPs that together create a unique DNA pattern for that individual. Snips clarify the branching of a tree-separation of different subhaplogroups and to discover deep ancestry. A terminal SNP is the defining SNP of the latest subclade known by current research. It should be unique (UEP) and constant in time. ISOGG mantains a Y-SNP Index where synonymous names are listed.

Clade

Clade comes from the Greek word Klados = branch. A Clade on the Y chromosome tree is also called a Haplogroup. Subclade describes a sub-clade being downstream (occurring later in time). A Clade includes all the descendants of a single founder (common ancestor).

Ancestral/Upstream, Derived/Downstream

In the Y-tree older nodes (ancestors, toward the root) are Ancestral/Upstream. Younger nodes (descendants, toward the present) are Derived/Downstream.

Haplogroup

A Haplogroup is a branch of the human family tree. All men in the same Y-DNA haplogroup share the same SNP (unique marker in the Y-chromosome) which they have inherited from their common ancestor. The haplogroup is like a name for that common ancestor person. The haplogroup tells about current distribution and the migration patterns of the descendants of the haplogroup founder. The major Eurasian Y-DNA-haplogroups (E1b, G2a, I1, I2, J1, J2, R1a, R1b, etc.) formed over tens of thousands of years. In 2012 for many Y-DNA recent SNPs (under 3,000 years old) are available. These types of SNPs are informative for the historical time and sometimes can allow a connection to genealogical time.

Nomenclature System (YCC)

2002 the Y Chromosome Consortium (YCC) proposed two widely accepted nomenclature systems for Y-DNA haplogroups.[1] Major haplogroups are labeled with large capital letters (A–T). "Paragroups" are distinguished from haplogroups by using the * (star) symbol, which represents chromosomes belonging to a clade but not its subclades. Examples for the haplogroup definded by the SNPs L21/M529/S145 and L459:

  • Hierarchical system: R1b1a2a1a1b4 (FTDNA 2009), R1b1a2a1a1b3 (ISOGG 2012 v7.62), R1b1b2a1a2f (23andMe 2009).
  • Shorthand - SNP system: R-L21, R1b-L21, R-M529, R-S145. This system is more robust to changes in topology but widespread SNPs have often up to three synonymous names. For seldom and new terminal SNPs there is also the risk that they are not unique (recurrent).

Name Versions - Y-Chromosome Phylogenetic Trees

Since 2002 many new ramifications (SNPs) even in basal branches and subclades where found. The YCC [2], other scientific papers, societies and companies published substantial refining and updates to the Y-Chromosome Phylogenetic Tree, where the haplogroup names for deep clades often are different. In Genetic Genealogy the following name versions are important:

  • FTDNA: since 2005 uses mainly the Hierarchical system; in some places the Shorthand - SNP system.
  • 23andMe: No public version is available. If you are a customer you can view the linked information.
  • ISOGG: the Y-DNA Haplogroup Tree since 2006 is updated several times per year according to newest evidence from publications, FTDNA lab research and Y-DNA Projects research
  • Since autumn 2012: many scholars, companies and genetic genealogists agree [3] that the Shorthand - SNP system is the best solution to avoid confusion for the future. FTDNA has announced to merge totally to this system with the next major website update.

Y-STR - DYS values

STR (Short tandem repeat) is a short DNA motif (pattern). Y-STRs occur on the Y-DNA. DYS (DNA Y-Chromosome Segment) numbers show the repeats of an STR on that position. A DYS value tipically mutates with a certain (low) probability to a higher or lower value from generation to generation. By this DYS values are not unique and not constant in time.

Y-DNA - Haplotype

Y-DNA Haplotype is defined as one person's set of values for the DYS locations. A set of DYS values is highly informative for tracing recent ancestry (genealogical time). The quantity of needed DYS values depends on the research goal and the frequency of nearby haplotypes. For surname projects 12 or 25 markers can be enough, while for extended haplotpye studies (lineage distinction, pre surname time, SNP research) and to find more distant matches 37 to 111 markers are used. Modal haplotype is the most commonly occurring haplotype derived from a specific group. It should be near or identical to the common ancestor of that group.

Y-DNA - Matches

Y-DNA Matches are other kits (tested males) that have the same or similar numbers for the DYS values. While the same values usually are only probable in near relatives (father, son, brother, grandfather, cousins), step mutations can show a relation until many generations ago. In major european haplogroups (R1b-U106, R1b-U152, R1b-L21, I1-M255, E1b-M78, J2a-L26, G2a-L30, I2-M223, etc.) many subclades have overlapping haplotypes. In this cases only by high DYS coverage and positive testing of a recent terminal SNP the recent common ancestor is proven.

Cluster

A group of kits all together near by haplotype (Y-DNA matches) and having no discovered unique SNP defining a haplogroup is called Cluster. The given name is only temporary defined and usually only used within the project. It is interesting for members of a Cluster to work together for discovery of new SNPs (WTY, Y-Sequencing, Deep Clade test) and test funding of interesting and informative low coverage samples.

Deep Clade or single SNP test

Predominant Y-DNA-haplogroups in Europe
If a terminal SNP can be predicted with high probability or not is depending on the haplotype frequency and number of DYS values. For a strong prediction, usually possible for seldom haplotypes with enough matches and kits with 67 or 111 markers, the testing of single SNPs ($29) is often the better choice. If a kit has low DYS coverage (12 to 37 markers), has no SNPs tested and matches to a huge haplotype cluster from different subclades a Deep Clade test is often the smarter solution. As of fall 2012 Geno 2.0 test ($199) replaces the old Deep Clade test ($129).

New Deep Clade test (Geno 2.0)

Since October 2012 the new Geno 2.0 test is available for $199.95 at the Genographic Project page and includes deep SNP results for mtDNA, auDNA and X-DNA. According to statements from the FTDNA president Bennett Greenspan this test does replace the FTDNA Deep Clade test ($129).

  • If you have received your results please transfer your data from Genographic Project Geno 2 (in Profile / Expert Options) for free to Family Tree DNA. Only by doing this Y-SNP results will be available in FTDNA projects.

Single SNP test - Order an Advanced Test

  • Login to FTDNA (MyFTDNA) > Order an Upgrade > Order an Advanced Test > Select Filter Test Type SNP
  • Search for recommended SNP (for example L26) and Add it to the cart ($39). Repeat the last step for additional SNPs.
  • Make your order clicking on Next, etc.

Y-STR DYS Upgrade (Y-Refine)

  • Login to FTDNA (MyFTDNA) > Order an Upgrade > Order a Standard Test > Select A Product
  • Select your desired Upgrade:
    • 12 Marker kits: Y-Refine12to25 ($49), Y-Refine12to37 ($99), Y-Refine12to67 ($189);
    • 25 Marker kits: Y-Refine25to37 ($49), Y-Refine25to67 ($148);
    • 37 Marker kits: Y-Refine37to67 ($99), Y-Refine37to111 ($200);
    • 67 Marker kits: Y-Refine67to111 ($129)
  • Make your order clicking on Next, etc.

Paternal Ancestor Info

For every Y-DNA project it is very informative and sometimes important if the oldest paternal line (biological father of the father of the father etc.) information is given completely. This setting can be entered/changed in myFTDNA > My Account > Most Distant Ancestors: Direct Paternal:

  • Country of Origin: enter the oldest known or strongly suspected country of origin of your Y-DNA. For example: "Germany". Otherwise enter "Unknown Origin".
  • Name: enter name, year born/dead, municipality/city/town/village, province/county/region; example: "John Schmid, 1788-1852, Augsburg, Swabia". If only one date is known add informations like "b. 1788", "d. 1852", the place information should be the oldest known (usually birth, then marriage or other records and then dead). If there was a known Non-paternity event (surname change) and the biological father is unknown please indicate the latest known male ancestor and his mother ("Martin son of Maria Miller, b. 1822, Augsburg, Swabia"). It is a common error that the paternal ancestor of Maria is indicated ("Georg Miller, 1730, Swabia") because this ancestor is not the genealogical Y-DNA ancestor (strict paternal line).
  • Ancestral Location Direct Paternal: enter the full location information: house/place/street, municipality/city, ZIP, province/county/region, country. For example: "Hauptmarktplatz, Hoher Weg, Augsburg, Swabia, Bavaria, Germany". Enter the coordinates: Latitude and Longitude

GEDCOM/Family Tree

For surname projects or when your paternal ancestors did often change location (region, countries) the uploading of a GEDCOM file (see Genealogy software) with all the paternal ancestors included will greatly help research and the finding of new connections. You help especially adoptees or unknown male offspring to find the possible generation of connection.

References

  1. Y Chromosome Consortium (2002-02). "A Nomenclature System for the Tree of Human Y-Chromosomal Binary Haplogroups". Genome Research. doi:10.1101/gr.217602. http://genome.cshlp.org/content/12/2/339.full. Retrieved 2012-04-12. 
  2. Karafet TM, Mendez FL, Meilerman MB, Underhill PA, Zegura SL, Hammer MF (2008-05). "New binary polymorphisms reshape and increase resolution of the human Y chromosomal haplogroup tree". Genome Research. doi:10.1101/gr.7172008. http://genome.cshlp.org/content/18/5/830. Retrieved 2012-04-12. 
  3. Let's All Start Using Terminal SNP Labels Instead of Y Haplogroup Subclade Names, Okay? http://www.yourgeneticgenealogist.com/2012/09/lets-all-start-using-terminal-snp.html

See also

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox
Print/export