[ Japanese | English ]
About This Database

Genotype Data (Phase II)

Data description
Data name
Genotype Data (Phase II)
DOI
10.18908/lsdba.nbdc00036-005
Description of data contents
A list of SNP genotypes in D2(Phase II). SNP genotypes in D1 (Phase I, Perlegen 281K SNPs) and those determined using Affymetrix 500K Array for overlapping 74 CHM samples were merged and QC'ed.
Data file
File name :
mole_info_DhaploD2.txt.gz
File URL :
File size :
13.7MB
Simple search URL
-
Data acquisition method

Genotypes were determined using Perlegen 281K arrays and Affymetrix 500K arrays

Data analysis method

The analyses by Perlegen 281K arrays are described in Kukita et al. (2005). The analyses by Affymetrix 500K are described in Higasa et al. (2009). Affymetrix markers were mapped to NCBI Build 35 (Higasa mapping was on Build 36), and merged with Perlegen data. The original Affymetrix data contained a small fraction (< 1%) of SNPs that were genotyped heterozygote. These typings were judged to be errors because the materials were presumed to be homozygotes. Thus, all heterozygous calls (in Affymetrix data) were forced to be no calls. Many of the heterozygous calls were also low S/N calls and frequently irreproducible when tested.

Number of data entries

581,235 entries

Data detail
Data item Description
rs

RefSNP accession ID (rs number)

chr

Chromosome number that the SNP resides (1 - 22, X)

pos

Nucleotide position on chromosome that the SNP resides

allele1

allele 1

allele2

allele 2

gtype

genotypes of 74 samples of CHMs