[ Japanese | English ]
List of Data Metadata
Showing 61 to 65 of 416 entries
Show entries Advanced Search
Data name Database name DOI Description of data contents Data file Simple search URL Data acquisition method Data analysis method Number of data entries Data detail
LD bin list (Phase II) D-HaploDB 10.18908/lsdba.nbdc00036-003

LD bin list of D2 (Phase II). LD bin is a group of SNPs that mutually shows high LD (r2 > 0.8). See below for detail.

dhaplo_d2_ld_bin_list.zip
(3.0MB )
http://togodb.biosciencedbc.jp/togodb/view/dhaplo_d2_ld_bin_list#en

Perlegen 281K arrays、Affymetrix 500K arrays

Common SNPs (MAF > 5%) were selected, and LDs (r2 values) between every SNP pairs within 300 kb were calculated. LD bins were collected so that every pairs within the bin shows r2 >0.8, by the greedy method of ldSELECT (Carson et al., AJHG 74, 106-120, 2004) after the program was modified so that it can handle haploid data.

254,762 entries
Data detail open_in_full
LD bin list (Phase III) D-HaploDB 10.18908/lsdba.nbdc00036-004

LD bin list of D3 (Phase III). LD bin is a group of SNPs that mutually shows high LD (r2 > 0.8). See below for detail.

dhaplo_d3_ld_bin_list.zip
(3.1MB)
http://togodb.biosciencedbc.jp/togodb/view/dhaplo_d3_ld_bin_list#en

Affymetrix SNP 6.0

Common SNPs (MAF > 5%) were selected, and LDs (r2 values) between every SNP pairs within 300 kb were calculated. LD bins were collected so that every pairs within the bin shows r2 >0.8, by the greedy method of tagzilla (which is a rapid method of bin selection based on the principle of ldSELECT (Carson et al., AJHG 74, 106-120, 2004)).

250,751 entries
Data detail open_in_full
LD_bin Data (Phase II) D-HaploDB 10.18908/lsdba.nbdc00036-007

Results of LD bin calculations for D2 (Phase II) data sets. Files are in GFF format, and contains two kinds of lines, that are distinguishable by column# 3.
- LD_BIN line: SNPs in LD bin. tagSNPs and best tagSNPs (*) are marked.
- LD_BIN_BOUNDARIES: Limit of LD bin
*SNP that shows the highest mean r2 among the SNPs in the bin

bin_2R80M5.gff.gz
(12.1MB)
-

Genotype Data (Phase II)

Common SNPs (MAF > 5%) were selected, and LDs (r2 values) between every SNP pairs within 300 kb were calculated. LD bins were collected so that SNP pairs within the bin shows r2 >0.8, by the greedy method of ldSELECT (Carson et al., AJHG 74, 106-120, 2004) after the program was modified so that it can handle haploid data. tagSNP is the SNP that shows r2 >0.8 for all other members of the bin. Best tagSNP is the SNP that shows the highest mean r2 for all other members of the bin.

SNP: 541,686 entries LD Bin: 254,762 entries
Data detail open_in_full
LD_bin Data (Phase III) D-HaploDB 10.18908/lsdba.nbdc00036-008

Results of LD bin calculations for D3 (Phase III) data sets. Files are in GFF format, and contains two kinds of lines, that are distinguishable by column# 3.
- LD_BIN line: SNPs in LD bin. tagSNPs and best tagSNPs (*) are marked.
- LD_BIN_BOUNDARIES: Limit of LD bin
*SNP that shows the highest mean r2 among the SNPs in the bin

bin_3R80M5Zb36.gff.gz
(12.8MB)
-

Genotype Data (Phase III)

Common SNPs (MAF > 5%) were selected, and LDs (r2 values) between every SNP pairs within 300 kb were calculated. LD bins were collected so that SNP pairs within the bin shows r2 >0.8, by the greedy method of Tagzilla, which is based on the principle similar to ldSELECT (Carson et al., AJHG 74, 106-120, 2004). tagSNP is the SNP that shows r2 >0.8 for all other members of the bin. Best tagSNP is the SNP that shows the highest mean r2 for all other members of the bin.

SNP: 565,646 entries LD Bin: 250,751 entries
Data detail open_in_full
SNP List (Phase II) D-HaploDB 10.18908/lsdba.nbdc00036-001

A list of SNPs in D2 (Phase II). SNP genotypes in D1 (Phase I, Perlegen 281K SNPs) and those determined using Affymetrix 500K Array for overlapping 74 CHM samples were merged and QC'ed. LD bins were then determined.

dhaplo_d2_snp_list.zip
(16.6MB )
http://togodb.biosciencedbc.jp/togodb/view/dhaplo_d2_snp_list#en

Perlegen 281K arrays, Affymetrix 500K arrays

The analyses by Perlegen 281K arrays are described in Kukita et al. (2005). The analyses by Affymetrix 500K are described in Higasa et al. (2009). Affymetrix markers were mapped to NCBI Build 35 (Higasa mapping was on Build 36), and merged with Perlegen data. The original Affymetrix data contained a small fraction (< 1%) of SNPs that were genotyped heterozygote. These typings were judged to be errors because the materials were presumed to be homozygotes. Thus, all heterozygous calls (in Affymetrix data) were forced to be no calls. Many of the heterozygous calls were also low S/N calls and frequently irreproducible when tested. See "LD bin List" for the calculation of LD bins.

581,235 entries
Data detail open_in_full
Data name Database name DOI Description of data contents Data file Simple search URL Data acquisition method Data analysis method Number of data entries Data detail