| Data name ⇅ | Database name ⇅ | DOI ⇅ | Description of data contents ⇅ | Data file ⇅ | Simple search URL ⇅ | Data acquisition method ⇅ | Data analysis method ⇅ | Number of data entries ⇅ | Data detail |
|---|---|---|---|---|---|---|---|---|---|
| LD bin list (Phase II) | D-HaploDB | 10.18908/lsdba.nbdc00036-003 |
LD bin list of D2 (Phase II). LD bin is a group of SNPs that mutually shows high LD (r2 > 0.8). See below for detail. |
dhaplo_d2_ld_bin_list.zip
(3.0MB ) |
http://togodb.biosciencedbc.jp/togodb/view/dhaplo_d2_ld_bin_list#en |
Perlegen 281K arrays、Affymetrix 500K arrays |
Common SNPs (MAF > 5%) were selected, and LDs (r2 values) between every SNP pairs within 300 kb were calculated. LD bins were collected so that every pairs within the bin shows r2 >0.8, by the greedy method of ldSELECT (Carson et al., AJHG 74, 106-120, 2004) after the program was modified so that it can handle haploid data. |
254,762 entries |
Data detail
open_in_full
|
| LD bin list (Phase III) | D-HaploDB | 10.18908/lsdba.nbdc00036-004 |
LD bin list of D3 (Phase III). LD bin is a group of SNPs that mutually shows high LD (r2 > 0.8). See below for detail. |
dhaplo_d3_ld_bin_list.zip
(3.1MB) |
http://togodb.biosciencedbc.jp/togodb/view/dhaplo_d3_ld_bin_list#en |
Affymetrix SNP 6.0 |
Common SNPs (MAF > 5%) were selected, and LDs (r2 values) between every SNP pairs within 300 kb were calculated. LD bins were collected so that every pairs within the bin shows r2 >0.8, by the greedy method of tagzilla (which is a rapid method of bin selection based on the principle of ldSELECT (Carson et al., AJHG 74, 106-120, 2004)). |
250,751 entries |
Data detail
open_in_full
|
| LD_bin Data (Phase II) | D-HaploDB | 10.18908/lsdba.nbdc00036-007 |
Results of LD bin calculations for D2 (Phase II) data sets. Files are in GFF format, and contains two kinds of lines, that are distinguishable by column# 3.
|
bin_2R80M5.gff.gz
(12.1MB) |
- |
Genotype Data (Phase II) |
Common SNPs (MAF > 5%) were selected, and LDs (r2 values) between every SNP pairs within 300 kb were calculated. LD bins were collected so that SNP pairs within the bin shows r2 >0.8, by the greedy method of ldSELECT (Carson et al., AJHG 74, 106-120, 2004) after the program was modified so that it can handle haploid data. tagSNP is the SNP that shows r2 >0.8 for all other members of the bin. Best tagSNP is the SNP that shows the highest mean r2 for all other members of the bin. |
SNP: 541,686 entries LD Bin: 254,762 entries |
Data detail
open_in_full
|
| LD_bin Data (Phase III) | D-HaploDB | 10.18908/lsdba.nbdc00036-008 |
Results of LD bin calculations for D3 (Phase III) data sets. Files are in GFF format, and contains two kinds of lines, that are distinguishable by column# 3.
|
bin_3R80M5Zb36.gff.gz
(12.8MB) |
- |
Genotype Data (Phase III) |
Common SNPs (MAF > 5%) were selected, and LDs (r2 values) between every SNP pairs within 300 kb were calculated. LD bins were collected so that SNP pairs within the bin shows r2 >0.8, by the greedy method of Tagzilla, which is based on the principle similar to ldSELECT (Carson et al., AJHG 74, 106-120, 2004). tagSNP is the SNP that shows r2 >0.8 for all other members of the bin. Best tagSNP is the SNP that shows the highest mean r2 for all other members of the bin. |
SNP: 565,646 entries LD Bin: 250,751 entries |
Data detail
open_in_full
|
| SNP List (Phase II) | D-HaploDB | 10.18908/lsdba.nbdc00036-001 |
A list of SNPs in D2 (Phase II). SNP genotypes in D1 (Phase I, Perlegen 281K SNPs) and those determined using Affymetrix 500K Array for overlapping 74 CHM samples were merged and QC'ed. LD bins were then determined. |
dhaplo_d2_snp_list.zip
(16.6MB ) |
http://togodb.biosciencedbc.jp/togodb/view/dhaplo_d2_snp_list#en |
Perlegen 281K arrays, Affymetrix 500K arrays |
The analyses by Perlegen 281K arrays are described in Kukita et al. (2005). The analyses by Affymetrix 500K are described in Higasa et al. (2009). Affymetrix markers were mapped to NCBI Build 35 (Higasa mapping was on Build 36), and merged with Perlegen data. The original Affymetrix data contained a small fraction (< 1%) of SNPs that were genotyped heterozygote. These typings were judged to be errors because the materials were presumed to be homozygotes. Thus, all heterozygous calls (in Affymetrix data) were forced to be no calls. Many of the heterozygous calls were also low S/N calls and frequently irreproducible when tested. See "LD bin List" for the calculation of LD bins. |
581,235 entries |
Data detail
open_in_full
|
| Data name | Database name | DOI | Description of data contents | Data file | Simple search URL | Data acquisition method | Data analysis method | Number of data entries | Data detail |
List of Data Metadata