[ Japanese | English ]
About This Database

LD_bin Data (Phase III)

Data description
Data name
LD_bin Data (Phase III)
DOI
10.18908/lsdba.nbdc00036-008
Description of data contents
Results of LD bin calculations for D3 (Phase III) data sets. Files are in GFF format, and contains two kinds of lines, that are distinguishable by column# 3. - LD_BIN line: SNPs in LD bin. tagSNPs and best tagSNPs (*) are marked. - LD_BIN_BOUNDARIES: Limit of LD bin *SNP that shows the highest mean r<sup>2</sup> among the SNPs in the bin
Data file
File name :
bin_3R80M5Zb36.gff.gz
File URL :
File size :
12.8MB
Simple search URL
-
Data acquisition method

Genotype Data (Phase III)

Data analysis method

Common SNPs (MAF > 5%) were selected, and LDs (r2 values) between every SNP pairs within 300 kb were calculated. LD bins were collected so that SNP pairs within the bin shows r2 >0.8, by the greedy method of Tagzilla, which is based on the principle similar to ldSELECT (Carson et al., AJHG 74, 106-120, 2004). tagSNP is the SNP that shows r2 >0.8 for all other members of the bin. Best tagSNP is the SNP that shows the highest mean r2 for all other members of the bin.

Number of data entries

SNP: 565,646 entries
LD Bin: 250,751 entries

Data detail
Data item Description
#1

seqname

#2

source

#3

feature

#4

start

#5

end

#6

score

#7

strand

#8

frame

#9

attributes