[ Japanese | English ]
About This Database

LD_bin Data (Phase II)

Data description
Data name
LD_bin Data (Phase II)
DOI
10.18908/lsdba.nbdc00036-007
Description of data contents
Results of LD bin calculations for D2 (Phase II) data sets. Files are in GFF format, and contains two kinds of lines, that are distinguishable by column# 3. - LD_BIN line: SNPs in LD bin. tagSNPs and best tagSNPs (*) are marked. - LD_BIN_BOUNDARIES: Limit of LD bin *SNP that shows the highest mean r<sup>2</sup> among the SNPs in the bin
Data file
File name :
bin_2R80M5.gff.gz
File URL :
File size :
12.1MB
Simple search URL
-
Data acquisition method

Genotype Data (Phase II)

Data analysis method

Common SNPs (MAF > 5%) were selected, and LDs (r2 values) between every SNP pairs within 300 kb were calculated. LD bins were collected so that SNP pairs within the bin shows r2 >0.8, by the greedy method of ldSELECT (Carson et al., AJHG 74, 106-120, 2004) after the program was modified so that it can handle haploid data. tagSNP is the SNP that shows r2 >0.8 for all other members of the bin. Best tagSNP is the SNP that shows the highest mean r2 for all other members of the bin.

Number of data entries

SNP: 541,686 entries
LD Bin: 254,762 entries

Data detail
Data item Description
#1

seqname

#2

source

#3

feature

#4

start

#5

end

#6

score

#7

strand

#8

frame

#9

attributes