[ Japanese | English ]
About This Database

Exon

Data description
Data name
Exon
DOI
10.18908/lsdba.nbdc00371-003
Description of data contents
Exons in variants
Data file
File name :
astra_exon.zip
File URL :
File size :
5.9 MB
Simple search URL
http://togodb.biosciencedbc.jp/togodb/view/astra_exon#en
Data acquisition method

For the five organisms (H. sapiens, M. musculus, D. melanogaster, and A. thaliana, C. elegans) other than O. sativa, cDNA sequences were obtained from UniGene database. For the UniGene cDNAs, we chose those sequences that presumably code for mature protein coding sequences (CDSs) according to the annotation. For O. sativa, a full-length 32 k cDNA clone set and information of coding sequences were obtained from the Laboratory of Gene Expression, Department of Molecular Genetics, National Institute of Agrobiological Sciences (Kikuchi et al., 2003; ftp://cdna01.dna.affrc.go.jp/pub/data/CURRENT).
The genomic sequences of H. sapiens, M. musculus, D. melanogaster, and A. thaliana were obtained from NCBI (ftp://ftp.ncbi.nih.gov/genomes/). The genomic sequences of C. elegans and the draft contigs of O. sativa were obtained from Sanger Center (ftp://ftp.sanger.ac.uk/pub/) and TIGR Institute (ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotaion_dbs/pseudomolecules/version_3.0/), respectively.

Data analysis method

As the first, mapping between full-length cDNAs and genome sequences by MEGABLAST. Following that, convertion to mapping data into bit arrays, detection of splicing patterns and distribution to the types.

Number of data entries

676,111 entries

Data detail
Data item Description
Exon ID

Specific exon ID for this database

Species

Species name

Locus ID

Specific locus ID for this database

cDNA ID

Specific cDNA ID for this database

Left in cDNA

The left position of exon onto cDNA

Right in cDNA

The right position of exon onto cDNA

Left in genome

The left position of exon onto genome

Right in genome

The right position of exon onto genome