[ Japanese | English ]
About This Database

Splicing pattern

Data description
Data name
Splicing pattern
DOI
10.18908/lsdba.nbdc00371-004
Description of data contents
The patterns of alternative splicing/transcriptional initiation
Data file
File name :
astra_splicing_pattern.zip
File URL :
File size :
1.2 MB
Simple search URL
http://togodb.biosciencedbc.jp/togodb/view/astra_splicing_pattern#en
Data acquisition method

For the five organisms (H. sapiens, M. musculus, D. melanogaster, and A. thaliana, C. elegans) other than O. sativa, cDNA sequences were obtained from UniGene database. For the UniGene cDNAs, we chose those sequences that presumably code for mature protein coding sequences (CDSs) according to the annotation. For O. sativa, a full-length 32 k cDNA clone set and information of coding sequences were obtained from the Laboratory of Gene Expression, Department of Molecular Genetics, National Institute of Agrobiological Sciences (Kikuchi et al., 2003; ftp://cdna01.dna.affrc.go.jp/pub/data/CURRENT).
The genomic sequences of H. sapiens, M. musculus, D. melanogaster, and A. thaliana were obtained from NCBI (ftp://ftp.ncbi.nih.gov/genomes/). The genomic sequences of C. elegans and the draft contigs of O. sativa were obtained from Sanger Center (ftp://ftp.sanger.ac.uk/pub/) and TIGR Institute (ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotaion_dbs/pseudomolecules/version_3.0/), respectively.

Data analysis method

As the first, mapping between full-length cDNAs and genome sequences by MEGABLAST. Following that, convertion to mapping data into bit arrays, detection of splicing patterns and distribution to the types.

Number of data entries

156,654 entries

Data detail
Data item Description
ID

ID for splicing pattern

Species

Species name

Locus ID

Specific locus ID for this database

Region

Splicing Region number in this locus

Splicing pattern

Splicing pattern number in this locus

cDNA ID

Specific cDNA ID for this database

Splicing left

The left position of splicing region onto genome

Splicing right

The right position of splicing region onto genome

Splicing pattern

Splicing pattern represented into bit array."1" means splicing region.

Splicing type

Splicing type (ex. cassette)

NAGNAG

NAGNAG (6bp acceptor site) or not