[ Japanese | English ]
About This Database

cDNA

Data description
Data name
cDNA
DOI
10.18908/lsdba.nbdc00371-002
Description of data contents
List of cDNA in locus
Data file
File name :
astra_cdna.zip
File URL :
File size :
3.3 MB
Simple search URL
http://togodb.biosciencedbc.jp/togodb/view/astra_cdna#en
Data acquisition method

For the five organisms (H. sapiens, M. musculus, D. melanogaster, and A. thaliana, C. elegans) other than O. sativa, cDNA sequences were obtained from UniGene database. For the UniGene cDNAs, we chose those sequences that presumably code for mature protein coding sequences (CDSs) according to the annotation. For O. sativa, a full-length 32 k cDNA clone set and information of coding sequences were obtained from the Laboratory of Gene Expression, Department of Molecular Genetics, National Institute of Agrobiological Sciences (Kikuchi et al., 2003; ftp://cdna01.dna.affrc.go.jp/pub/data/CURRENT).
The genomic sequences of H. sapiens, M. musculus, D. melanogaster, and A. thaliana were obtained from NCBI (ftp://ftp.ncbi.nih.gov/genomes/). The genomic sequences of C. elegans and the draft contigs of O. sativa were obtained from Sanger Center (ftp://ftp.sanger.ac.uk/pub/) and TIGR Institute (ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotaion_dbs/pseudomolecules/version_3.0/), respectively.

Data analysis method

As the first, mapping between full-length cDNAs and genome sequences by MEGABLAST. Following that, convertion to mapping data into bit arrays, detection of splicing patterns and distribution to the types. Genes were identificatified by cDNA annotations

Number of data entries

84,438 entries

Data detail
Data item Description
cDNA ID

Specific cDNA ID for this database

Locus ID

Specific locus ID for this database

Species

Species name

Chr. No.

Chromosome No.

Strand

Strand

Gene name

Gene name

UniGene ID

UniGene ID

cDNA UniGene ID

UniGene ID of this cDNA

Genbank accession

Genbank accession number

GI

Genbank ID

CDS left

The left position of coding region onto this cDNA

CDS right

The right position of coding region onto this cDNA

Splicing pattern

Splicing pattern (gene model)

cDNA sequence length

cDNA sequence length

DOU start

Premature (pseudo) start codon (true) or not (false)

DOU end

Premature (pseudo) stop codon (true) or not (false)

NMD

Nonsense-mediated mRNA Decay (NMD) might happen (true) or not (false)

Number of exons

Number of exons in this variant

Number of splicing regions

Number of alternative splicing/transcriptional initiation regionss