List of Data Metadata
Data name Database name DOI Description of data contents Data file Simple search URL Data acquisition method Data analysis method Number of data entries Data detail
Exon ASTRA 10.18908/lsdba.nbdc00371-003

Exons in variants

astra_exon.zip
(5.9 MB)
http://togodb.biosciencedbc.jp/togodb/view/astra_exon#en

For the five organisms other than O. sativa (H. sapiens, M. musculus, D. melanogaster, A. thaliana, and C. elegans), cDNA sequences were obtained from the UniGene database. From the UniGene cDNAs, we chose those that, according to the annotation, presumably contain mature protein-coding sequences (CDSs). For O. sativa, a 32 k full-length cDNA clone set and coding sequence information were obtained from the Laboratory of Gene Expression, Department of Molecular Genetics, National Institute of Agrobiological Sciences (Kikuchi et al., 2003; ftp://cdna01.dna.affrc.go.jp/pub/data/CURRENT).
The genomic sequences of H. sapiens, M. musculus, D. melanogaster, and A. thaliana were obtained from NCBI (ftp://ftp.ncbi.nih.gov/genomes/). The genomic sequences of C. elegans and the draft contigs of O. sativa were obtained from the Sanger Center (ftp://ftp.sanger.ac.uk/pub/) and the TIGR Institute (ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotaion_dbs/pseudomolecules/version_3.0/), respectively.

First, full-length cDNAs were mapped to the genome sequences with MEGABLAST. The mapping data were then converted into bit arrays, splicing patterns were detected, and the patterns were classified into types.

676,111 entries
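The analysis method for this entry mentions converting mapping data into bit arrays before detecting splicing patterns. The following is a minimal sketch of that general idea, not the ASTRA implementation: the function names, the coordinate convention (0-based, end-exclusive), and the use of a Python integer as the bit array are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical representation: one aligned block (exon) of a cDNA on the genome,
# given as (start, end) with 0-based, end-exclusive coordinates.
Block = Tuple[int, int]


def blocks_to_bits(blocks: List[Block], locus_start: int, locus_end: int) -> int:
    """Encode aligned blocks as a bit array stored in a Python int.

    Bit i is set when genomic position locus_start + i is covered by a block.
    """
    bits = 0
    for start, end in blocks:
        lo = max(start, locus_start) - locus_start
        hi = min(end, locus_end) - locus_start
        if hi > lo:
            bits |= ((1 << (hi - lo)) - 1) << lo
    return bits


def differing_regions(a: int, b: int, length: int) -> List[Block]:
    """Return locus-relative runs of positions where two bit arrays disagree."""
    diff = a ^ b
    regions: List[Block] = []
    run_start = None
    for i in range(length + 1):
        bit = (diff >> i) & 1 if i < length else 0
        if bit and run_start is None:
            run_start = i
        elif not bit and run_start is not None:
            regions.append((run_start, i))
            run_start = None
    return regions


# Two made-up cDNAs mapped to the same 50-base locus; the second skips one exon.
cdna_a = blocks_to_bits([(0, 10), (20, 30), (40, 50)], 0, 50)
cdna_b = blocks_to_bits([(0, 10), (40, 50)], 0, 50)
print(differing_regions(cdna_a, cdna_b, 50))
```

Once each cDNA is a bit array, comparing two variants reduces to an XOR. Each run of differing positions is a candidate region where the splicing patterns diverge and could then be assigned to a type.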
Locus ASTRA 10.18908/lsdba.nbdc00371-001

List of loci and the splicing patterns in each locus

astra_locus.zip
(887 KB)
http://togodb.biosciencedbc.jp/togodb/view/astra_locus#en

For the five organisms other than O. sativa (H. sapiens, M. musculus, D. melanogaster, A. thaliana, and C. elegans), cDNA sequences were obtained from the UniGene database. From the UniGene cDNAs, we chose those that, according to the annotation, presumably contain mature protein-coding sequences (CDSs). For O. sativa, a 32 k full-length cDNA clone set and coding sequence information were obtained from the Laboratory of Gene Expression, Department of Molecular Genetics, National Institute of Agrobiological Sciences (Kikuchi et al., 2003; ftp://cdna01.dna.affrc.go.jp/pub/data/CURRENT).
The genomic sequences of H. sapiens, M. musculus, D. melanogaster, and A. thaliana were obtained from NCBI (ftp://ftp.ncbi.nih.gov/genomes/). The genomic sequences of C. elegans and the draft contigs of O. sativa were obtained from the Sanger Center (ftp://ftp.sanger.ac.uk/pub/) and the TIGR Institute (ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotaion_dbs/pseudomolecules/version_3.0/), respectively.

Genes were identified by mapping between loci and cDNA annotations.

17,034 entries
Splicing pattern ASTRA 10.18908/lsdba.nbdc00371-004

The patterns of alternative splicing/transcriptional initiation

astra_splicing_pattern.zip
(1.2 MB)
http://togodb.biosciencedbc.jp/togodb/view/astra_splicing_pattern#en

For the five organisms other than O. sativa (H. sapiens, M. musculus, D. melanogaster, A. thaliana, and C. elegans), cDNA sequences were obtained from the UniGene database. From the UniGene cDNAs, we chose those that, according to the annotation, presumably contain mature protein-coding sequences (CDSs). For O. sativa, a 32 k full-length cDNA clone set and coding sequence information were obtained from the Laboratory of Gene Expression, Department of Molecular Genetics, National Institute of Agrobiological Sciences (Kikuchi et al., 2003; ftp://cdna01.dna.affrc.go.jp/pub/data/CURRENT).
The genomic sequences of H. sapiens, M. musculus, D. melanogaster, and A. thaliana were obtained from NCBI (ftp://ftp.ncbi.nih.gov/genomes/). The genomic sequences of C. elegans and the draft contigs of O. sativa were obtained from the Sanger Center (ftp://ftp.sanger.ac.uk/pub/) and the TIGR Institute (ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotaion_dbs/pseudomolecules/version_3.0/), respectively.

First, full-length cDNAs were mapped to the genome sequences with MEGABLAST. The mapping data were then converted into bit arrays, splicing patterns were detected, and the patterns were classified into types.

156,654 entries
Graphical abstract AT Atlas 10.18908/lsdba.nbdc01162-004

Graphical abstracts (flow charts) of the Technology Development projects of the Targeted Proteins Research Program (TPRP), drawn with Cell Illustrator. The graphics are in CSML format. Each project includes one to six subjects, and there is one CSML file per subject.

at_atlas_csml.zip
(1.22 MB)
-

-

-

15 entries
Image File AT Atlas 10.18908/lsdba.nbdc01162-005

Graphical abstracts (in PNG format) for the Technology Development projects of the Targeted Proteins Research Program (TPRP). Each project includes one to six subjects, and there is one PNG file per subject, originally drawn with Cell Illustrator.

at_atlas_png.zip
(5.37 MB)
-

-

-

15 entries