List of Data Metadata
Showing 116 to 120 of 416 entries
Data name Database name DOI Description of data contents Data file Simple search URL Data acquisition method Data analysis method Number of data entries Data detail
Supplementary Data fRNAdb 10.18908/lsdba.nbdc00452-006

- xref.zip: Entry ID, Accession, Mapping Position in genome (only Homo sapiens)
- count_taxon.tsv: list of all species and the number of entries in fRNAdb

Supplementary_Data
(3.8 MB)
-

-

-

2 files
Amino acid sequences of predicted proteins and their annotation for 95 organism species. Gclust Server 10.18908/lsdba.nbdc00464-001

Amino acid sequences of predicted proteins and their annotation for 95 organism species. The data are given in a CSV format text file.

gclust_seq.zip
(152MB)
http://togodb.biosciencedbc.jp/togodb/view/gclust_seq#en

Protein sequences of a total of 95 organism species were obtained from NCBI, JGI and CGP.

-

698,557 entries
Amino acid sequences used for clustering (Multi FASTA format) Gclust Server 10.18908/lsdba.nbdc00464-004

Amino acid sequences of predicted proteins and their annotation for 95 organism species. The data are given in a multi-FASTA format file.

all95.fa.zip
(161MB)
-

-

-

-
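A multi-FASTA file such as the one listed above can be read record by record with a small parser. The following is a minimal sketch in Python, not part of the Gclust distribution: the function name `parse_fasta` and the two sample records are hypothetical, and the header layout of the real file is not described in this listing.

```python
from typing import Iterable, Iterator, Tuple


def parse_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from the lines of a multi-FASTA text."""
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            # Sequence lines may be wrapped; collect them for joining.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


# Hypothetical two-record sample; real records come from the downloaded file.
sample = """>seq1 hypothetical protein
MKTAYIAKQR
QISFVKSHFS
>seq2 another hypothetical protein
MADEEKLPPG
"""
records = list(parse_fasta(sample.splitlines()))
```

To apply this to the downloaded archive, the extracted FASTA file can be opened as text and the file object passed directly to `parse_fasta`. The name of the file inside the archive is not given in this listing, so it has to be checked after extraction.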
Cluster based on sequence comparison of homologous proteins of 95 organism species Gclust Server 10.18908/lsdba.nbdc00464-002

Clustering is based on an all-against-all (round-robin) BLASTP search of the above amino acid sequence data, taking into account the E-value and the overlap score, with heuristic estimation of a similarity threshold for homologs of each protein by the entropy-optimized organism count method (Bioinformatics 2009 Mar 1;25(5):599-605). The data are given in a CSV format text file.

gclust_cluster.zip
(8.72MB)
http://togodb.biosciencedbc.jp/togodb/view/gclust_cluster#en

Sequence data stated in "Amino acid sequences of predicted proteins and their annotation for 95 organism species".

All-against-all BLASTP search of the above amino acid sequence data, and heuristic estimation of a similarity threshold for homologs of each protein by the entropy-optimized organism count method (Bioinformatics 2009 Mar 1;25(5):599-605).

206,764 entries
Clustering results Gclust Server 10.18908/lsdba.nbdc00464-009

Results of running the Gclust program. The data include the requirements for running the program, the cluster ID, the threshold used for cluster grouping, the IDs of the sequences belonging to each cluster, and the sequence IDs of the related group.

all95m8.hom.1.zip
(140MB)
-

-

-

-