| Data name ⇅ | Database name ⇅ | DOI ⇅ | Description of data contents ⇅ | Data file ⇅ | Simple search URL ⇅ | Data acquisition method ⇅ | Data analysis method ⇅ | Number of data entries ⇅ | Data detail |
|---|---|---|---|---|---|---|---|---|---|
| Supplementary Data | fRNAdb | 10.18908/lsdba.nbdc00452-006 |
- xref.zip: Entry ID, Accession, Mapping Position in genome (only Homo sapiens)
|
Supplementary_Data
(3.8 MB) |
- |
- |
- |
2 files |
Data detail
open_in_full
|
| Amino acid sequences of predicted proteins and their annotation for 95 organism species. | Gclust Server | 10.18908/lsdba.nbdc00464-001 |
Amino acid sequences of predicted proteins and their annotation for 95 organism species. The data are given in a CSV format text file. |
gclust_seq.zip
(152MB) |
http://togodb.biosciencedbc.jp/togodb/view/gclust_seq#en |
Protein sequences of a total of 95 organism species were obtained from NCBI, JGI and CGP. |
- |
698,557 entries |
Data detail
open_in_full
|
| Amino acid sequences used for clusterintg (Multi FASTA format) | Gclust Server | 10.18908/lsdba.nbdc00464-004 |
Amino acid sequences of predicted proteins and their annotation for 95 organism species. FASTA format file. |
all95.fa.zip
(161MB) |
- |
- |
- |
- |
Data detail
open_in_full
|
| Cluster based on sequence comparison of homologous proteins of 95 organism species | Gclust Server | 10.18908/lsdba.nbdc00464-002 |
Clustering was performed by the method in which the round-robin BLAST search of the above amino acid sequence data is performed, the E-value and the overlap score (the All-against-all BLASTP search of the above amino acid sequence data, and heuristic estimation of a similarity threshold for homologs of each protein by entropy-optimized organism count method (Bioinformatics 2009 Mar 1;25(5):599-605.). The data are given in a CSV format text file. |
gclust_cluster.zip
(8.72MB) |
http://togodb.biosciencedbc.jp/togodb/view/gclust_cluster#en |
Sequence data stated in "Amino acid sequences of predicted proteins and their annotation for 95 organism species". |
All-against-all BLASTP search of the above amino acid sequence data, and heuristic estimation of a similarity threshold for homologs of each protein by entropy-optimized organism count method (Bioinformatics 2009 Mar 1;25(5):599-605.). |
206,764 entries |
Data detail
open_in_full
|
| Clustering results | Gclust Server | 10.18908/lsdba.nbdc00464-009 |
Results of running Gclust program. The data include such information as the requirements for running the program, the cluster ID, the threshold used for cluster grouping, the ID of the sequence belonging to the cluster and the sequence ID of the related group. |
all95m8.hom.1.zip
(140MB) |
- |
- |
- |
- |
Data detail
open_in_full
|
| Data name | Database name | DOI | Description of data contents | Data file | Simple search URL | Data acquisition method | Data analysis method | Number of data entries | Data detail |
List of Data Metadata