First of all, please read the license of this database.
Data names and data descriptions refer to the downloadable data on this page. They might not correspond to the contents of the original database.
Click the links in the "Data name" column for descriptions of the data.
DBCLS converts the original data to CSV files.
Note: These CSV files cannot be opened with MS Excel, because the amount of data or the number of characters in a cell exceeds the maximum that MS Excel supports.
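Since a spreadsheet cannot open these files, one option is to read them row by row from a script. The following is a minimal sketch in Python, not an official DBCLS tool. It assumes each zip archive holds a single UTF-8 encoded CSV file; the function names `raise_field_limit` and `iter_csv_rows` are hypothetical helpers introduced for illustration. Python's `csv` module also has a per-field size limit, so the sketch raises it before reading.

```python
import csv
import io
import sys
import zipfile


def raise_field_limit():
    """Raise the csv module's per-field size limit as far as the platform allows."""
    limit = sys.maxsize
    while True:
        try:
            csv.field_size_limit(limit)
            return limit
        except OverflowError:
            # Some platforms reject very large values; halve and retry.
            limit //= 2


def iter_csv_rows(zip_path, member=None):
    """Yield rows (lists of strings) from a CSV file inside a zip archive.

    Assumption: the archive member is UTF-8 text. If `member` is None,
    the first file in the archive is used.
    """
    raise_field_limit()
    with zipfile.ZipFile(zip_path) as archive:
        name = member or archive.namelist()[0]
        with archive.open(name) as raw:
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            # Rows are streamed, so the whole file is never held in memory.
            yield from csv.reader(text)
```

For example, `for row in iter_csv_rows("gclust_cluster.zip"): ...` would process one row at a time, provided the assumptions above hold for the downloaded file.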
# | Data name | File | Simple search and download |
---|---|---|---|
1 | README | README_e.html | - |
2 | Amino acid sequences of predicted proteins and their annotation for 95 organism species. | gclust_seq.zip (152MB) | Simple search and download |
3 | Cluster based on sequence comparison of homologous proteins of 95 organism species | gclust_cluster.zip (8.72MB) | Simple search and download |
4 | Proteins in similarity relationship with the cluster | gclust_related.zip (69MB) | - |
The following files can be downloaded from the original site.
* Note that opening huge text files in a text editor takes a long time.
# | Data name | File | Simple search and download |
---|---|---|---|
5 | Amino acid sequences used for clustering (Multi FASTA format) | all95.fa.zip (161MB) | - |
6 | Sequence ID and annotation information | all95.p.table.zip (7.28MB) | - |
7 | Prefix list for each organism | prefix_all95 (1KB) | - |
8 | Designation of organism group | grp_def1 (1KB) | - |
9 | Parameters for Organism Grouping | pat_def1 (1KB) | - |
10 | Clustering results | all95m8.hom.1.zip (140MB) | - |
11 | Table of Cluster and Organism Species Number | all95.tbl.zip (4.53MB) | - |
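Because the sequence file is in Multi FASTA format and is too large to open comfortably in an editor, it can be read record by record instead. The sketch below is a generic FASTA reader in Python, offered as an illustration only. It assumes the conventional layout (a header line starting with `>` followed by one or more sequence lines); the helper name `iter_fasta` and the sample records are made up for the example and are not taken from the database.

```python
def iter_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of multi-FASTA text lines."""
    header = None
    chunks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    # Emit the final record.
    if header is not None:
        yield header, "".join(chunks)


# Made-up sample records, for illustration only.
sample = [">seq1 example protein", "MKVL", "LAAT", ">seq2", "MTQ"]
records = list(iter_fasta(sample))
```

An open file handle can be passed in place of the list, so a large unzipped file is processed one line at a time.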
The FTP server is sometimes congested. If it is, access [here].