Frequently Asked Questions
How can I access to the data?
Our website is located at our GitHub repository and all the data is freely available.
How did conservation scores calculated?
We calculate conservation percentage by removing gaps and applying a similarity measure. For each position, we first calculate the gap percentage. If the gaps are the most frequent, we assinging 0% conservation to the corresponding position. If not we count the most frequent amino acid and add other amino acid(s) that give a BLOSUM80 score greater than 1. The count of similar amino acids are divided by the number of non-gap sequences.
What is the header format for protein sequences?
sp: Swiss-Prot / tr: TrEMBL|Uniprot Protein ID|Gene Name _ Species Name|Tax ID