Assignment 1. Protein structure analysis.
1) (20 points) Select four proteins with known structures which belong to the same family and have pairwise sequence identity of no more than 70%. The length of each protein should be less than 180 residues and structure resolution better than 1.8Å.
2) (20 points) Construct a hydrophobicity profile and a secondary structure propensity profile for all four proteins. For one of the proteins compare profile values with residue positions in the protein and their secondary structures.
3) (20 points) Build a multiple sequence alignment and all possible pairwise structure alignments for these proteins. Plot the values of structure similarity against sequence similarity for all pairs. Evaluate and discuss correlation between sequence and structure similarity.
4) (20 points) Locate one fragment of high sequence similarity and one fragment of low sequence similarity in the alignment form (2). Investigate correlations between structure and sequence similarity for these fragments. Describe a possible role of the secondary structure content in these correlations.
5) (20 points) Identify functionally important site(s) in four proteins. Describe their biological role and their relationship to the structure conservation.
The report should be submitted by email as a Word or PDF file with the filename "b731_25_hw1_Your_Name.doc or .pdf". The string "b731_25_hw1_Your_Name" should be also included in the message subject line.
Due by 11:59 pm on October 23, 2025 October 26, 2025.
Late submissions are subject to penalty, 10 points per day.
============================
============================
Assignment 2. Protein modeling.
1) (20 points) Select a sequence of a prokaryotic enzyme from one of the protein sequence databases that has a homologous experimentally determined X-ray structure with at least 30% sequence identity, but does not have a homolog with known structure and more than 60% sequence identity. Explain the algorithm you used to select your sequence. If you believe that this algorithm can be improved, discuss the ways to improve it. (Recommended length of the selected sequence is 150-250 residues).
2) (20 points) Predict the three-dimensional structure for the selected sequence using at least two methods, e.g., fold recognition and homology modeling (Phyre2, I-TASSER, RaptorX, SwissModel, Modweb, etc.).
3) (20 points) Analyze the quality of your homology model using one of the structure validation or verification tools.
4) (20 points) Compare the structure of your model with the model structures of the same protein from AlphaFold Protein Structure Database and ESM Metagenomic Atlas. Describe the results of this comparison. (If models of your protein are not available in one or both databases, use the models of protein closest to yours).
5) (20 points) Visualize the results of comparison from Part (4), highlighting the differences between the structures (if any). If there are no differences, highlight any other interesting feature of the structures.
The report should be submitted by email as a Word or PDF file with the filename "b731_25_hw2_Your_Name.doc or .pdf". The string "b731_25_hw2_Your_Name" should be also included in the message subject line.
Due December 5, 2025.