The GPM is an experimental project to create knowledge from proteomics data & reuse it solve biomedical research problems.
Analyze the structure of the database you downloaded, as well as the data in it, and write down the different table names and their structure. This will be useful in the following questions.
The “expect” value of a protein is the base-10 log of the expectation that any particular peptide assignment was made at random (E-value).
Find out from which institution the data for this project comes.
In what table and what field can you get the sequences of the proteins?
Make a copy of the database, and perform the following operations on it:
Modify the institution name that you found earlier to say “McGill University’s School of Computer Science”