SMART Project Protein Structure and Folding: The WSSP has teamed up with the Center for BioMolecular Modeling to provide students with the opportunity of creating a physical model of a homolog of one of the proteins that was identified in the screen of the cDNA library. Only one model can be made for each high school. The models above were created from left to right by students from Colonia, West Windsor-Plainsboro South, East Brunswick, and Pingry High Schools for the 2008-2009 WSSP Poster Forum.

High Schools wishing to create a physical model of their protein need to provide the following information by e-mail to Andrew Vershon or Sue Coletta:

1. A .doc file with the following information
  a. School name
  b. Names of students participating in the project
  c. Name of teacher who worked with the students
  d. Protein Data Bank (PDB) file name (e.g. 1A3N.pdb)
  e. Name of the protein modeled (e.g. Hemoglobin)
  f. Primary citation associated with the pdb file (e.g. Tame, J.R., Vallone, B. (2000) The structures of deoxy human haemoglobin and the mutant Hb Tyralpha42His at 120 K. Acta Crystallogr.,Sect.D 56: 805-811)
  g. Name of the clone from the cDNA library you have annotated that corresponds to the protein you have modeled and the percent sequence identity and similarity over the homologous region. (e.g. Clone 20XY23.08 has 54% identity and 65% similarity with 120 residues region of the protein from the database.
  h. A brief description of the function of the protein you have modeled. What does the protein do in the cell or organism (e.g. glycolysis, protein translation, etc.)? Does it have a specific enzymatic activity (e.g. kinase, nuclease, etc)? What other proteins may it interact with? When is it likely expressed (e.g. in all cells all the time, just a subset of cells at specific times during development, etc)?
  i. A description of the color scheme you have used - including features you have highlighted. This may be a color scheme that: indicates identical or similar residues as predicted by the sequence alignments; the active site of the protein; structural components such as alpha helices and beta-strands; etc. If the protein sequence from the database is strongly conserved with your protein, you may want to highlight the differences.
2. RasMol Script file (e.g. 1A3N.spt)
3. Protein Data Bank (PDB) file (e.g. 1A3N.pdb)
4. A screen capture image (.gif or jpeg) of your model

To get started:

1. Students who have identified a clone from the cDNA library that codes for a full or partial protein should first search if a structure of a homologous protein has been determined by x-ray crystallography of NMR. This can be done by performing a NCBI BLASTP search with the predicted protein sequence from their clone and then examining if any of the matches that were identified in the Conserved Domain Database (CDD) contain structures (see lecture notes on CDD). Alternatively the students can search the Protein Data Bank (PDB) for the structure.

2. Perform a BLAST2 sequences alignment between the predicted protein sequence from the Artemia clone and the protein sequence of the protein you will be modeling. This alignment is helpful to identify conserved residues on the structure model.

3. Download the pdb file (e.g. 1A3N.pdb) with the coordinates of the protein structure and use a version of the RasMol program that has been designed to create the physical models using Rapid Prototyping technology.

4. Students should use the commands in RasMol to create a script file that will display the protein using the format features (spacefill, backbone, etc) and the colors they prefer. A color scheme may be used that indicates identical or similar residues as predicted by the sequence alignments; the active site of the protein; structural components such as alpha helices and beta-strands; etc. If the protein sequence from the database is strongly conserved with your protein, you may want to highlight the differences.

Note: Cartoons and strands don't build well.  Wireframe and ball and stick work well (see sizes below).  Spacefill doesn't really show the protein structure well, just the overall shape.  It depends on what you are trying to show.  A typical structure may show a backbone model with few selected sidechains in ball and stick.  

Standard sizes that build well:
backbone 300
wireframe 225
spacefill 275
hbonds 225

The commands wireframe and spacefill combined give a ball and stick.

Also, it is important when doing sidechains to use the command:
select *** and (sidechain or alpha)
where *** is the amino acid number
Otherwise you get a bumpy backbone.

There's a new tutorial at: http://cbm.msoe.edu/includes/jmol/rasmol.php (only available in IE).  It covers all the important information for building a model.

RasMol Training Guide - Below are resources to help with creating a model using RasMol. The RasMol Training Guide was developed by Shannon Colton at the Center for BioMolecular Modeling. Additional information can be found on the CBM RasMol Resource Page

RasMol Training Guide Cover doc
History of Molecular Visualization doc
RasMol Training Guide Section I - Basics, opening files, display functions, coloring, selecting doc
RasMol Training Guide Section II - PDB, scripts doc
RasMol Training Guide Section III doc
Appendix for RasMol Training with a Mac doc
Acknowledgements doc
RasMol Reference Card doc
Zinc finger pdb file (Example structure used in the training guide) pdb
Download RasMol (Free at RasMol.org) html
Download RasMol from CBM Windows or Mac - Use this program for creating the models! html