Homework 2009 (due in one week - January 29th)

1) Download the protein PDB file: 2B4C.

2) write a interactive I/O perl script that:
   Extracts (in any order a-d): 
   a) the TITLE lines for this protein,
   b) the RESOLUTION of the structure
   c) the date this structure was deposited (look on the HEADER line),
   d) and for all CHAINS: the following info:
       1) The CHAIN letter,
       2) Residue name and the sequence number
       3) The number of atoms for each residue
   Prints out all above information either to a file or to the screen.

Note that you can "capture" a screen print with:
"perl your_script.pl > output.file"



The output should look something like:

TITLE    CRYSTAL STRUCTURE OF HIV-1 JR-FL GP120 CORE PROTEIN                  
TITLE    2 CONTAINING THE THIRD VARIABLE REGION (V3) COMPLEXED WITH            
TITLE    3 CD4 AND THE X5 ANTIBODY

RESOLUTION. 3.30 ANGSTROMS

DEPOSIT DATE 23 SEP 05

G       VAL-84  7
G       VAL-85  7
G       LEU-86  8
G       GLU-87  9
G       ASN-88  8
G       VAL-89  7
G       THR-90  7
G       GLU-91  9
G       HIS-92  10
G       PHE-93  11
G       ASN-94  8
G       MET-95  8
G       TRP-96  14
G       LYS-97  9
G       ASN-98  8
G       ASP-99  8
G       MET-100 8
G       VAL-101 7
G       GLU-102 9
G       GLN-103 9
G       MET-104 8
G       GLN-105 9
G       GLU-106 9
.
.
C       LYS-1   9
C       LYS-2   9
C       VAL-3   7
.
.


Please take a look at 2005's homework, key, PDB file and output. It might be helpful to your homework.