MASIA Project

MASIA (Multiple Aligned Sequences Investigation and Analysis) is a program with GUI to search for consistent patterns in multiple aligned sequences. Predictions of secondary structures and inside/outside properties of residues at each position in an aligned sequences are based on generalized rules for globular proteins, which are derived from observations of known 3D-structures of proteins. The secondary and tertiary structure of a protein is related to chemical characteristics of the individual amino acid residues, but a clear picture of the secondary structure may not be apparent for one protein sequence alone. Comparing many aligned, related sequences can reveal patterns of sequence conservation that indicate the location of residues essential for the function, folding or solubility of the protein.

The rules are manipulated as corresponding combinations of commands to predict specific properties. Users can easily extend or create new rules for their specific purposes. Before using MASIA, the best possible alignment of the protein with other proteins in the SWISS-PROT data base should be obtained. MASIA uses as input files generated with PileUP or ClustalW. A recent addition to MASIA is a physical-chemical property (PCP) based motif detection procedure. This procedure helps to detect subtle motifs that are conserved in physical-chemical properties.

Acknowledge

This project is supported by the Department of Energy (Grant DE-FG03-96ER62267 & DE-F603-00ER63041) and the Sealy and Smith Foundation.

Publications