ResearchChannel - Automatic Identification and Classification of Protein Domains

Automatic Identification and Classification of Protein Domains

Produced by:
Microsoft Research

04/18/2005

Description: 
Among their many other roles, proteins are the scaffolds, workhorses, and computational devices of all organisms. For many practical purposes, a protein is a string of characters in a 20-letter alphabet. These characters represent amino acids. A protein's 3D structure and biological function depend on its sequence of amino acids.
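
As a small illustration of the "string over a 20-letter alphabet" view, the sketch below checks whether a string uses only the 20 standard one-letter amino acid codes and counts how often each letter occurs. The function names `is_protein_sequence` and `composition` and the example sequences are hypothetical and chosen only for illustration; they are not taken from the talk.

```python
# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = frozenset("ACDEFGHIKLMNPQRSTVWY")


def is_protein_sequence(seq):
    """Return True if seq is non-empty and uses only the 20 standard letters."""
    return bool(seq) and all(ch in AMINO_ACIDS for ch in seq.upper())


def composition(seq):
    """Count how often each amino acid letter occurs in seq."""
    counts = {}
    for ch in seq.upper():
        counts[ch] = counts.get(ch, 0) + 1
    return counts


if __name__ == "__main__":
    example = "MKTAYIAKQR"  # made-up sequence, for illustration only
    print(is_protein_sequence(example))
    print(composition(example))
```

Real sequence databases also use a few extra letters for ambiguous or rare residues; this sketch deliberately rejects them to keep the alphabet at exactly 20.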

Proteins are typically composed of several functional subunits, called domains. These subunits have relatively autonomous function. Domains are shuffled through evolution in a mix-and-match process in which new proteins are created as combinations of existing domains.

Recent technological advances and genome sequencing projects have given us a very large number of protein sequences to analyze. However, our knowledge about higher properties of proteins, such as their shape and function, is scarce, since it is much harder to derive such information experimentally.

We are still far from being able to deduce a protein's structure or function from its sequence. We approach the problem of deducing structure/function from sequence using homology modeling. The basic idea is to infer a protein's higher properties from those of other proteins that have similar sequences. Because of the mix-and-match process mentioned earlier, we believe it is best to apply the homology modeling scheme to the domains of a protein. However, even parsing a protein sequence into its domains is still an unsolved problem.
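
The idea of inferring a property from similar sequences can be sketched as a nearest-neighbor lookup. This is a toy under stated assumptions, not the method presented in the talk: `difflib.SequenceMatcher` from the Python standard library stands in for a proper sequence alignment score, and the function name `transfer_annotation`, the `min_similarity` cutoff, and the annotated sequences are all hypothetical.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Crude similarity in [0, 1]; a stand-in for a real alignment score."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def transfer_annotation(query, annotated, min_similarity=0.5):
    """Return the annotation of the most similar known sequence.

    annotated maps sequence -> annotation. Returns None when no known
    sequence reaches min_similarity (an assumed, arbitrary cutoff).
    """
    best_label, best_score = None, min_similarity
    for seq, label in annotated.items():
        score = similarity(query, seq)
        if score >= best_score:
            best_label, best_score = label, score
    return best_label


if __name__ == "__main__":
    # Made-up sequences and labels, for illustration only.
    known = {"MKTAYIAKQR": "family A", "GGSGGSGGSG": "family B"}
    print(transfer_annotation("MKTAYIAKQK", known))
```

Applying such a lookup to whole proteins is exactly what the mix-and-match argument warns against: a match on one domain says little about the rest of the chain, which is why the lookup would ideally be run per domain.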

I will present a process we have developed for the identification and classification of protein domains in a comprehensive database of protein sequences. Our process combines methodologies of sequence similarity identification, graph-based clustering, machine learning, statistical modeling, and iterative refinement. We achieve state-of-the-art results, recovering 63% of the known domain families and suggesting new families with about 40% fidelity.
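
To make the phrase "graph-based clustering" of sequence similarities concrete, here is a minimal sketch. It is not the process described above: it assumes a simple k-mer Jaccard similarity, an arbitrary threshold for drawing edges, and connected components as the clusters. The names `kmers`, `jaccard`, and `cluster`, and the example sequences, are hypothetical.

```python
from itertools import combinations


def kmers(seq, k=3):
    """Set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b, k=3):
    """Jaccard similarity of the k-mer sets of two sequences."""
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    return len(ka & kb) / len(union) if union else 0.0


def cluster(seqs, threshold=0.5, k=3):
    """Group sequence indices into connected components of the graph
    whose edges join pairs with similarity >= threshold."""
    parent = list(range(len(seqs)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(seqs)), 2):
        if jaccard(seqs[i], seqs[j], k) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(seqs)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


if __name__ == "__main__":
    # Made-up sequences, for illustration only.
    seqs = ["MKTAYIAK", "MKTAYIAR", "GGSGGSGG", "GGSGGSGA"]
    print(cluster(seqs))
```

Connected components are the bluntest possible choice: one spurious edge merges two families. That weakness is one reason a realistic pipeline would add statistical modeling and iterative refinement on top of the raw similarity graph.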

This is joint work with Michal Linial and Nati Linial.

Speaker(s):
Elon Portugaly, Ph.D. student, Hebrew University of Jerusalem

Runtime: 01:20:57

Rating: TV-G

