Evolution guided atomistic design - a new paradigm for design of function

Our mission is to enable reliable and completely computational design of efficient, selective, and stable protein binders and enzymes. To achieve this goal, we are developing a unique strategy called evolution-guided atomistic design that uses information encoded in the evolutionary history of protein families to infer what structure and sequence features are likely to be tolerated in any given protein. We then use these rules to guide Rosetta atomistic design calculations in the search for new proteins with desired functions. To test our algorithms, we design new proteins that don't exist in nature and carry out wet-lab experiments either in-house or with our collaborators. Feedback from these experiments enables the development of more sophisticated design algorithms. We therefore combine cutting-edge computational methods development with high-throughput experimental screening and stringent biochemical and structural analysis of designed proteins.

Applied protein optimisation

Natural and laboratory selection can optimise protein activities. But evolution is an iterative process in which every change in a protein (mutation) must result in a variant that is at least as functional as its predecessor or it would be purged by the powerful forces of selection, and lab evolution experiments may take years of tedious trial and error. We developed a suite of computational design algorithms to address the major problems of protein optimisation, including improving stability and protein expressibility, binding affinity and catalytic efficiency, and specificity. Our methods have had a central role in optimising very challenging enzymes and binding proteins for therapeutics, vaccines, and green chemistry applications. 

Designing new protein structures

Control over protein activity demands control over the protein backbone structure, but the backbone has numerous degrees of freedom and design of new backbones in protein active sites has been a notoriously difficult problem. By combining information from naturally occurring structures and sequences with atomistic design calculations, we developed a new approach for backbone design in active sites. Initially, we implemented this strategy to design new antibodies, and therefore called the method AbDesign. Encouraged by the method's success in designing atomically accurate new antibodies with over 50 mutations from any naturally occurring antibody, we next applied this method to the design of high-efficiency new enzymes and a large network of interacting pairs of proteins that exhibited ultrahigh specificity binding. Thus, evolution-guided atomistic design provides exquisite control over protein structure, stability, and activity.

Combinatorial assembly and design: a new paradigm for designing vast functional diversity

The holy grail of protein design methodology is to enable the complete computational design of any arbitrarily chosen biomolecular activity. To enable such template-free design of function, we still need to learn a lot more about how function is encoded in proteins. We are therefore developing methods to design, not a handful of binders or enzymes as in all current protein design methods, but vast repertoires comprising millions of substantially different variants. We use high-throughput screening methods to isolate the functional designs and deep sequencing analysis to fully characterise these designs. Next, advanced machine-learning methods are trained to find molecular features that discriminate the best designs from the rest, and these features are then used to improve the design algorithms, leading to a continuous, unbiased and systematic approach to learn the rules for designing new biomolecular activities. We are applying this strategy to the design of new hydrolytic enzymes, single-domain camelid antibodies, fluorescent proteins, and therapeutic antibodies. Already, our methods have been able to design thousands of diverse and active enzymes and fluorescent proteins. These methods will enable rapid and effective discovery and optimisation of biomolecular activities in protein engineering, synthetic biology, and biotherapeutic design.