Learning to Engineer Protein Flexibility

Authors

Kouba, P., Planas-Iglesias, J., Damborsky, J., Sedlar, J., Mazurenko, S., Sivic, J.

Source

ICLR: 1-23 (2025)

Abstract

Generative machine learning models are increasingly being used to design novel proteins. However, their major limitation is the inability to account for protein flexibility, a property crucial for protein function. Learning to engineer flexibility is difficult because the relevant data is scarce, heterogeneous, and costly to obtain using computational and experimental methods. Our contributions are three-fold. First, we perform a comprehensive comparison of methods for evaluating protein flexibility and identify relevant data for learning. Second, we overcome the data scarcity issue by leveraging a pre-trained protein language model. We design and train flexibility predictors utilizing either only sequential or both sequential and structural information on the input. Third, we introduce a method for fine-tuning a protein inverse folding model to make it steerable toward desired flexibility at specified regions. We demonstrate that our method Flexpert enables guidance of inverse folding models toward increased flexibility. This opens up a transformative possibility of engineering protein flexibility.

Full text

Citation

Kouba, P., Planas-Iglesias, J., Damborsky, J., Sedlar, J., Mazurenko, S., Sivic, J., 2025: Learning to Engineer Protein Flexibility. ICLR: 1-23.

Amalie Receives Awards at the Conference of Young Life Scientists
Success at ISBC & ISLS 2026 in Thailand
Press Release: New Algorithm Helps Scientists Reduce Protein Aggregation
Honza Successfully Defends His PhD Thesis
Interview with Naina
David Received MU Rector’s Award
Hana Received Student Award at Interdisciplinary Meeting