PLAY PODCASTS
Accelerated enzyme engineering by machine-learning guided cell-free expression
Episode 19

Accelerated enzyme engineering by machine-learning guided cell-free expression

Science TLDR · Raymond Ruff

January 23, 202518m 41s

Audio is streamed directly from the publisher (content.rss.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Show Notes

DOI: https://doi.org/10.1038/s41467-024-55399-0

Central Idea: This paper presents an ML-guided platform for accelerating enzyme engineering by combining cell-free protein synthesis, functional screening, and machine learning to rapidly optimize enzymes for multiple distinct chemical reactions. The approach is demonstrated by engineering variants of an amide synthetase (McbA) to improve synthesis of various pharmaceutical compounds.

Key Concepts:

Cell-Free Platform Integration:

- Combines cell-free DNA assembly, protein expression, and activity screening

- Enables rapid testing of enzyme variants without time-consuming cloning steps

- Complete process from DNA design to activity testing takes hours instead of weeks

Machine Learning Strategy:

- Uses single mutation data to predict beneficial higher-order mutations

- Employs augmented ridge regression models with evolutionary predictions

- Successfully identifies improved enzyme variants with reduced screening burden

- Model training requires minimal computational resources (runs on standard CPU)

Engineering Campaign Results:

- Engineered 9 specialized McbA variants for different pharmaceutical compounds

- Achieved 1.6 to 42-fold improvements in activity over wild-type enzyme

- Maintained important properties like stereo- and regioselectivity

- Generated comprehensive dataset: 2,856 enzyme variants tested across 12,584 reactions

Exemplified Applications:

Moclobemide Synthesis:

- Achieved 96% conversion (42-fold improvement)

- Demonstrated scalability to milligram quantities

- Maintained enzyme stability while improving activity

Multi-Product Engineering:

- Parallel optimization for 6 different pharmaceutical products

- Each campaign completed in approximately one week

- Successful prediction of beneficial mutations across diverse substrates

Further Research/Challenges:

Model Improvements:

- Exploring more sophisticated ML models for complex fitness landscapes

- Incorporating additional data types (kinetics, stability) into predictions

- Better understanding of mutation effects on substrate specificity

Screening Bottlenecks:

- Development of higher-throughput analytical methods

- Integration with selection-based approaches where applicable

- Handling products requiring complex detection methods

Broader Applications:

- Extending approach to other enzyme classes and reaction types

- Scaling to industrial-relevant conditions

- Integration with de novo protein design methods

This work represents a significant advance in making enzyme engineering more accessible and efficient, with potential impact on biocatalyst development for pharmaceutical and chemical manufacturing.