AI trained on bacterial genomes produces never-before-seen proteins
AI systems have recently had a lot of success in one key aspect of biology: the relationship between a proteinβs structure and its function. These efforts have included the ability to predict the structure of most proteins and to design proteins structured so that they perform useful functions. But all of these efforts are focused on the proteins and amino acids that build them.
But biology doesnβt generate new proteins at that level. Instead, changes have to take place in nucleic acids before eventually making their presence felt via proteins. And information at the DNA level is fairly removed from proteins, with lots of critical non-coding sequences, redundancy, and a fair degree of flexibility. Itβs not necessarily obvious that learning the organization of a genome would help an AI system figure out how to make functional proteins.
But it now seems like using bacterial genomes for the training can help develop a system that can predict proteins, some of which donβt look like anything weβve ever seen before.


Β© CHRISTOPH BURGSTEDT/SCIENCE PHOTO LIBRARY