Foundation models in biological and chemical domains

A recent review, Scientific Large Language Models: A Survey on Biological & Chemical Domains, comprehensively surveyed the advancements in large foundation models in biological and chemical domains, including models for text, small molecules, proteins, and genomic sequences. The review outlines the models, datasets, and evaluation methods for each of these data types. The 75-page review provides a comprehensive and valuable resource for researchers navigating the landscape.

With hundreds of large biological and chemical models being developed, it seems the field has achieved a lot. However, as the review pointed out, this area is still in its nascent stage. The authors highlight a few challenges, such as the lack of large-scale and high-quality training data, integration of domain-specific information into the model architecture, and reliable computational and experimental evaluation. The lack of high-quality data is probably the most critical issue. After all, your results are only as good as your data.

By the way, the review unfortunately missed one important data type: RNA sequences (or transcriptomic data). Many RNA foundation models can be found in this RNA-FM GitHub repository. These models could be very useful for understanding gene functions, identifying drug targets, predicting RNA structures and RNA-protein interactions, and designing RNA-based therapeutics.

Featured papers

  • A review on de novo protein design

  • Schrödinger’s approach that integrates both AI-generated predictions and physics-based methods for drug design

  • A method, called MIDAS, for integrating single-cell multi-omics datasets from multiple resources

  • A machine learning model, ALBATROSS, for predicting 3D properties of disordered proteins

If you find the newsletter helpful, please consider:

  • 🔊 Sharing the newsletter with other people

  • 👍 Upvoting on Product Hunt

  • 📧 Sending any feedback, suggestions, and questions by directly replying to this email or writing reviews on Product Hunt

  • 🙏 Supporting us with a cup of coffee.

Thanks, and see you next time!

Join the conversation

or to participate.