1. Poplin R, Chang P-C, Alexander D, Schwartz S, Colthurst T, Ku A, et al. A universal SNP and small-indel variant caller using deep neural networks. Nature Biotechnology. 2018;36:983–7. doi:
10.1038/nbt.4235.
2. Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–9. doi:
10.1038/s41586-021-03819-2.
3. Ji Y, Zhou Z, Liu H, Davuluri RV. DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome. Bioinformatics. 2021;37:2112–20. doi:
10.1093/bioinformatics/btab083.
4. Zhou Z, Ji Y, Li W, Dutta P, Davuluri R, Liu H. DNABERT-2: Efficient foundation model and benchmark for multi-species genome. 2023. doi:
10.48550/ARXIV.2306.15006.
5. Zhou Z, Wu W, Ho H, Wang J, Shi L, Davuluri RV, et al. DNABERT-s: Pioneering species differentiation with species-aware DNA embeddings. 2024. doi:
10.48550/ARXIV.2402.08777.
6. Theodoris CV, Xiao L, Chopra A, Chaffin MD, Al Sayed ZR, Hill MC, et al. Transfer learning enables predictions in network biology. Nature. 2023;618:616–24. doi:
10.1038/s41586-023-06139-9.
7. Cui H, Wang C, Maan H, Pang K, Luo F, Duan N, et al. scGPT: Toward building a foundation model for single-cell multi-omics using generative AI. Nature Methods. 2024;21:1470–80. doi:
10.1038/s41592-024-02201-0.
8. Abramson J, Adler J, Dunger J, Evans R, Green T, Pritzel A, et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature. 2024;630:493–500. doi:
10.1038/s41586-024-07487-w.
9. Nguyen E, Poli M, Durrant MG, Kang B, Katrekar D, Li DB, et al. Sequence modeling and design from molecular to genome scale with Evo. Science. 2024;386. doi:
10.1126/science.ado9336.
10. Brixi G, Durrant MG, Ku J, Naghipourfar M, Poli M, Sun G, et al. Genome modelling and design across all domains of life with Evo 2. Nature. 2026;652:1349–61. doi:
10.1038/s41586-026-10176-5.
11. Cui H, Tejada-Lapuerta A, Brbić M, Saez-Rodriguez J, Cristea S, Goodarzi H, et al. Towards multimodal foundation models in molecular cell biology. Nature. 2025;640:623–33. doi:
10.1038/s41586-025-08710-y.
12. Wenckstern J, Jain E, Vasilev K, Pariset M, Wicki A, Gut G, et al. AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery. 2025. doi:
10.48550/ARXIV.2501.06039.
13. Bunne C, Roohani Y, Rosen Y, Gupta A, Zhang X, Roed M, et al. How to build the virtual cell with artificial intelligence: Priorities and opportunities. Cell. 2024;187:7045–63. doi:
10.1016/j.cell.2024.11.015.
14. Candido S, Hayes T, Derry A, Rao R, Lin Z, Verkuil R, et al. Language modeling materializes a world model of protein biology. 2026.
http://dx.doi.org/10.64898/2026.06.03.729735.
15. Barbitoff YA, Abasov R, Tvorogova VE, Glotov AS, Predeus AV. Systematic benchmark of state-of-the-art variant calling pipelines identifies major factors affecting accuracy of coding sequence variant discovery. BMC Genomics. 2022;23. doi:
10.1186/s12864-022-08365-3.
16. Chen N-C, Kolesnikov A, Goel S, Yun T, Chang P-C, Carroll A. Improving variant calling using population data and deep learning. BMC Bioinformatics. 2023;24. doi:
10.1186/s12859-023-05294-0.
17. Bertoline LMF, Lima AN, Krieger JE, Teixeira SK. Before and after AlphaFold2: An overview of protein structure prediction. Frontiers in Bioinformatics. 2023;3. doi:
10.3389/fbinf.2023.1120370.
18. Lin Z, Akin H, Rao R, Hie B, Zhu Z, Lu W, et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science. 2023;379:1123–30. doi:
10.1126/science.ade2574.
19. Weissenow K, Heinzinger M, Rost B. Protein language-model embeddings for fast, accurate, and alignment-free protein structure prediction. Structure. 2022;30:1169–1177.e4. doi:
10.1016/j.str.2022.05.001.
20. Weissenow K, Heinzinger M, Steinegger M, Rost B. Ultra-fast protein structure prediction to capture effects of sequence variation in mutation movies. 2022.
http://dx.doi.org/10.1101/2022.11.14.516473.
21. Callaway E, Naddaf M. Move over, AlphaFold: open-source model predicts shape of 1 billion proteins. Nature. 2026;654:13–4. doi:
10.1038/d41586-026-01686-3.
22. Zhang H, Zhang Y, Kang Z, Xiong J, Yang R, Ning K. MGM as a Large
-Scale Pretrained Foundation Model for Microbiome Analyses in Diverse Contexts. Advanced Science. 2026;13. doi:
10.1002/advs.202513333.
23. Medearis NA, Zhu S, Zomorrodi AR. BiomeGPT: A foundation model for the human gut microbiome. 2026.
http://dx.doi.org/10.64898/2026.01.05.697599.
24. Treloar NJ, Ur-Rehman S, Yang J. Learning the language of the microbiome with transformers. 2026.
http://dx.doi.org/10.64898/2026.05.02.722381.