Crick, F. Central dogma of molecular biology. Nature 227, 561–563 (1970).
Google Scholar
Vaswani, A. et al. Consideration is all you want. Adv. Neural Inf. Course of. Syst. 30 (2017).
Naveed, H. et al. A complete overview of huge language fashions. ACM Trans. Intell. Syst. Technol. 16, 106 (2025).
Google Scholar
Scarselli, F., Gori, M., Tsoi, A. C., Hagenbuchner, M. & Monfardini, G. The graph neural community mannequin. IEEE Trans. Neural Netw. 20, 61–80 (2008).
Google Scholar
Ho, J., Jain, A. & Abbeel, P. Denoising diffusion probabilistic fashions. Adv. Neural Inf. Course of. Syst. 33, 6840–6851 (2020).
Wang, H. et al. Scientific discovery within the age of synthetic intelligence. Nature 620, 47–60 (2023).
Google Scholar
Jumper, J. et al. Extremely correct protein construction prediction with AlphaFold. Nature 596, 583–589 (2021).
Google Scholar
Abramson, J. et al. Correct construction prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500 (2024).
Google Scholar
Jiang, Okay. et al. Fast in silico directed evolution by a protein language mannequin with EVOLVEpro. Science 387, eadr6006 (2025).
Google Scholar
Brixi, G., Durrant, M.G., Ku, J. et al. Genome modelling and design throughout all domains of life with Evo 2. Nature (2026).
Nguyen, E. et al. Sequence modeling and design from molecular to genome scale with Evo. Science 386, eado9336 (2024).
Google Scholar
Zou, S. et al. A big-scale basis mannequin for RNA perform and construction prediction. Preprint at bioRxiv (2024).
Dalla-Torre, H. et al. Nucleotide Transformer: constructing and evaluating sturdy basis fashions for human genomics. Nat. Strategies 22, 287–297 (2025).
Google Scholar
Zhang, Z., Shen, W. X., Liu, Q. & Zitnik, M. Environment friendly technology of protein pockets with PocketGen. Nat. Mach. Intell. 6, 1382–1395 (2024).
Google Scholar
Ying, Okay. et al. MethylGPT: a basis mannequin for the DNA methylome. Preprint at bioRxiv (2024).
Bunne, C. et al. Easy methods to construct the digital cell with synthetic intelligence: priorities and alternatives. Cell 187, 7045–7063 (2024).
Google Scholar
Track, L., Segal, E. & Xing, E. Towards AI-driven digital organism: multiscale basis fashions for predicting, simulating and programming biology in any respect ranges. Prerpint at arXiv (2024).
He, Y. et al. Generalized organic basis mannequin with unified nucleic acid and protein language. Nat. Mach. Intell. 7, 942–953 (2025).
Google Scholar
Financial institution, P. D. Protein Knowledge Financial institution. Nat. New Biol. 233, 223 (1971).
Google Scholar
Topol, E. J. Studying the language of life with AI. Science 387, eadv4414 (2025).
Google Scholar
Service provider, A. T. et al. Semantic design of practical de novo genes from a genomic language mannequin. Nature 649, 749–758 (2026).
Google Scholar
King, S. H. et al. Generative design of novel bacteriophages with genome language fashions. Preprint at bioRxiv (2025).
Zhou, Z. et al. DNABERT-S: Pioneering species differentiation with species-aware DNA embeddings. Bioinformatics 41, i255–i264 (2025).
Google Scholar
Ellington, C. N. et al. Correct and normal DNA representations emerge from genome basis fashions at scale. Preprint at bioRxiv (2024).
Zhao, Q., Zhang, C. & Zhang, W. dnaGrinder: a light-weight and high-capacity genomic basis mannequin. Preprint at arXiv (2024).
Benegas, G., Albors, C., Aw, A. J., Ye, C. & Track, Y. S. A DNA language mannequin based mostly on multispecies alignment predicts the results of genome-wide variants. Nat. Biotechnol. 43, 1960–1965 (2025).
Google Scholar
Saberi, A. et al. A protracted-context RNA basis mannequin for predicting transcriptome structure. Preprint at bioRxiv (2024).
Tahmid, M. T. et al. BiRNA-BERT permits environment friendly RNA language modeling with adaptive tokenization. Commun. Biol. 8, 1621 (2025).
Google Scholar
Yu, H. et al. An interpretable RNA basis mannequin for exploring practical RNA motifs in vegetation. Nat. Mach. Intell. 6, 1616–1625 (2024).
Google Scholar
Yang, H. & Li, Okay. M. MP-RNA: unleashing multi-species RNA basis mannequin by way of calibrated secondary construction prediction. In Findings of the Affiliation for Computational Linguistics: EMNLP 2024 (eds Al-Onaizan, Y. et al.) 5278–5296 (Affiliation for Computational Linguistics, 2024).
Chen, J. et al. Interpretable RNA basis mannequin from unannotated information for extremely correct RNA construction and performance predictions. Preprint at arXiv (2022).
Zhang, Z. et al. RNAGenesis: basis mannequin for enhanced RNA sequence technology and structural insights. Preprint at bioRxiv (2024).
De Lima Camillo, L. P. et al. CpGPT: a basis mannequin for DNA methylation. Preprint at bioRxiv (2024).
Zhou, H. et al. A basis language mannequin to decipher various regulation of RNAs. Genome Biol. 26, 301 (2025).
Google Scholar
Linder, J., Srivastava, D., Yuan, H., Agarwal, V. & Kelley, D. R. Predicting RNA-seq protection from DNA sequence as a unifying mannequin of gene regulation. Nat. Genet. 57, 949–961 (2025).
Google Scholar
Fu, X. et al. A basis mannequin of transcription throughout human cell sorts. Nature 637, 965–973 (2025).
Google Scholar
Nijkamp, E., Ruffolo, J. A., Weinstein, E. N., Naik, N. & Madani, A. ProGen2: exploring the boundaries of protein language fashions. Cell Syst. 14, 968–978 (2023).
Google Scholar
Madani, A. et al. Massive language fashions generate practical protein sequences throughout various households. Nat. Biotechnol. 41, 1099–1106 (2023).
Google Scholar
Peng, F. Z. et al. PTM-Mamba: a PTM-aware protein language mannequin with bidirectional gated Mamba blocks. Nat. Strategies 22, 945–949 (2025).
Google Scholar
Zhang, Y., Bian, B. & Okumura, M. Hyena structure permits quick and environment friendly protein language modeling. IMetaOmics 2, e45 (2025).
Google Scholar
Zhuo, L. et al. ProtLLM: an interleaved protein-language LLM with protein-as-word pre-training. In Proc. 62nd Annual Assembly of the Affiliation for Computational Linguistics Vol. 1 (eds Ku, L.-W. et al.) 8950–8963 (Affiliation for Computational Linguistics, 2024).
Xu, M., Yuan, X., Miret, S. & Tang, J. ProtST: multi-modality studying of protein sequences and biomedical texts. In Proc. fortieth Int. Conf. Machine Studying (eds Krause, A. et al.) 38749–38767 (PMLR, 2023).
Queen, O. et al. ProCyon: a multimodal basis mannequin for protein phenotypes. Preprint at bioRxiv (2025).
Consortium UniProt. UniProt: the common protein knowledgebase in 2023. Nucleic Acids Res. 51, D523–D531 (2023).
Google Scholar
Xiong, P., Xu, H. & Zheng, H. Supervised contrastive studying results in extra cheap spectral embeddings. Anal. Chem. 97, 20137–20146 (2025).
Google Scholar
Zhang, H. et al. MSBERT: embedding tandem mass spectra into chemically rational house by masks studying and contrastive studying. Anal. Chem. 96, 16599–16608 (2024).
Google Scholar
Huber, F. et al. Spec2Vec: improved mass spectral similarity scoring by way of studying of structural relationships. PLoS Comput. Biol. 17, e1008724 (2021).
Google Scholar
Bushuiev, R. et al. Self-supervised studying of molecular representations from hundreds of thousands of tandem mass spectra utilizing DreaMS. Nat. Biotechnol. (2025).
Cui, H. et al. scGPT: towards constructing a basis mannequin for single-cell multi-omics utilizing generative AI. Nat. Strategies 21, 1470–1480 (2024).
Google Scholar
Hao, M. et al. Massive-scale basis mannequin on single-cell transcriptomics. Nat. Strategies 21, 1481–1491 (2024).
Google Scholar
Theodoris, C. V. et al. Switch studying permits predictions in community biology. Nature 618, 616–624 (2023).
Google Scholar
Kalfon, J., Samaran, J., Peyré, G. & Cantini, L. scPRINT: pre-training on 50 million cells permits sturdy gene community predictions. Nat. Commun. 16, 3607 (2025).
Google Scholar
Rizvi, S. A. et al. Scaling massive language fashions for next-generation single-cell evaluation. Preprint at bioRxiv (2025).
Wen, H. et al. Single cells are spatial tokens: transformers for spatial transcriptomic information imputation. Preprint at arXiv (2023).
Hao, M. et al. GeST: in direction of constructing a generative pretrained transformer for studying mobile spatial context. Proc. twentieth Machine Studying in Computational Biology Assembly 311, 1–11 (PMLR, 2025).
Tejada-Lapuerta, A. et al. Nicheformer: a basis mannequin for single-cell and spatial omics. Nat. Strategies 22, 2525–2538 (2025).
Google Scholar
Wang, C. X. et al. scGPT-spatial: continuous pretraining of single-cell basis mannequin for spatial transcriptomics. Preprint at bioRxiv (2025).
Yang, X. et al. GeneCompass: deciphering common gene regulatory mechanisms with a knowledge-informed cross-species basis mannequin. Cell Res. 34, 830–845 (2024).
Google Scholar
Hu, L. et al. RegFormer: a single-cell basis mannequin powered by gene regulatory hierarchies. Preprint at bioRxiv (2025).
Yang, Z. et al. Multiomic basis mannequin predicts epigenetic regulation by zero-shot. Preprint at bioRxiv (2024).
Lin, Z. et al. Evolutionary-scale prediction of atomic-level protein construction with a language mannequin. Science 379, 1123–1130 (2023).
Google Scholar
Hayes, T. et al. Simulating 500 million years of evolution with a language mannequin. Science 387, 850–858 (2025).
Google Scholar
Rosen, Y. et al. Common cell embeddings: a basis mannequin for cell biology. Preprint at bioRxiv (2023).
Baek, M. et al. Correct prediction of protein constructions and interactions utilizing a three-track neural community. Science 373, 871–876 (2021).
Google Scholar
Wang, W. et al. trRosettaRNA: automated prediction of RNA 3D construction with transformer community. Nat. Commun. 14, 7266 (2023).
Google Scholar
Shen, T. et al. Correct RNA 3D construction prediction utilizing a language model-based deep studying strategy. Nat. Strategies 21, 2287–2298 (2024).
Pearce, R., Omenn, G. S. & Zhang, Y. De novo RNA tertiary construction prediction at atomic decision utilizing geometric potentials from deep studying. Preprint at bioRxiv (2022).
Krishna, R. et al. Generalized biomolecular modeling and design with RoseTTAFold All-Atom. Science 384, eadl2528 (2024).
Wu, Okay. E. et al. Protein construction technology by way of folding diffusion. Nat. Commun. 15, 1059 (2024).
Google Scholar
Jing, B. et al. Eigenfold: generative protein construction prediction with diffusion fashions. In Workshop on Machine Studying for Drug Discovery (MLDD) on the eleventh Worldwide Convention on Studying Representations (ICLR) (eds. Notin, P. et al.) (2023).
Fu, C. et al. A latent diffusion mannequin for protein construction technology. In Proc. Second Studying on Graphs Convention (eds Villar, S. & Chamberlain, B.) 29:1–29:17 (PMLR, 2024).
Anand, N. & Achim, T. Protein construction and sequence technology with equivariant denoising diffusion probabilistic fashions. In Rao, R., Adler, J., Anand, N., Ingraham, J., Ovchinnikov, S. & Zhong, E. (eds) Workshop on Machine Studying in Structural Biology on the thirty sixth Convention on Neural Info Processing Methods (NeurIPS) (2022).
Wang, Z. et al. RNADiffFold: generative RNA secondary construction prediction utilizing discrete diffusion fashions. Transient. Bioinform. 26, bbae618 (2025).
Google Scholar
Fang, A., Zhang, Z., Zhou, A. & Zitnik, M. ATOMICA: studying common representations of intermolecular interactions. Preprint at bioRxiv (2025).
Watson, J. L. et al. De novo design of protein construction and performance with RFdiffusion. Nature 620, 1089–1100 (2023).
Google Scholar
Butcher, J. et al. De novo design of All-Atom biomolecular interactions with RFdiffusion3. Preprint at bioRxiv (2025).
Ahern, W. et al. Atom-level enzyme energetic web site scaffolding utilizing RFdiffusion2. Nat. Strategies 23, 96–105 (2026).
Lisanza, S. L. et al. Multistate and practical protein design utilizing RoseTTAFold sequence house diffusion. Nat. Biotechnol. 43, 1288–1298 (2025).
Dauparas, J. et al. Strong deep learning-based protein sequence design utilizing ProteinMPNN. Science 378, 49–56 (2022).
Google Scholar
Gruver, N. et al. Protein design with guided discrete diffusion. Adv. Neural Inf. Course of. Syst. 36, 12489–12517 (2023).
Ni, B., Kaplan, D. L. & Buehler, M. J. Generative design of de novo proteins based mostly on secondary-structure constraints utilizing an attention-based diffusion mannequin. Chem 9, 1828–1849 (2023).
Google Scholar
Liu, Y. et al. De novo protein design with a denoising diffusion community impartial of pretrained construction prediction fashions. Nat. Strategies 21, 2107–2116 (2024).
Google Scholar
Sarkar, A., Tang, Z., Zhao, C. & Koo, P. Okay. Designing DNA with tunable regulatory exercise utilizing discrete diffusion. In Workshop on AI for New Drug Modalities on the thirty seventh Convention on Neural Info Processing Methods (NeurIPS) (eds Uehara, M. et al.) (2024).
Hou, D. et al. A hyperbolic discrete diffusion 3D RNA inverse folding mannequin for practical RNA design. J. Chem. Inf. Mannequin. 65, 6568–6584 (2025).
Zhao, Y., Oono, Okay., Takizawa, H. & Kotera, M. GenerRNA: a generative pre-trained language mannequin for de novo RNA design. PLoS ONE 19, e0310814 (2024).
Google Scholar
Huang, B. et al. A backbone-centred power perform of neural networks for protein design. Nature 602, 523–528 (2022).
Google Scholar
Wallach, H. et al. (eds). Generative fashions for graph-based protein design. Proceedings of the thirty third Convention on Neural Info Processing Methods Vol. 32 (Curran Associates, 2019).
Strokach, A., Becerra, D., Corbi-Verge, C., Perez-Riba, A. & Kim, P. M. Quick and versatile protein design utilizing deep graph neural networks. Cell Syst. 11, 402–411 (2020).
Google Scholar
Zhang, X., Yin, H., Ling, F., Zhan, J. & Zhou, Y. SPIN-CGNN: improved fastened spine protein design with contact map-based graph development and get in touch with graph neural community. PLoS Comput. Biol. 19, e1011330 (2023).
Google Scholar
Stringer, C., Wang, T., Michaelos, M. & Pachitariu, M. CellPose: a generalist algorithm for mobile segmentation. Nat. Strategies 18, 100–106 (2021).
Google Scholar
Archit, A. et al. Phase something for microscopy. Nat. Strategies 22, 579–591 (2025).
Google Scholar
He, L., Shi, R., Wang, W., Cai, Y. & Ma, L. Unifying the electron microscopy multiverse by way of a large-scale basis mannequin. Preprint at bioRxiv (2025).
Pachitariu, M. & Stringer, C. CellPose 2.0: easy methods to practice your individual mannequin. Nat. Strategies 19, 1634–1641 (2022).
Google Scholar
Kirillov, A. et al. Phase something. Proc. IEEE/CVF Int. Conf. Laptop Imaginative and prescient 4015–4026 (IEEE, 2023).
Gupta, A. et al. SubCell: imaginative and prescient basis fashions for microscopy seize single-cell biology. Preprint at bioRxiv (2024).
Ma, C., Tan, W., He, R. & Yan, B. Pretraining a basis mannequin for generalizable fluorescence microscopy-based picture restoration. Nat. Strategies 21, 1558–1567 (2024).
Google Scholar
Bilodeau, A. et al. A self-supervised basis mannequin for sturdy and generalizable illustration studying in STED microscopy. Preprint at bioRxiv (2025).
Bilal, M. et al. Basis fashions in computational pathology: a evaluate of challenges, alternatives, and impression. Preprint at arXiv (2025).
Vorontsov, E. et al. A basis mannequin for clinical-grade computational pathology and uncommon cancers detection. Nat. Med. 30, 2924–2935 (2024).
Google Scholar
Zimmermann, E. et al. Virchow2: scaling self-supervised blended magnification fashions in pathology. Preprint at arXiv (2024).
Chen, R. J. et al. In the direction of a general-purpose basis mannequin for computational pathology. Nat. Med. 30, 850–862 (2024).
Google Scholar
Nicke, T. et al. Tissue Ideas v2: a supervised basis mannequin for complete slide photos. Preprint at arXiv (2025).
Huang, Z., Bianchi, F., Yuksekgonul, M., Montine, T. J. & Zou, J. A visible-language basis mannequin for pathology picture evaluation utilizing medical twitter. Nat. Med. 29, 2307–2316 (2023).
Google Scholar
Lu, M. Y. et al. A multimodal generative AI copilot for human pathology. Nature 634, 466–473 (2024).
Google Scholar
Chen, Y. et al. Slidechat: a big vision-language assistant for whole-slide pathology picture understanding. Proc. Laptop Imaginative and prescient and Sample Recognition Conf. 5134–5143 (2025).
Vaidya, A. et al. Molecular-driven basis mannequin for oncologic pathology. Preprint at arXiv (2025).
Xu, Y. et al. A multimodal knowledge-enhanced whole-slide pathology basis mannequin. Nat Commun 16, 11406 (2025).
Google Scholar
Senan, S. et al. DNA-Diffusion: leveraging generative fashions for controlling chromatin accessibility and gene expression by way of artificial regulatory components. In Workshop on Machine Studying for Genomics Explorations (MLGenX) on the twelfth Worldwide Convention on Studying Representations (ICLR) (eds Hajiramezanali, E. et al.) (2024).
Bai, Y., Zhong, H., Wang, T. & Lu, Z. J. OligoFormer: an correct and sturdy prediction methodology for siRNA design. Bioinformatics 40, btae577 (2024).
Google Scholar
Wong, F. et al. Deep generative design of RNA aptamers utilizing structural predictions. Nat. Comput. Sci. 4, 829–839 (2024).
Google Scholar
Xiong, D. et al. A structurally knowledgeable human protein–protein interactome reveals proteome-wide perturbations brought on by illness mutations. Nat. Biotechnol. 43, 1510–1524 (2025).
Google Scholar
Réau, M., Renaud, N., Xue, L. C. & Bonvin, A. M. J. J. DeepRank-GNN: a graph neural community framework to study patterns in protein–protein interfaces. Bioinformatics 39, btac759 (2023).
Google Scholar
Baek, M. et al. Correct prediction of protein–nucleic acid complexes utilizing RoseTTAFoldNA. Nat. Strategies 21, 117–121 (2024).
Google Scholar
Rajwade, D. et al. Understanding protein–DNA interactions by taking note of protein and genomics basis fashions. In Proc. NeurIPS 2024 Workshop on Basis Fashions for Science: Progress, Alternatives, and Challenges (eds Chen, W. et al.) (2024).
Gainza, P. et al. De novo design of protein interactions with discovered floor fingerprints. Nature 617, 176–184 (2023).
Google Scholar
Marchand, A. et al. Concentrating on protein–ligand neosurfaces with a generalizable deep studying software. Nature 639, 522–531 (2025).
Google Scholar
Hie, B. L. et al. Environment friendly evolution of human antibodies from normal protein language fashions. Nat. Biotechnol. 42, 275–283 (2024).
Google Scholar
Schneuing, A. et al. Construction-based drug design with equivariant diffusion fashions. Nat. Comput. Sci. 4, 899–909 (2024).
Google Scholar
Igashov, I. et al. Equivariant 3D-conditional diffusion mannequin for molecular linker design. Nat. Mach. Intell. 6, 417–427 (2024).
Google Scholar
Li, X.-S. et al. Multiphysical graph neural community (MP-GNN) for COVID-19 drug design. Transient. Bioinform. 23, bbac231 (2022).
Google Scholar
Shanker, V. R., Bruun, T. U., Hie, B. L. & Kim, P. S. Unsupervised evolution of protein and antibody complexes with a structure-informed language mannequin. Science 385, 46–53 (2024).
Google Scholar
Jonas, E. & Kuhn, S. Fast prediction of NMR spectral properties with quantified uncertainty. J. Cheminform. 11, 50 (2019).
Google Scholar
Kang, S., Kwon, Y., Lee, D. & Choi, Y.-S. Predictive modeling of NMR chemical shifts with out utilizing atomic-level annotations. J. Chem. Inf. Mannequin. 60, 3765–3769 (2020).
Google Scholar
Younger, A., Röst, H. & Wang, B. Tandem mass spectrum prediction for small molecules utilizing graph transformers. Nat. Mach. Intell. 6, 404–416 (2024).
Google Scholar
Hu, F., Chen, M. S., Rotskoff, G. M., Kanan, M. W. & Markland, T. E. Correct and environment friendly construction elucidation from routine one-dimensional NMR spectra utilizing multitask machine studying. ACS Cent. Sci. 10, 2162–2170 (2024).
Google Scholar
Yilmaz, M., Fondrie, W., Bittremieux, W., Oh, S. & Noble, W. S. De novo mass spectrometry peptide sequencing with a transformer mannequin. In Proc. Int. Conf. Machine Studying (eds Chaudhuri, Okay. et al.) 25514–25522 (PMLR, 2022).
Liang, Y., Li, D., Xu, A. G., Shao, Y. & Tang, Okay. GeneBag: coaching a cell basis mannequin for broad-spectrum most cancers analysis and prognosis with bulk RNA-seq information. Preprint at bioRxiv (2024).
Theus, A., Barkmann, F., Wissel, D. & Boeva, V. CancerFoundation: a single-cell RNA sequencing basis mannequin to decipher drug resistance in most cancers. Preprint at bioRxiv (2024).
Maleki, S. et al. Environment friendly fine-tuning of single-cell basis fashions permits zero-shot molecular perturbation prediction. In Workshop on Machine Studying for Genomics Explorations (MLGenX) on the thirteenth Worldwide Convention on Studying Representations (ICLR) (eds Hajiramezanali, E. et al.) (2025).
Sumanaweera, D. et al. Gene-level alignment of single-cell trajectories. Nat. Strategies 22, 68–81 (2025).
Google Scholar
Ergen, C. et al. Consensus prediction of cell sort labels in single-cell information with popV. Nat. Genet. 56, 2731–2738 (2024).
Google Scholar
Schuster, V., Dann, E., Krogh, A. & Teichmann, S. A. multiDGD: a flexible deep generative mannequin for multi-omics information. Nat. Commun. 15, 10031 (2024).
Google Scholar
Zinati, Y., Takiddeen, A. & Emad, A. GRouNdGAN: GRN-guided simulation of single-cell RNA-seq information utilizing causal generative adversarial networks. Nat. Commun. 15, 4055 (2024).
Google Scholar
Wan, J. et al. TriSAM: Tri-Aircraft SAM for zero-shot cortical blood vessel segmentation in VEM photos. IEEE J. Biomed. Well being Inform. 29, 8246–8255 (2025).
Google Scholar
Zhuo, Z. et al. Phase something for dendrites from electron microscopy. Proceedings of the 2025 IEEE sixth Worldwide Convention on Picture Processing, Functions and Methods (IPAS) pp. 1–6 (IEEE, 2025).
Van Gent, D. C. & Kanaar, R. Exploiting DNA restore defects for novel most cancers therapies. Mol. Biol. Cell 27, 2145–2148 (2016).
Google Scholar
Silverstein, R. A. et al. Customized CRISPR–Cas9 PAM variants by way of scalable engineering and machine studying. Nature 643, 539–550 (2025).
Google Scholar
Armingol, E., Officer, A., Harismendy, O. & Lewis, N. E. Deciphering cell–cell interactions and communication from gene expression. Nat. Rev. Genet. 22, 71–88 (2021).
Google Scholar
Singh, R. et al. Studying the language of antibody hypervariability. Proc. Natl Acad. Sci. USA 122, e2418918121 (2025).
Google Scholar
Guan, C., Fernandes, F. C., Franco, O. L. & de la Fuente-Nunez, C. Leveraging massive language fashions for peptide antibiotic design. Cell Rep. Phys. Sci. 6, 102359 (2025).
Google Scholar
Yang, Okay. Okay., Wu, Z. & Arnold, F. H. Machine-learning-guided directed evolution for protein engineering. Nat. Strategies 16, 687–694 (2019).
Google Scholar
Roney, M. & Aluwi, M. F. F. M. The significance of in-silico research in drug discovery. Intell. Pharm. 2, 578–579 (2024).
Gottweis, J. et al. In the direction of an AI co-scientist. Preprint at arXiv (2025).
Gridach, M., Nanavati, J., Abidine, Okay. Z. E., Mendes, L. & Mack, C. Agentic AI for scientific discovery: a survey of progress, challenges, and future instructions. Preprint at arXiv (2025).
Yamada, Y. et al. The AI Scientist-v2: workshop-level automated scientific discovery by way of agentic tree search. Preprint at arXiv (2025).
Gao, S. et al. Empowering biomedical discovery with AI brokers. Cell 187, 6125–6151 (2024).
Google Scholar
Swanson, Okay. et al. The digital lab of AI brokers designs new SARS-CoV-2 nanobodies. Nature 646, 716–723 (2025).
Google Scholar
Huang, Okay. et al. Biomni: a general-purpose biomedical AI agent. Preprint at bioRxiv (2025).
Wang, H. et al. SpatialAgent: an autonomous AI agent for spatial biology. Preprint at bioRxiv (2025).
Youngblut, N. D. et al. scBaseCamp: an AI agent-curated, uniformly processed, and frequently increasing single cell information repository. Preprint at bioRxiv (2025).
Loew, L. M. & Schaff, J. C. The digital cell: a software program atmosphere for computational cell biology. Tendencies Biotechnol. 19, 401–406 (2001).
Google Scholar
Heimberg, G. et al. A cell atlas basis mannequin for scalable search of comparable human cells. Nature 638, 1085–1094 (2025).
Google Scholar
Fischer, F. et al. scTab: scaling cross-tissue single-cell annotation fashions. Nat. Commun. 15, 6611 (2024).
Google Scholar
Gao, H. et al. Constructing a learnable common coordinate system for single-cell atlas with a joint-VAE mannequin. Commun. Biol. 7, 977 (2024).
Google Scholar
Zhang, J. et al. Tahoe-100M: a giga-scale single-cell perturbation atlas for context-dependent gene perform and mobile modeling. Preprint at bioRxiv (2025).
Adduri, A. et al. Predicting mobile responses to perturbation throughout various contexts with STATE. In Workshop on AI Digital Cells and Devices: A New Period in Drug Discovery and Improvement on the thirty eighth Convention on Neural Info Processing Methods (NeurIPS) (eds Gu, Q. et al.) (2024).
Roohani, Y. H. et al. Digital Cell Problem: towards a turing take a look at for the digital cell. Cell 188, 3370–3374 (2025).
Google Scholar
Wenckstern, J. et al. AI-powered digital tissues from spatial proteomics for scientific diagnostics and biomedical discovery. Clin. Most cancers Res. 31(13 Suppl.), B037 (2025).
Google Scholar
He, S. et al. Studying single-cell spatial context by way of built-in spatial multiomics with CORAL. Preprint at bioRxiv (2025).
Li, J., Chen, S., Pan, X., Yuan, Y. & Shen, H.-B. Cell clustering for spatial transcriptomics information with graph neural networks. Nat. Comput. Sci. 2, 399–408 (2022).
Google Scholar
Bao, X., Bai, X., Liu, X., Shi, Q. & Zhang, C. Spatially knowledgeable graph transformers for spatially resolved transcriptomics. Commun. Biol. 8, 574 (2025).
Google Scholar
Zhang, D. et al. Inferring super-resolution tissue structure by integrating spatial transcriptomics with histology. Nat. Biotechnol. 42, 1372–1377 (2024).
Google Scholar
He, S. et al. Starfysh integrates spatial transcriptomic and histologic information to disclose heterogeneous tumor-immune hubs. Nat. Biotechnol. 43, 223–235 (2024).
Google Scholar
Lee, Y., Liu, X., Hao, M., Liu, T. & Regev, A. PathOmCLIP: connecting tumor histology with spatial gene expression by way of domestically enhanced contrastive studying of pathology and single-cell basis mannequin. Preprint at bioRxiv (2024).
Almagro-Pérez, C. et al. AI-driven 3D spatial transcriptomics. Preprint at arXiv (2025).
Chen, W. et al. A visible–omics basis mannequin to bridge histopathology with spatial transcriptomics. Nat. Strategies 22, 1568–158 (2025).
Google Scholar
Cui, H. et al. In the direction of multimodal basis fashions in molecular cell biology. Nature 640, 623–633 (2025).
Google Scholar
Tang, Z., Somia, N., Yu, Y. & Koo, P. Okay. Evaluating the representational energy of pre-trained DNA language fashions for regulatory genomics. Genome Biol. 26, 203 (2025).
Google Scholar
Tsishyn, M., Hermans, P., Rooman, M. & Pucci, F. Residue conservation and solvent accessibility are (nearly) all you want for predicting mutational results in proteins. Bioinformatics 41, btaf322 (2025).
Google Scholar
Atti, S. & Subramaniam, S. Basic limitations of basis fashions in single-cell transcriptomics. Preprint at bioRxiv (2025).
Kedzierska, Okay. Z., Crawford, L., Amini, A. P. & Lu, A. X. Zero-shot analysis reveals limitations of single-cell basis fashions. Genome Biol. 26, 101 (2025).
Google Scholar
Ahlmann-Eltze, C., Huber, W. & Anders, S. Deep-learning-based gene perturbation impact prediction doesn’t but outperform easy linear baselines. Nat. Strategies 22, 1657–1661 (2025).
Google Scholar
Märtens, Okay., Donovan-Maiye, R. & Ferkinghoff-Borg, J. Enhancing generative perturbation fashions with LLM-informed gene embeddings. In Proc. ICLR 2024 Workshop on Machine Studying for Genomics Explorations (eds Theis, F. et al.) (2024).
Csendes, G., Sanz, G., Szalay, Okay. Z. & Szalai, B. Benchmarking basis cell fashions for post-perturbation RNA-seq prediction. BMC Genomics 26, 393 (2025).
Google Scholar
Liu, Z. et al. Genbench: a benchmarking suite for systematic analysis of genomic basis fashions. Preprint at arXiv (2024).
Gao, Z. et al. PFMBench: protein basis mannequin benchmark. Preprint at arXiv (2025).
Qiu, P. et al. BioLLM: a standardized framework for integrating and benchmarking single-cell basis fashions. Patterns (N.Y.) 6, 101326 (2025).
Google Scholar
Theodoris, C. V. Views on benchmarking basis fashions for community biology. Quant. Biol. 12, 335–338 (2024).
Google Scholar
Fishman, V. et al. GENA-LM: a household of open-source foundational DNA language fashions for lengthy sequences. Nucleic Acids Res. 53, gkae1310 (2025).
Google Scholar
Koonin, E. V., Wolf, Y. I. & Karev, G. P. The construction of the protein universe and genome evolution. Nature 420, 218–223 (2002).
Google Scholar
Rood, J. E., Hupalowska, A. & Regev, A. Towards a basis mannequin of causal cell and tissue biology with a perturbation cell and tissue atlas. Cell 187, 4520–4545 (2024).
Google Scholar
Li, C. et al. Benchmarking AI fashions for in silico gene perturbation of cells. Preprint at bioRxiv (2024).
Yuan, B. et al. CellBox: interpretable machine studying for perturbation biology with utility to the design of most cancers mixture remedy. Cell Syst. 12, 128–140 (2021).
Google Scholar
Qian, L. et al. AI-empowered perturbation proteomics for advanced organic programs. Cell Genomics 4, 100691 (2024).
Google Scholar
Pearce, J. D. et al. A cross-species generative cell atlas throughout 1.5 billion years of evolution: the TranscriptFormer Single-cell Mannequin. Preprint at bioRxiv https://doi.org/10.1101/2025.04.25.650731(2025).
Zhang, Q., Stelzer, A. C., Fisher, C. Okay. & Al-Hashimi, H. M. Visualizing spatially correlated dynamics that directs RNA conformational transitions. Nature 450, 1263–1267 (2007).
Google Scholar
Rozenblatt-Rosen, O., Stubbington, M. J. T., Regev, A. & Teichmann, S. A. The Human Cell Atlas: from imaginative and prescient to actuality. Nature 550, 451–453 (2017).
Google Scholar
Börner, Okay. et al. Human BioMolecular Atlas Program (HuBMAP): 3D Human Reference Atlas development and utilization. Nat. Strategies 22, 845–860 (2025).
Google Scholar
Rozenblatt-Rosen, O. et al. The Human Tumor Atlas Community: charting tumor transitions throughout house and time at single-cell decision. Cell 181, 236–249 (2020).
Google Scholar
Coleman, Okay. et al. Resolving tissue complexity by multimodal spatial omics modeling with MISO. Nat. Strategies 22, 530–538 (2025).
Google Scholar
Lengthy, Y. et al. Deciphering spatial domains from spatial multi-omics with SpatialGlue. Nat. Strategies 21, 1658–1667 (2024).
Google Scholar
Zeng, Y. et al. Imputing spatial transcriptomics by way of gene community constructed from protein language mannequin. Commun. Biol. 7, 1271 (2024).
Google Scholar
Chen, T. et al. SELF-Former: multi-scale gene filtration transformer for single-cell spatial reconstruction. Transient. Bioinform. 25, bbae523 (2024).
Google Scholar
Schroeder, A. et al. Scaling up spatial transcriptomics for large-sized tissues: uncovering cellular-level tissue structure past standard platforms with iSCALE. Nat Strategies 22, 1911–1922 (2025).
Google Scholar
Gandin, V. et al. Deep-tissue transcriptomics and subcellular imaging at excessive spatial decision. Science 388, eadq2084 (2025).
Google Scholar
U.S. Meals & Drug Administration. FDA publicizes plan to section out animal testing requirement for monoclonal antibodies and different medication. (2025).
Kim, J., Koo, B.-Okay. & Knoblich, J. A. Human organoids: mannequin programs for human biology and drugs. Nat. Rev. Mol. Cell Biol. 21, 571–584 (2020).
Google Scholar
Passaro, S. et al. Boltz-2: in direction of correct and environment friendly binding affinity prediction. Preprint at bioRxiv (2025).
Wechsler, H. (ed.). Neural Networks for Notion pp. 65–93 (Educational Press, 1992).
Hastie, T., Tibshirani, R. & Friedman, J. The Components of Statistical Studying: Knowledge Mining, Inference, and Prediction pp. 485–585 (Springer, 2008).
Twine, M. & Cunningham, P. (eds). Machine Studying Strategies for Multimedia: Case Research on Group and Retrieval pp. 21–49 (Springer, 2008).
O’shea, Okay. & Nash, R. An introduction to convolutional neural networks. Preprint at arXiv (2015).
Xing, F., Xie, Y., Su, H., Liu, F. & Yang, L. Deep studying in microscopy picture evaluation: A survey. IEEE Trans. Neural Netw. Study. Syst. 29, 4550–4568 (2017).
Google Scholar
Banerji, S. & Mitra, S. Deep studying in histopathology: a evaluate. Wiley Interdiscip. Rev. Knowledge Min. Knowl. Discov. 12, e1439 (2022).
Google Scholar
Dosovitskiy, A. et al. A picture is price 16×16 phrases: transformers for picture recognition at scale. In ninth Int. Conf. Studying Representations (ICLR) (2021).
Devlin, J., Chang, M. W., Lee, Okay. & Toutanova, Okay. BERT: pre-training of deep bidirectional transformers for language understanding. In Proc. 2019 Conf. North American Chapter of the Affiliation for Computational Linguistics: Human Language Applied sciences Vol. 1 (eds Burstein, J. et al.) 4171–4186 (Affiliation for Computational Linguistics, 2019).



