Associate Professor Yale University New Haven, Connecticut, United States
Introduction/Rationale: Predicting immunogenicity remains a critical challenge in vaccine design, autoimmunity treatment, and pharmaceutical development. Current AI tools rely on limited data inputs, typically only class-I MHC-peptide sequences, missing crucial structural and biochemical information. We developed the ImmunoFoundation Model (IFM), a multimodal deep learning system that integrates not only peptide sequences, 3D molecular structures, and biochemical properties but also TCR-MHC-peptide (class-II and class-I) to achieve superior immunogenicity prediction and enable peptide optimization for therapeutic applications.
Methods: IFM comprises three modules: (1) ESM3 transformer for embedding antigen-peptide, MHC, and TCR sequences from vast biomedical data; (2) geometric scattering transformer networks to capture molecular structure from AlphaFold3-predicted peptide-MHC complexes; (3) Autoencoder for biochemical property embedding including surface area and thermal stability. Cross-modal attention layers integrate these representations. Training utilized IEDB, VDJdb, McPAS-TCR, and TCR3d datasets totaling >300,000 samples across MHC class I and II.
Results: The preliminary model achieved state-of-the-art performance on CEDAR cancer neoepitope datasets. Attention mechanism analysis revealed structural motifs influencing immunogenicity, distinguishing between KRAS G12V and G12D mutants. The model successfully predicted vaccine cassette immunogenicity and identified key peptide-MHC interaction sites. Current IFM development shows improved multimodal integration with enhanced predictive accuracy across viral and cancer peptide immunogenicity tasks.
Conclusion: IFM represents a paradigm shift in immunogenicity prediction by comprehensively modeling the complex antigen-MHC-TCR interaction system. Its generative capabilities enable peptide optimization for cancer vaccines and personalized immunotherapy, with potential applications in autoimmunity treatment and biologics development.