Explainable Artificial Intelligence for Predicting Oncogenic Gene Expression Programs Driven by MYC Condensate Dynamics
Keywords:
explainable artificial intelligence, MYC condensate dynamics, oncogenic gene expression, phase separation, system architecture, algorithmic governance, robustness, fairness, infrastructureAbstract
The emergence of phase separation as a fundamental organizing principle in transcriptional regulation has reshaped our understanding of oncogenic gene expression, particularly for the MYC oncoprotein whose condensate dynamics orchestrate selective transcriptional programs. Concurrently, explainable artificial intelligence has become indispensable for interpreting high-dimensional genomic data and for building trust in predictive models used in clinical and research settings. This paper presents a systems-level analysis of an explainable AI framework designed to predict oncogenic gene expression programs driven by MYC condensate dynamics. We examine the architectural trade-offs between predictive accuracy and interpretability, the robustness of model explanations under biological variability, and the infrastructural requirements for deploying such systems in translational workflows. Governance and fairness considerations are discussed through the lens of algorithmic bias in genomics, reproducibility of explanations, and ethical implications for personalized oncology. Cross-domain comparisons with other biophysical systems, such as transcriptional condensates at super-enhancers, highlight the generality of the framework. Sustainability of model training, data provenance, and the policy landscape for responsible AI in molecular biology are also addressed. The paper argues that while explainable AI offers powerful tools for deciphering complex phase-separation-driven transcription, its effective integration demands careful attention to system architecture, validation protocols, and socio-technical governance to ensure reliable and equitable outcomes.
References
1. Rudin, C. (2019). Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, 1(5), 206-215.
2. Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30.
3. Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). "Why should I trust you?" Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1135-1144.
4. Doshi-Velez, F., & Kim, B. (2017). Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608.
5. Arrieta, A. B., Díaz-Rodríguez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., ... & Herrera, F. (2020). Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion, 58, 82-115.
6. Boija, A., Klein, I. A., Sabari, B. R., Dall'Agnese, A., Coffey, E. L., Zamudio, A. V., ... & Young, R. A. (2018). Transcription factors activate genes through the phase-separation capacity of their activation domains. Cell, 175(7), 1842-1855.
7. Sabari, B. R., Dall'Agnese, A., Boija, A., Klein, I. A., Coffey, E. L., Shrinivas, K., ... & Young, R. A. (2018). Coactivator condensation at super-enhancers links phase separation and gene control. Science, 361(6400), eaar3958.
8. Hnisz, D., Shrinivas, K., Young, R. A., Chakraborty, A. K., & Sharp, P. A. (2017). A phase separation model for transcriptional control. Cell, 169(1), 13-23.
9. Kress, T. R., Sabò, A., & Amati, B. (2015). MYC: connecting selective transcriptional control to global RNA production. Nature Reviews Cancer, 15(10), 593-607.
10. Lin, C. Y., Lovén, J., Rahl, P. B., Paranal, R. M., Burge, C. B., Bradner, J. E., ... & Young, R. A. (2012). Transcriptional amplification in tumor cells with elevated c-Myc. Cell, 151(1), 56-67.
11. Shin, Y., & Brangwynne, C. P. (2017). Liquid phase condensation in cell physiology and disease. Science, 357(6357), eaaf4382.
12. Yang, J., Chung, C. I., Koach, J., Liu, H., Navalkar, A., He, H., ... & Shu, X. (2024). MYC phase separation selectively modulates the transcriptome. Nature Structural & Molecular Biology, 31(10), 1567-1579.
13. Chen, D., & Liu, Y. (2023). Deep learning for gene expression prediction: a review. Briefings in Bioinformatics, 24(1), bbac536.
14. Ching, T., Himmelstein, D. S., Beaulieu-Jones, B. K., Kalinin, A. A., Do, B. T., Way, G. P., ... & Greene, C. S. (2018). Opportunities and obstacles for deep learning in biology and medicine. Journal of the Royal Society Interface, 15(141), 20170387.
15. Yu, M. K., Ma, J., & Fisher, J. (2020). Using mechanistic models to integrate - not replace - machine learning and mechanistic biology. Nature Machine Intelligence, 2(6), 307-309.
16. Floridi, L., Cowls, J., Beltrametti, M., Chatila, R., Chazerand, P., Dignum, V., ... & Vayena, E. (2018). AI4People—an ethical framework for a good AI society: opportunities, risks, principles, and recommendations. Minds and Machines, 28, 689-707.
17. Mittelstadt, B. D., Allo, P., Taddeo, M., Wachter, S., & Floridi, L. (2016). The ethics of algorithms: Mapping the debate. Big Data & Society, 3(2), 2053951716679679.
18. Jobin, A., Ienca, M., & Vayena, E. (2019). The global landscape of AI ethics guidelines. Nature Machine Intelligence, 1(9), 389-399.
19. Angermueller, C., Pärnamaa, T., Parts, L., & Stegle, O. (2016). Deep learning for computational biology. Molecular Systems Biology, 12(7), 878.
20. Eraslan, G., Avsec, Ž., Gagneur, J., & Theis, F. J. (2019). Deep learning: new computational modelling techniques for genomics. Nature Reviews Genetics, 20(7), 389-403.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Bioinformatics Insights and Analytics

This work is licensed under a Creative Commons Attribution 4.0 International License.



