Secure Multimodal Clinical Decision Support Using Robust LLM Agents and Margin-Scalable Semantic Hashing Networks

Bjay Manha; Blake Jarvinen; Anirudh M. Pandey

Authors

Bjay Manha Department of Computer Science, University of Central Florida, Orlando, FL, USA.
Blake Jarvinen Department of Computer Science, George Mason University, Fairfax, VA, USA.
Anirudh M. Pandey Department of Computer Science, University of Houston, Houston, TX, USA.

Keywords:

clinical decision support, multimodal learning, large language model agents, semantic hashing, adversarial robustness, healthcare security, margin-scalable constraint

Abstract

The integration of large language model agents into multimodal clinical decision support systems represents a transformative opportunity to augment diagnostic accuracy, personalize treatment pathways, and streamline clinical workflows. However, the deployment of such systems in high-stakes medical environments introduces severe security vulnerabilities, ranging from adversarial input perturbations to model inversion and data leakage. This paper presents a secure architecture that couples robust, retrieval-augmented LLM agents with margin-scalable semantic hashing networks to enable fast and resilient multimodal evidence retrieval while preserving patient privacy and clinical trustworthiness. The LLM agents incorporate adversarial training and defensive prompt filtering to withstand both white-box and black-box attacks targeting clinical guidance. The semantic hashing module leverages a self-supervised asymmetric learning paradigm and a margin-scalable constraint that maintains discriminability at arbitrary hash code lengths, ensuring efficient and precise retrieval of similar multimodal cases from massive clinical repositories. We discuss the structural trade-offs between retrieval granularity, computational overhead, and adversarial robustness, and we propose a layered governance framework that addresses fairness, explainability, and regulatory compliance. Through a systems-level analysis, the paper illustrates how the synergistic combination of semantically structured hashing and adversarially hardened LLM agents can fortify multimodal clinical decision support against emerging threats while sustaining high-quality care outcomes.

References

1. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. In Advances in neural information processing systems (pp. 5998-6008).

2. Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., ... & Amodei, D. (2020). Language models are few-shot learners. In Advances in neural information processing systems (pp. 1877-1901).

3. Topol, E. J. (2019). High-performance medicine: The convergence of human and artificial intelligence. Nature Medicine, 25(1), 44-56.

4. Schick, T., Dwivedi-Yu, J., Dessì, R., Raileanu, R., Lomeli, M., Zettlemoyer, L., Cancedda, N., & Scialom, T. (2023). Toolformer: Language models can teach themselves to use tools. arXiv preprint arXiv:2302.04761.

5. Yu, Z., Wu, S., Dou, Z., & Bakker, E. M. (2022). Deep hashing with self-supervised asymmetric semantic excavation and margin-scalable constraint. Neurocomputing, 483, 87-104.

6. Hu, S. (2026). Research on Security Enhancement Methods for Adversarial Robust Large Language Model Intelligent Agents for Medical Decision-Making Tasks. arXiv preprint arXiv:2605.08257.

7. Wallace, E., Feng, S., Kandpal, N., Gardner, M., & Singh, S. (2019). Universal adversarial triggers for attacking and analyzing NLP. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (pp. 2153-2162).

8. Perez, E., Huang, S., Song, F., Cai, T., Ring, R., Aslanides, J., Glaese, A., McAleese, N., & Irving, G. (2022). Red teaming language models with language models. arXiv preprint arXiv:2202.03286.

9. Finlayson, S. G., Bowers, J. D., Ito, J., Zittrain, J. L., Beam, A. L., & Kohane, I. S. (2019). Adversarial attacks on medical machine learning. Science, 363(6433), 1287-1289.

10. Yao, S., Zhao, J., Yu, D., Du, N., Shafran, I., Narasimhan, K., & Cao, Y. (2023). ReAct: Synergizing reasoning and acting in language models. In International Conference on Learning Representations.

11. Singhal, K., Azizi, S., Tu, T., Mahdavi, S. S., Wei, J., Chung, H. W., ... & Natarajan, V. (2023). Large language models encode clinical knowledge. Nature, 620(7972), 172-180.

12. Baltrusaitis, T., Ahuja, C., & Morency, L. P. (2019). Multimodal machine learning: A survey and taxonomy. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(2), 423-443.

13. Kusner, M. J., Loftus, J., Russell, C., & Silva, R. (2017). Counterfactual fairness. In Advances in neural information processing systems (pp. 4066-4076).

14. Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A. (2021). A survey on bias and fairness in machine learning. ACM Computing Surveys, 54(6), 1-35.

15. Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., Hutchinson, B., Spitzer, E., Raji, I. D., & Gebru, T. (2019). Model cards for model reporting. In Proceedings of the Conference on Fairness, Accountability, and Transparency (pp. 220-229).

16. Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., Chaudhary, V., Young, M., & Dennison, D. (2015). Hidden technical debt in machine learning systems. In Advances in neural information processing systems (pp. 2503-2511).

17. Price, W. N., & Cohen, I. G. (2019). Privacy in the age of medical big data. Nature Medicine, 25(1), 37-43.

18. McMahan, B., Moore, E., Ramage, D., Hampson, S., & Arcas, B. A. y. (2017). Communication-efficient learning of deep networks from decentralized data. In Artificial Intelligence and Statistics (pp. 1273-1282).

Secure Multimodal Clinical Decision Support Using Robust LLM Agents and Margin-Scalable Semantic Hashing Networks

Authors

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Journal Information

Indexing & Infrastructure

Current Issue

Information

Make a Submission