Towards safe, robust, interpretable dialogue agents for democratized medical care

Towards Safe, Robust, Interpretable Dialogue Agents for Democratized Medical Care Reverse Engineering the Invariances of the Primate Visual System

PI: Julia Hirschberg
Co-PI: Sarah Ita Levitan, CUNY; Tatiana Emmanouil, CUNY

Abstract

With the growing capabilities of Large Language Models (LLMs) in enhancing communication, AI-powered dialogue agents are increasingly used in clinical psychology, promising to augment human therapists. However, deploying these technologies in medical settings involves significant risks, including the lack of safety regulation and robust evaluation methods. Our project aims to develop reliable LLMs that provide immediate mental health support by establishing comprehensive safety guidelines and creating interpretable, effective medical AI/NLP systems tailored for psychological counseling. A cornerstone of this effort is our continuous interdisciplinary collaboration with medical professionals to ensure our technological advancements are safely grounded in clinical reality.

Publications

Chen, R., Liang, W., Gong, Z., Ai, L., & Hirschberg, J. (2026). Detecting Mental Manipulation in Speech via Synthetic Multi-Speaker Dialogue. arXiv. https://doi.org/10.48550/ARXIV.2601.08342
Bojic, I., Ong, Q. C., Ma, S. H. X., Ai, L., Liu, Z., Gong, Z., Hirschberg, J., Ho, A. H. Y., & Khong, A. W. H. (2025). SMARTMiner: Extracting and Evaluating SMART Goals from Low-Resource Health Coaching Notes. Findings of the Association for Computational Linguistics: EMNLP 2025, 16288–16305. https://doi.org/10.18653/v1/2025.findings-emnlp.885
Gong, Z., Shi, P., Donbekci, K., Ai, L., Chen, R., Sasu, D., Wu, Z., & Hirschberg, J. (2025). Learning More with Less: Self-Supervised Approaches for Low-Resource Speech Emotion Recognition. arXiv. https://doi.org/10.48550/ARXIV.2506.02059
Cai, Y., Wu, C., Ma, B., Chen, B., Xue, Y., Hirschberg, J., & Gong, Z. (2026). SURE: Synergistic Uncertainty-aware Reasoning for Multimodal Emotion Recognition in Conversations. arXiv. https://doi.org/10.48550/ARXIV.2604.01916
Ma, B., Li, Y., Zhou, W., Gong, Z., Liu, Y. J., Jasinskaja, K., Friedrich, A., Hirschberg, J., Kreuter, F., & Plank, B. (2025). Pragmatics in the Era of Large Language Models: A Survey on Datasets, Evaluation, Opportunities and Challenges. arXiv. https://doi.org/10.48550/ARXIV.2502.12378
Gong, Z., Ai, L., Deshpande, H., Johnson, A., Phung, E., Wu, Z., Emami, A., & Hirschberg, J. (2024). CREAM: Comparison-Based Reference-Free ELO-Ranked Automatic Evaluation for Meeting Summarization. arXiv. https://doi.org/10.48550/ARXIV.2409.10883
Liu, Z., Gong, Z., Ai, L., Hui, Z., Chen, R., Leach, C. W., Greene, M. R., & Hirschberg, J. (2025). A Review of Incorporating Psychological Theories in LLMs. arXiv. https://doi.org/10.48550/ARXIV.2505.00003
Liu, Z., Gong, Z., Ai, L., Hui, Z., Chen, R., Leach, C. W., Greene, M. R., & Hirschberg, J. (2025). A Review of Incorporating Psychological Theories in LLMs. arXiv. EACL 2026. https://doi.org/10.48550/ARXIV.2505.00003
Gong, Z., Cai, D., Ma, B., Sharma, M., On, T. Q., Morparia, M., Enos, B., Yu, Y., Resnik, P., Levitan, S., & Hirshburg, J. (2026). MCML - A Survey on Mental Health Datasets and Resources. https://mcml.ai/publications/gdm+26/

Resources

Mental Health Resources and Benchmark Website: A comprehensive repository and review proposing actionable criteria for fairness, generalizability, and privacy compliance in medical datasets. Available at: https://ziweig.github.io/mental-health-datasets-resources-review/
SMARTMiner & SMARTSpan Dataset: Framework and dataset (173 health coaching notes) for extracting and evaluating clinical treatment plans from unstructured text. Available at: https://github.com/IvaBojic/SMARTMiner
SPEECHMENTALMANIP Benchmark: Code and data for a novel synthetic multi-speaker benchmark for detecting mental manipulation in spoken dialogues. Available at: https://github.com/runjchen/speech_mentalmanip
Akan Cinematic Emotions (AkaCE) Dataset: The first multimodal emotion dialogue dataset for an African language, featuring 385 emotion-labeled dialogues. Available at: https://github.com/zehuiwu/Akan-Cinematic-Emotion