结构化角色扮演：基于问卷的合成治疗师-患者对话生成 (Roleplaying with Structure: Synthetic Therapist-Client Conversation Generation from Questionnaires)

Doan Nam Long Vu,Rui Tan,Lena Moench,Svenja Jule Francke,Daniel Woiwod,Florian Thomas-Odenthal,Sanna Stroth,Tilo Kircher,Christiane Hermann,Udo Dannlowski,Hamidreza Jamalabadi,Shaoxiong Ji

The development of AI for mental health is hindered by a lack of authentic therapy dialogues, due to strict privacy regulations and the fact that clinical sessions were historically rarely recorded. We present an LLM-driven pipeline that generates synthetic counseling dialogues based on structured client profiles and psychological questionnaires. Grounded on the principles of Cognitive Behavioral Therapy (CBT), our method creates synthetic therapeutic conversations for clinical disorders such as anxiety and depression. Our framework, SQPsych (Structured Questionnaire-based Psychotherapy), converts structured psychological input into natural language dialogues through therapist-client simulations. Due to data governance policies and privacy restrictions prohibiting the transmission of clinical questionnaire data to third-party services, previous methodologies relying on proprietary models are infeasible in our setting. We address this limitation by generating a high-quality corpus using open-weight LLMs, validated through human expert evaluation and LLM-based assessments. Our SQPsychLLM models fine-tuned on SQPsychConv achieve strong performance on counseling benchmarks, surpassing baselines in key therapeutic skills. Our findings highlight the potential of synthetic data to enable scalable, data-secure, and clinically informed AI for mental health support. We will release our code, models, and corpus at https://ai-mh.github.io/SQPsych

翻译：心理健康人工智能的发展因缺乏真实的治疗对话而受阻，这源于严格的隐私法规以及历史上临床会话鲜有记录的现实。我们提出了一种基于大语言模型的流程，能够根据结构化的患者档案与心理问卷生成合成咨询对话。基于认知行为疗法的原理，该方法可为焦虑症和抑郁症等临床障碍创建合成治疗对话。我们的框架SQPsych通过治疗师-患者模拟，将结构化心理输入转化为自然语言对话。由于数据治理政策与隐私限制禁止将临床问卷数据传输至第三方服务，以往依赖专有模型的方法在我们的场景中不可行。我们通过使用开源权重的大语言模型生成高质量语料库来突破此限制，并经由人类专家评估与大语言模型基准测试进行验证。基于SQPsychConv微调的SQPsychLLM模型在心理咨询基准测试中表现优异，在关键治疗技能上超越基线方法。我们的研究突显了合成数据在实现可扩展、数据安全且具有临床依据的心理健康支持人工智能方面的潜力。代码、模型与语料库将在https://ai-mh.github.io/SQPsych发布。

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日