Preference Learning Unlocks LLMs' Psycho-Counseling Skills
Abstract
Applying large language models (LLMs) to assist in psycho-counseling is an emerging and meaningful approach, driven by the significant gap between patient needs and the availability of mental health support. However, current LLMs struggle to consistently provide effective responses to client speeches, largely due to the lack of supervision from high-quality real psycho-counseling data, whose content is typically inaccessible due to client privacy concerns. Furthermore, the quality of therapists' responses in available sessions can vary significantly based on their professional training and experience. Assessing the quality of therapists' responses remains an open challenge. In this work, we address these challenges by first proposing a set of professional and comprehensive principles to evaluate therapists' responses to client speeches. Using these principles, we create a preference dataset, PsychoCounsel-Preference, which contains 36k high-quality preference comparison pairs. This dataset aligns with the preferences of professional psychotherapists, providing a robust foundation for evaluating and improving LLMs in psycho-counseling. Experiments on reward modeling and preference learning demonstrate that PsychoCounsel-Preference is an excellent resource for LLMs to acquire essential skills for responding to clients in a counseling session. Our best-aligned model, PsychoCounsel-Llama3-8B, achieves an impressive win rate of 87% against GPT-4o. We release PsychoCounsel-Preference, PsychoCounsel-Llama3-8B and the reward model PsychoCounsel Llama3-8B-Reward to facilitate the research of psycho-counseling with LLMs at: https://hf.co/Psychotherapy-LLM.
Community
TL;DR: We propose a large preference dataset for psycho-counseling with professional principles. Our best-aligned model achieves an impressive win rate of 87% against GPT-4o.
Resources: https://huggingface.co/Psychotherapy-LLM
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- HamRaz: A Culture-Based Persian Conversation Dataset for Person-Centered Therapy Using LLM Agents (2025)
- MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer (2025)
- Evaluating an LLM-Powered Chatbot for Cognitive Restructuring: Insights from Mental Health Professionals (2025)
- EmoAssist: Emotional Assistant for Visual Impairment Community (2025)
- From Personas to Talks: Revisiting the Impact of Personas on LLM-Synthesized Emotional Support Conversations (2025)
- Trust Modeling in Counseling Conversations: A Benchmark Study (2025)
- Consistent Client Simulation for Motivational Interviewing-based Counseling (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 2
Datasets citing this paper 1
Spaces citing this paper 0
No Space linking this paper