Post
201
🚀 For those who interested in multilingual clinical case report sukmmarization 🩺📋, deligned to share a video-update to the earlier post on Qwen2.5 model family adaptation:
🎬 Video: https://www.youtube.com/watch?v=uOAiUvLghuE
This is 15-min skimming of the study (+ 5 mins for code) in which we overview the application of Qwen model family (72B as a teacher and 0.5B as a student) in summarization of the clinical reports, including detaied overview of the experiments organization. In particular, attempted to cover:
1. Background of previous Seq2Seq models to conclude their limitations
2. ChatML roles exploiting for distilation tuning in clinical report summarization
3. Known limitation of work and unleashing full capabilities
As in previous post, there is a model card that is also covered in video.
🤗 Huggingface: https://huggingface.co/nicolay-r/qwen25-05b-multiclinsum-standar
🎬 Video: https://www.youtube.com/watch?v=uOAiUvLghuE
This is 15-min skimming of the study (+ 5 mins for code) in which we overview the application of Qwen model family (72B as a teacher and 0.5B as a student) in summarization of the clinical reports, including detaied overview of the experiments organization. In particular, attempted to cover:
1. Background of previous Seq2Seq models to conclude their limitations
2. ChatML roles exploiting for distilation tuning in clinical report summarization
3. Known limitation of work and unleashing full capabilities
As in previous post, there is a model card that is also covered in video.
🤗 Huggingface: https://huggingface.co/nicolay-r/qwen25-05b-multiclinsum-standar