Spaces:

kai119
/

llama

Running

App Files Files Community

llama / data /xtuner /docs /zh_cn /get_started /quickstart.rst

kai119

Upload folder using huggingface_hub

22fb4ec verified 6 months ago

raw

history blame contribute delete

18.5 kB

	快速上手
	========

	本节中，我们将演示如何使用 XTuner 微调模型，帮助您快速上手 XTuner。

	在成功安装 XTuner
	后，便可以开始进行模型的微调。在本节中，我们将演示如何使用 XTuner，应用
	QLoRA 算法在 Colorist 数据集上微调 InternLM2-Chat-7B。

	Colorist 数据集（\ `HuggingFace
	链接 <https://huggingface.co/datasets/burkelibbey/colors>`__\ ；\ `ModelScope
	链接 <https://www.modelscope.cn/datasets/fanqiNO1/colors/summary>`__\ ）是一个根据颜色描述提供颜色选择与建议的数据集，经过该数据集微调的模型可以做到根据用户对于颜色的描述，从而给出16进制下的颜色编码，如用户输入“宁静而又相当明亮的浅天蓝色，介于天蓝色和婴儿蓝之间，因其亮度而带有一丝轻微的荧光感。”，模型输出
	\|image1\|\ ，该颜色很符合用户的描述。以下是该数据集的几条样例数据：

	+-----------------------+-----------------------+-------------------+
	\| 英文描述 \| 中文描述 \| 颜色 \|
	+=======================+=======================+===================+
	\| Light Sky Blue: A \| 浅天蓝色 \| #66ccff: \|image8\| \|
	\| calming, fairly \| ：一种介于天蓝和婴儿 \| \|
	\| bright color that \| 蓝之间的平和、相当明 \| \|
	\| falls between sky \| 亮的颜色，由于明亮而 \| \|
	\| blue and baby blue, \| 带有一丝轻微的荧光。 \| \|
	\| with a hint of slight \| \| \|
	\| fluorescence due to \| \| \|
	\| its brightness. \| \| \|
	+-----------------------+-----------------------+-------------------+
	\| Bright red: This is a \| 鲜红色： \| #ee0000: \|image9\| \|
	\| very vibrant, \| 这是一种非常鲜 \| \|
	\| saturated and vivid \| 艳、饱和、生动的红色 \| \|
	\| shade of red, \| ，类似成熟苹果或新鲜 \| \|
	\| resembling the color \| 血液的颜色。它是标准 \| \|
	\| of ripe apples or \| RGB \| \|
	\| fresh blood. It is as \| 调色板上的红色，不含 \| \|
	\| red as you can get on \| 任何蓝色或绿色元素。 \| \|
	\| a standard RGB color \| \| \|
	\| palette, with no \| \| \|
	\| elements of either \| \| \|
	\| blue or green. \| \| \|
	+-----------------------+-----------------------+-------------------+
	\| Bright Turquoise: \| 明亮的绿松石 \| #00ffcc: \|
	\| This color mixes the \| 色：这种颜色融合了鲜 \| \|image10\| \|
	\| freshness of bright \| 绿色的清新和淡蓝色的 \| \|
	\| green with the \| 宁静，呈现出一种充满 \| \|
	\| tranquility of light \| 活力的绿松石色调。它 \| \|
	\| blue, leading to a \| 让人联想到热带水域。 \| \|
	\| vibrant shade of \| \| \|
	\| turquoise. It is \| \| \|
	\| reminiscent of \| \| \|
	\| tropical waters. \| \| \|
	+-----------------------+-----------------------+-------------------+

	准备模型权重
	------------

	在微调模型前，首先要准备待微调模型的权重。

	.. _从-huggingface-下载-1:

	从 HuggingFace 下载
	~~~~~~~~~~~~~~~~~~~

	.. code:: bash

	pip install -U huggingface_hub

	# 拉取模型至 Shanghai_AI_Laboratory/internlm2-chat-7b
	huggingface-cli download internlm/internlm2-chat-7b \
	--local-dir Shanghai_AI_Laboratory/internlm2-chat-7b \
	--local-dir-use-symlinks False \
	--resume-download

	.. _从-modelscope-下载-1:

	从 ModelScope 下载
	~~~~~~~~~~~~~~~~~~

	由于从 HuggingFace
	拉取模型权重，可能存在下载过程不稳定、下载速度过慢等问题。因此在下载过程遇到网络问题时，我们则可以选择从
	ModelScope 下载 InternLM2-Chat-7B 的权重。

	.. code:: bash

	pip install -U modelscope

	# 拉取模型至当前目录
	python -c "from modelscope import snapshot_download; snapshot_download('Shanghai_AI_Laboratory/internlm2-chat-7b', cache_dir='.')"

	在完成下载后，便可以开始准备微调数据集了。

	此处附上 HuggingFace 链接与 ModelScope 链接：

	- HuggingFace
	链接位于：\ https://huggingface.co/internlm/internlm2-chat-7b

	- ModelScope
	链接位于：\ https://modelscope.cn/models/Shanghai_AI_Laboratory/internlm2-chat-7b/summary

	准备微调数据集
	--------------

	接下来，我们需要准备微调数据集。

	.. _从-huggingface-下载-2:

	从 HuggingFace 下载
	~~~~~~~~~~~~~~~~~~~

	.. code:: bash

	git clone https://huggingface.co/datasets/burkelibbey/colors

	.. _从-modelscope-下载-2:

	从 ModelScope 下载
	~~~~~~~~~~~~~~~~~~

	由于相同的问题，因此我们可以选择从 ModelScope 下载所需要的微调数据集。

	.. code:: bash

	git clone https://www.modelscope.cn/datasets/fanqiNO1/colors.git

	此处附上 HuggingFace 链接与 ModelScope 链接：

	- HuggingFace
	链接位于：\ https://huggingface.co/datasets/burkelibbey/colors

	- ModelScope 链接位于：\ https://modelscope.cn/datasets/fanqiNO1/colors

	准备配置文件
	------------

	XTuner 提供了多个开箱即用的配置文件，可以通过 ``xtuner list-cfg``
	查看。我们执行如下指令，以复制一个配置文件到当前目录。

	.. code:: bash

	xtuner copy-cfg internlm2_7b_qlora_colorist_e5 .

	配置文件名的解释：

	======== ==============================
	配置文件 internlm2_7b_qlora_colorist_e5
	======== ==============================
	模型名 internlm2_7b
	使用算法 qlora
	数据集 colorist
	训练时长 5 epochs
	======== ==============================

	此时该目录文件结构应如下所示：

	.. code:: bash

	.
	├── colors
	│ ├── colors.json
	│ ├── dataset_infos.json
	│ ├── README.md
	│ └── train.jsonl
	├── internlm2_7b_qlora_colorist_e5_copy.py
	└── Shanghai_AI_Laboratory
	└── internlm2-chat-7b
	├── config.json
	├── configuration_internlm2.py
	├── configuration.json
	├── generation_config.json
	├── modeling_internlm2.py
	├── pytorch_model-00001-of-00008.bin
	├── pytorch_model-00002-of-00008.bin
	├── pytorch_model-00003-of-00008.bin
	├── pytorch_model-00004-of-00008.bin
	├── pytorch_model-00005-of-00008.bin
	├── pytorch_model-00006-of-00008.bin
	├── pytorch_model-00007-of-00008.bin
	├── pytorch_model-00008-of-00008.bin
	├── pytorch_model.bin.index.json
	├── README.md
	├── special_tokens_map.json
	├── tokenization_internlm2_fast.py
	├── tokenization_internlm2.py
	├── tokenizer_config.json
	└── tokenizer.model

	修改配置文件
	------------

	\| 在这一步中，我们需要修改待微调模型路径和数据路径为本地路径，并且修改数据集加载方式。
	\| 此外，由于复制得到的配置文件是基于基座（Base）模型的，所以还需要修改
	``prompt_template`` 以适配对话（Chat）模型。

	.. code:: diff

	#######################################################################
	# PART 1 Settings #
	#######################################################################
	# Model
	- pretrained_model_name_or_path = 'internlm/internlm2-7b'
	+ pretrained_model_name_or_path = './Shanghai_AI_Laboratory/internlm2-chat-7b'

	# Data
	- data_path = 'burkelibbey/colors'
	+ data_path = './colors/train.jsonl'
	- prompt_template = PROMPT_TEMPLATE.default
	+ prompt_template = PROMPT_TEMPLATE.internlm2_chat

	...
	#######################################################################
	# PART 3 Dataset & Dataloader #
	#######################################################################
	train_dataset = dict(
	type=process_hf_dataset,
	- dataset=dict(type=load_dataset, path=data_path),
	+ dataset=dict(type=load_dataset, path='json', data_files=dict(train=data_path)),
	tokenizer=tokenizer,
	max_length=max_length,
	dataset_map_fn=colors_map_fn,
	template_map_fn=dict(
	type=template_map_fn_factory, template=prompt_template),
	remove_unused_columns=True,
	shuffle_before_pack=True,
	pack_to_max_length=pack_to_max_length)

	因此在这一步中，修改了
	``pretrained_model_name_or_path``\ 、\ ``data_path``\ 、\ ``prompt_template``
	以及 ``train_dataset`` 中的 ``dataset`` 字段。

	启动微调
	--------

	在完成上述操作后，便可以使用下面的指令启动微调任务了。

	.. code:: bash

	# 单机单卡
	xtuner train ./internlm2_7b_qlora_colorist_e5_copy.py
	# 单机多卡
	NPROC_PER_NODE=${GPU_NUM} xtuner train ./internlm2_7b_qlora_colorist_e5_copy.py
	# slurm 情况
	srun ${SRUN_ARGS} xtuner train ./internlm2_7b_qlora_colorist_e5_copy.py --launcher slurm

	正确输出的训练日志应类似如下所示：

	.. code:: text

	01/29 21:35:34 - mmengine - INFO - Iter(train) [ 10/720] lr: 9.0001e-05 eta: 0:31:46 time: 2.6851 data_time: 0.0077 memory: 12762 loss: 2.6900
	01/29 21:36:02 - mmengine - INFO - Iter(train) [ 20/720] lr: 1.9000e-04 eta: 0:32:01 time: 2.8037 data_time: 0.0071 memory: 13969 loss: 2.6049 grad_norm: 0.9361
	01/29 21:36:29 - mmengine - INFO - Iter(train) [ 30/720] lr: 1.9994e-04 eta: 0:31:24 time: 2.7031 data_time: 0.0070 memory: 13969 loss: 2.5795 grad_norm: 0.9361
	01/29 21:36:57 - mmengine - INFO - Iter(train) [ 40/720] lr: 1.9969e-04 eta: 0:30:55 time: 2.7247 data_time: 0.0069 memory: 13969 loss: 2.3352 grad_norm: 0.8482
	01/29 21:37:24 - mmengine - INFO - Iter(train) [ 50/720] lr: 1.9925e-04 eta: 0:30:28 time: 2.7286 data_time: 0.0068 memory: 13969 loss: 2.2816 grad_norm: 0.8184
	01/29 21:37:51 - mmengine - INFO - Iter(train) [ 60/720] lr: 1.9863e-04 eta: 0:29:58 time: 2.7048 data_time: 0.0069 memory: 13969 loss: 2.2040 grad_norm: 0.8184
	01/29 21:38:18 - mmengine - INFO - Iter(train) [ 70/720] lr: 1.9781e-04 eta: 0:29:31 time: 2.7302 data_time: 0.0068 memory: 13969 loss: 2.1912 grad_norm: 0.8460
	01/29 21:38:46 - mmengine - INFO - Iter(train) [ 80/720] lr: 1.9681e-04 eta: 0:29:05 time: 2.7338 data_time: 0.0069 memory: 13969 loss: 2.1512 grad_norm: 0.8686
	01/29 21:39:13 - mmengine - INFO - Iter(train) [ 90/720] lr: 1.9563e-04 eta: 0:28:36 time: 2.7047 data_time: 0.0068 memory: 13969 loss: 2.0653 grad_norm: 0.8686
	01/29 21:39:40 - mmengine - INFO - Iter(train) [100/720] lr: 1.9426e-04 eta: 0:28:09 time: 2.7383 data_time: 0.0070 memory: 13969 loss: 1.9819 grad_norm: 0.9127

	在训练开始前，可以看到模型的输出如下所示：

	.. code:: text

	2024/01/29 21:34:58 - mmengine - INFO - before_train in EvaluateChatHook.
	2024/01/29 21:35:03 - mmengine - INFO - Sample output:
	<s><\|im_start\|>system
	You are a professional color designer. Please provide the corresponding colors based on the description of Human.
	<\|im_end\|>
	<\|im_start\|>user
	请给我一个像天空一样清澈透明的蓝色。<\|im_end\|>
	<\|im_start\|>assistant
	为了匹配您所描述的“像天空一样清澈透明的蓝色”，我建议您选择一种名为“天蓝”（Cerulean）的颜色。这种颜色通常被用来代表天空、海洋和清澈的水域，它具有明亮

	2024/01/29 21:35:07 - mmengine - INFO - Sample output:
	<s><\|im_start\|>system
	You are a professional color designer. Please provide the corresponding colors based on the description of Human.
	<\|im_end\|>
	<\|im_start\|>user
	Please give me a clear blue like the sky.<\|im_end\|>
	<\|im_start\|>assistant
	The color you're describing is a clear, bright blue that resembles the color of the sky on a sunny day. This color is often referred to as "sky blue" or "cerulean blue." Here are some specific color codes and names that

	而在训练完全完成后，可以观察到模型的输出如下所示：

	.. code:: text

	2024/01/29 22:08:07 - mmengine - INFO - Sample output:
	<s><\|im_start\|>system
	You are a professional color designer. Please provide the corresponding colors based on the description of Human.
	<\|im_end\|>
	<\|im_start\|>user
	请给我一个像天空一样清澈透明的蓝色。<\|im_end\|>
	<\|im_start\|>assistant
	#1099ee<\|im_end\|>

	2024/01/29 22:08:08 - mmengine - INFO - Sample output:
	<s><\|im_start\|>system
	You are a professional color designer. Please provide the corresponding colors based on the description of Human.
	<\|im_end\|>
	<\|im_start\|>user
	Please give me a clear blue like the sky.<\|im_end\|>
	<\|im_start\|>assistant
	#0066dd<\|im_end\|>

	模型输出的颜色如下所示：

	- 天空一样清澈透明的蓝色：\ \|image11\|

	- A clear blue like the sky: \|image12\|

	不难发现，模型在经过训练后，其输出已经完全与数据集内容所对齐了。

	.. _模型转换--lora-合并:

	模型转换 + LoRA 合并
	--------------------

	在训练完成后，我们会得到几个 ``.pth`` 文件，这些文件存储了 QLoRA
	算法训练过程所更新的参数，而\ 不是\ 模型的全部参数。因此我们需要将这些
	``.pth`` 文件转换为 HuggingFace 格式，并合并入原始的语言模型权重中。

	模型转换
	~~~~~~~~

	XTuner 已经集成好了将模型转换为 HuggingFace 格式的工具，我们只需要执行

	.. code:: bash

	# 创建存放 hf 格式参数的目录
	mkdir work_dirs/internlm2_7b_qlora_colorist_e5_copy/iter_720_hf

	# 转换格式
	xtuner convert pth_to_hf internlm2_7b_qlora_colorist_e5_copy.py \
	work_dirs/internlm2_7b_qlora_colorist_e5_copy/iter_720.pth \
	work_dirs/internlm2_7b_qlora_colorist_e5_copy/iter_720_hf

	该条转换命令将会根据配置文件 ``internlm2_7b_qlora_colorist_e5_copy.py``
	的内容，将
	``work_dirs/internlm2_7b_qlora_colorist_e5_copy/iter_720.pth`` 转换为 hf
	格式，并保存在
	``work_dirs/internlm2_7b_qlora_colorist_e5_copy/iter_720_hf`` 位置。

	LoRA 合并
	~~~~~~~~~

	XTuner 也已经集成好了合并 LoRA 权重的工具，我们只需执行如下指令：

	.. code:: bash

	# 创建存放合并后的参数的目录
	mkdir work_dirs/internlm2_7b_qlora_colorist_e5_copy/merged

	# 合并参数
	xtuner convert merge Shanghai_AI_Laboratory/internlm2-chat-7b \
	work_dirs/internlm2_7b_qlora_colorist_e5_copy/iter_720_hf \
	work_dirs/internlm2_7b_qlora_colorist_e5_copy/merged \
	--max-shard-size 2GB

	与转换命令类似，该条合并参数命令会读取原始参数路径
	``Shanghai_AI_Laboratory/internlm2-chat-7b`` 以及转换为 hf
	格式的部分参数路径
	``work_dirs/internlm2_7b_qlora_colorist_e5_copy/iter_720_hf``\ ，将两部分参数合并后保存于
	``work_dirs/internlm2_7b_qlora_colorist_e5_copy/merged``\ ，其中每个参数切片的最大文件大小为
	2GB。

	与模型对话
	----------

	在合并权重后，为了更好地体会到模型的能力，XTuner
	也集成了与模型对话的工具。通过如下命令，便可以启动一个与模型对话的简易
	Demo。

	.. code:: bash

	xtuner chat work_dirs/internlm2_7b_qlora_colorist_e5_copy/merged \
	--prompt-template internlm2_chat \
	--system-template colorist

	当然，我们也可以选择不合并权重，而是直接与 LLM + LoRA Adapter
	进行对话，我们只需要执行如下指令：

	.. code:: bash

	xtuner chat Shanghai_AI_Laboratory/internlm2-chat-7b
	--adapter work_dirs/internlm2_7b_qlora_colorist_e5_copy/iter_720_hf \
	--prompt-template internlm2_chat \
	--system-template colorist

	其中 ``work_dirs/internlm2_7b_qlora_colorist_e5_copy/merged``
	是合并后的权重路径，\ ``--prompt-template internlm2_chat``
	指定了对话模板为 InternLM2-Chat，\ ``--system-template colorist``
	则是指定了与模型对话时的 System Prompt 为 Colorist 数据集所要求的模板。

	以下是一个例子：

	.. code:: text

	double enter to end input (EXIT: exit chat, RESET: reset history) >>> 宁静而又相当明亮的浅天蓝色，介于天蓝色和婴儿蓝之间，因其亮度而带有一丝轻微的荧光感。

	#66ccff<\|im_end\|>

	其颜色如下所示：

	宁静而又相当明亮的浅天蓝色，介于天蓝色和婴儿蓝之间，因其亮度而带有一丝轻微的荧光感。：\ \|image13\|

	.. \|image1\| image:: https://img.shields.io/badge/%2366ccff-66CCFF
	.. \|image2\| image:: https://img.shields.io/badge/%2366ccff-66CCFF
	.. \|image3\| image:: https://img.shields.io/badge/%23ee0000-EE0000
	.. \|image4\| image:: https://img.shields.io/badge/%2300ffcc-00FFCC
	.. \|image5\| image:: https://img.shields.io/badge/%2366ccff-66CCFF
	.. \|image6\| image:: https://img.shields.io/badge/%23ee0000-EE0000
	.. \|image7\| image:: https://img.shields.io/badge/%2300ffcc-00FFCC
	.. \|image8\| image:: https://img.shields.io/badge/%2366ccff-66CCFF
	.. \|image9\| image:: https://img.shields.io/badge/%23ee0000-EE0000
	.. \|image10\| image:: https://img.shields.io/badge/%2300ffcc-00FFCC
	.. \|image11\| image:: https://img.shields.io/badge/天空一样清澈透明的蓝色-1099EE
	.. \|image12\| image:: https://img.shields.io/badge/A_clear_blue_like_the_sky-0066DD
	.. \|image13\| image:: https://img.shields.io/badge/宁静而又相当明亮的浅天蓝色，介于天蓝色和婴儿蓝之间，因其亮度而带有一丝轻微的荧光感。-66CCFF