Update README.md
README.md
This is the first model that converts Qwen2.5-7B's checkpoint to the RWKV-7 architecture.
It was trained on a single server with 8xA800 GPUs for one day, so it might not be very versatile, but it shows acceptable performance and can chat with you fluently.
The main shortcoming is that this base model can't do math and related tasks. I'll add more balanced data later to improve the model's capability.
Please refer to https://github.com/yynil/RWKVinLLAMA/blob/rwkv_7/tests/test_chat_cli.py for how to use it.
```bash
python tests/test_chat_cli.py --config_file configs/qwen_7b.yaml --is_hybrid --num_gpus 1
```
You may need to edit the configuration YAML so that its checkpoint path points to the checkpoint you downloaded.
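The exact key names depend on the repository's config format, so check `configs/qwen_7b.yaml` for the actual field; the snippet below is a minimal sketch, assuming a hypothetical `ckpt_file` key, of how you might rewrite the path with PyYAML before launching the script.

```python
# Minimal sketch: point the config at your downloaded checkpoint before
# running test_chat_cli.py. The key name "ckpt_file" is an assumption;
# replace it with the actual field used in configs/qwen_7b.yaml.
import yaml

CONFIG_PATH = "configs/qwen_7b.yaml"
CHECKPOINT_PATH = "/path/to/downloaded/checkpoint.pth"  # your local download

with open(CONFIG_PATH) as f:
    config = yaml.safe_load(f)

config["ckpt_file"] = CHECKPOINT_PATH  # hypothetical key; adjust as needed

with open(CONFIG_PATH, "w") as f:
    yaml.dump(config, f, sort_keys=False)
```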