base: include Llama base model and medusa head head: only medusa head ## convert 1. prepare medusa ```bash git clone https://github.com/FasterDecoding/Medusa.git cd Medusa pip install -e . ``` 2. python3 medusa_model_demo.py -m base -o head --use-full-key --verbose