Model Card for Model ID

image

Merged model using mergekit

This model aim to make a agent system with keeping given our waifu persona.

Merge Format

merge_method: model_stock
models:
    - model: Nexusflow/Athene-V2-Chat
    - model: Nexusflow/Athene-V2-Agent
    - model: Sao10K/72B-Qwen2.5-Kunou-v1
    - model: AXCXEPT/EZO-Qwen2.5-72B-Instruct
    - model: /root/data/ywnam/LLM/model_weights/Qwen/Qwen2.5-72B-Instruct/sft_vn_ver_2.0.1_dequantized_merged
    - model: EVA-UNIT-01/EVA-Qwen2.5-72B-v0.1
    - model: anthracite-org/magnum-v4-72b
base_model: Qwen/Qwen2.5-72B-Instruct
dtype: bfloat16
tokenizer_source: base

Model Details

Model Description

  • Developed by: spow12(yw_nam)
  • Shared by : spow12(yw_nam)
  • Model type: CausalLM
  • Language(s) (NLP): japanese, english
  • Finetuned from model : Qwen/Qwen2.5-72B-Instruct

Chat Format

<|im_start|>system
This is the system prompt.<|im_end|>
<|im_start|>user
Instructions placed here.<|im_end|>
<|im_start|>assistant
The model's response will be here.<|im_end|>

Dataset

SFT

  • Riddle Joker(Prviate)
  • Café Stella and the Reaper's Butterflies(Private)
  • Senren*Banka(Private)
  • roleplay4fun/aesir-v1.1
  • kalomaze/Opus_Instruct_3k
  • Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
  • Nopm/Opus_WritingStruct
  • PJMixers/hieunguyenminh_roleplay-deduped-ShareGPT
  • anthracite-org/stheno-filtered-v1.1
  • SicariusSicariiStuff/Bluemoon_Top50MB_Sorted_Fixed
  • Aratako/Magpie-Tanuki-8B-97k
  • Aratako_Synthetic_JP_EN_Coding_Dataset_801k
  • Aratako/Synthetic-Japanese-Roleplay-gpt-4o-mini-39.6k-formatted
  • Aratako/Synthetic-Japanese-Roleplay-NSFW-Claude-3.5s-15.3k-formatted
  • Aratako_Synthetic_JP_EN_Translation_Dataset_Magpie_Nemotron
  • Aratako_Rosebleu_1on1_Dialogues_RP
  • Team-ACE/ToolACE
  • SkunkworksAI/reasoning-0.01
  • HuggingFaceTB/smoltalk
  • microsoft_orca_agentinstruct_1M_v1
  • Aratako/Magpie-Tanuki-8B-97k

Use & Credit

This model is currently available for non-commercial & Research purpose only. Also, since I'm not detailed in licensing, I hope you use it responsibly.

By sharing this model, I hope to contribute to the research efforts of our community (the open-source community and Waifu Lovers).

Citation

@misc {ChatWaifu_12B_v2.0,
    author       = { YoungWoo Nam },
    title        = { spow12/ChatWaifu_72B_v2.2 },
    year         = 2024,
    url          = { https://huggingface.co/spow12/ChatWaifu_72B_v2.2 },
    publisher    = { Hugging Face }
}
Downloads last month
50
Safetensors
Model size
72.7B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for spow12/ChatWaifu_72B_v2.2

Datasets used to train spow12/ChatWaifu_72B_v2.2

Collection including spow12/ChatWaifu_72B_v2.2