LagPixelLOL

v2ray

AI & ML interests

Looking for compute sponsors. Please contact me by email at [email protected]!

Recent Activity

updated a model 4 minutes ago
v2ray/GPT4chan-8B-AWQ
updated a model 4 minutes ago
v2ray/GPT4chan-8B-FP8
updated a model 4 minutes ago
v2ray/GPT4chan-8B

Organizations

v2AI Foundation, Unofficial Mistral Community, Social Post Explorers, Cognitive Computations, Hugging Face Discord Community

v2ray's activity

posted an update 36 minutes ago
GPT4chan Series Release

GPT4chan is a series of models I trained on the v2ray/4chan dataset, which is based on lesserfield/4chan-datasets. The dataset contains mostly posts from 2023. Not every board is included; for example, /pol/ is NOT included. To see which boards are included, visit v2ray/4chan.

This release contains two model sizes, 8B and 24B. The 8B model is based on meta-llama/Llama-3.1-8B and the 24B model is based on mistralai/Mistral-Small-24B-Base-2501.

Why did I make these models? Because for a long time after the original gpt-4chan model, there haven't been any serious fine-tunes on 4chan datasets. 4chan is a good data source since it contains coherent replies and nice topics. It's fun to talk to an AI-generated version of 4chan and get instant replies without needing to actually visit 4chan. You can also, to some extent, analyze the content and behavior of 4chan posts by probing the model's outputs.

Disclaimer: The GPT4chan models should only be used for research purposes. The outputs they generate do not represent my views on the subjects. Moderate the responses before posting them online.

Model links:

Full model:
- v2ray/GPT4chan-8B
- v2ray/GPT4chan-24B

Adapter:
- v2ray/GPT4chan-8B-QLoRA
- v2ray/GPT4chan-24B-QLoRA

AWQ:
- v2ray/GPT4chan-8B-AWQ
- v2ray/GPT4chan-24B-AWQ

FP8:
- v2ray/GPT4chan-8B-FP8
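
For picking one of the repositories above programmatically, here is a minimal sketch in Python. The helper names `repo_id` and `load_full_model` and the variant keys (`full`, `qlora`, `awq`, `fp8`) are made up for illustration; only the repository IDs come from the list above. Loading assumes the Hugging Face `transformers` package is installed and that the full-model repositories load with the standard Auto classes, which the post does not state explicitly. No prompt format is shown because the post does not describe one.

```python
# Repository IDs copied from the model links above.
# Only the entries that appear in the list are included.
GPT4CHAN_REPOS = {
    ("8B", "full"): "v2ray/GPT4chan-8B",
    ("24B", "full"): "v2ray/GPT4chan-24B",
    ("8B", "qlora"): "v2ray/GPT4chan-8B-QLoRA",
    ("24B", "qlora"): "v2ray/GPT4chan-24B-QLoRA",
    ("8B", "awq"): "v2ray/GPT4chan-8B-AWQ",
    ("24B", "awq"): "v2ray/GPT4chan-24B-AWQ",
    ("8B", "fp8"): "v2ray/GPT4chan-8B-FP8",
}


def repo_id(size: str, variant: str = "full") -> str:
    """Return the repository ID for a model size and variant (hypothetical helper)."""
    key = (size.upper(), variant.lower())
    if key not in GPT4CHAN_REPOS:
        raise KeyError(f"no listed repository for size={size!r}, variant={variant!r}")
    return GPT4CHAN_REPOS[key]


def load_full_model(size: str = "8B"):
    """Load tokenizer and weights for a full model (assumption: standard Auto classes work).

    This downloads the weights from the Hugging Face Hub, so it needs
    network access and enough disk space and memory for the chosen size.
    """
    # Imported lazily so the lookup helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    name = repo_id(size, "full")
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForCausalLM.from_pretrained(name)
    return tokenizer, model


if __name__ == "__main__":
    print(repo_id("8b", "awq"))
```

The lookup raises `KeyError` for combinations that are not in the list, so a typo fails early instead of triggering a download attempt for a repository that may not exist.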
New activity in cognitivecomputations/DeepSeek-V3-AWQ 5 days ago
New activity in cognitivecomputations/DeepSeek-R1-AWQ 7 days ago

Deployment framework

#2 opened 12 days ago by
xro7
New activity in cognitivecomputations/DeepSeek-R1-AWQ 12 days ago

Smaller deepseek models?

#1 opened 14 days ago by
loshka2