TokisakiKurumi's picture

3 7 9

TokisakiKurumi

transZ

·

TokisakiKurumi2001

AI & ML interests

NLP

Recent Activity

upvoted an article 3 days ago

What is test-time compute and how to scale it?

new activity 18 days ago

transZ/lm-eval-piqa:Convert dataset to Parquet

updated a dataset 18 days ago

transZ/lm-eval-piqa

View all activity

Organizations

None yet

transZ's activity

upvoted an article 3 days ago

Article

What is test-time compute and how to scale it?

By

and 1 other •

4 days ago

• 15

New activity in transZ/lm-eval-piqa 18 days ago

Convert dataset to Parquet

#1 opened 18 days ago by

updated a dataset 18 days ago

transZ/lm-eval-piqa

Viewer • Updated 18 days ago • 21k • 34

published a dataset 18 days ago

transZ/lm-eval-piqa

Viewer • Updated 18 days ago • 21k • 34

liked a dataset about 1 month ago

princeton-nlp/QuRatedPajama-260B

Viewer • Updated Apr 16, 2024 • 254M • 1.16k • 7

liked 2 models about 1 month ago

microsoft/phi-4

Text Generation • Updated 6 days ago • 572k • 1.71k

nvidia/Hymba-1.5B-Base

Text Generation • Updated Jan 2 • 8.44k • 138

updated 2 models about 1 month ago

transZ/Llama-3.2-3B-26

transZ/Llama-3.2-3B-27

liked a model 3 months ago

migtissera/Tess-R1-Limerick-Llama-3.1-70B

Updated Nov 6, 2024 • 17 • 20

upvoted a collection 4 months ago

LLM Reasoning Papers

Papers to improve reasoning capabilities of LLMs • 20 items • Updated 26 days ago • 115

liked a dataset 4 months ago

migtissera/Synthia-v1.5-I

Viewer • Updated Sep 30, 2024 • 20.7k • 52 • 46

liked a model 4 months ago

amd/AMD-Llama-135m

Text Generation • Updated Oct 9, 2024 • 14.5k • 111

upvoted a collection 6 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 552

updated a dataset 7 months ago

transZ/GIO

Viewer • Updated Jul 5, 2024 • 2 • 55

liked a Space 8 months ago

Can You Run It? LLM version

Determine GPU requirements for large language models

liked a model 8 months ago

Locutusque/TinyMistral-248M

Text Generation • Updated May 9, 2024 • 2.1k • 42

upvoted a collection 9 months ago

Granite Code Models

A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated Dec 18, 2024 • 182

upvoted a collection 11 months ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 227

reacted to davanstrien's post with 👍 11 months ago

Post

KTO offers an easier way to preference train LLMs (only 👍👎 ratings are required). As part of #DataIsBetterTogether, I've written a tutorial on creating a preference dataset using Argilla and Spaces.

Using this approach, you can create a dataset that anyone with a Hugging Face account can contribute to 🤯

See an example of the kind of Space you can create following this tutorial here: davanstrien/haiku-preferences

🆕 New tutorial covers:
💬 Generating responses with open models
👥 Collecting human feedback (do you like this model response? Yes/No)
🤖 Preparing a TRL-compatible dataset for training aligned models

Check it out here: https://github.com/huggingface/data-is-better-together/tree/main/kto-preference

2 replies

·