QuantPanda

AI & ML interests

None yet

Organizations

None yet

QuantPanda's activity

liked a model 8 days ago

huihui-ai/DeepSeek-V3-abliterated

Updated 2 days ago • 71

liked a model 11 days ago

huihui-ai/kanana-nano-2.1b-instruct-abliterated

Text Generation • Updated 12 days ago • 433 • 3

reacted to MoritzLaurer's post with 😔 about 2 months ago

FACTS is a great paper from @GoogleDeepMind on measuring the factuality of LLM outputs. You can now download their prompt templates from @huggingface to improve LLM-based fact-checking yourself!

📏 The paper introduces the FACTS Grounding benchmark for evaluating the factuality of LLM outputs.

🤖 Fact-checking is automated by an ensemble of LLM judges that verify if a response is fully grounded in a factual reference document.

🧪 The authors tested different prompt templates on held-out data to ensure their generalization.

📚 It's highly educational to read these templates to learn how frontier labs design prompts and understand their limitations.

💾 You can now download and reuse these prompt templates via the prompt-templates library!

🔄 The library simplifies sharing prompt templates on the HF hub or locally via standardized YAML files. Let’s make LLM work more transparent and reproducible by sharing more templates like this!

Links 👇
- prompt-templates docs: https://moritzlaurer.github.io/prompt_templates/
- all templates on the HF Hub: https://huggingface.co/datasets/MoritzLaurer/facts-grounding-prompts
- FACTS paper: https://storage.googleapis.com/deepmind-media/FACTS/FACTS_grounding_paper.pdf

reacted to lewtun's post with 🔥 2 months ago

I was initially pretty sceptical about Meta's Coconut paper [1] because the largest perf gains were reported on toy linguistic problems. However, these results on machine translation are pretty impressive!

https://x.com/casper_hansen_/status/1875872309996855343

Together with the recent PRIME method [2] for scaling RL, reasoning for open models is looking pretty exciting for 2025!

[1] Training Large Language Models to Reason in a Continuous Latent Space (arXiv:2412.06769)
[2] https://huggingface.co/blog/ganqu/prime