- Training Large Language Models to Reason in a Continuous Latent Space. Paper, arXiv:2412.06769, published 16 days ago.
- Large Language Monkeys: Scaling Inference Compute with Repeated Sampling. Paper, arXiv:2407.21787, published Jul 31.
- LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning. Paper, arXiv:2410.02884, published Oct 3.
- MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding. Paper, arXiv:2408.11049, published Aug 20.
- Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Paper, arXiv:2406.08464, published Jun 12.
- Sparse Foundational Llama 2 Models. Collection: sparse pre-trained and fine-tuned Llama models by Neural Magic and Cerebras; 27 items, updated Sep 26.
- A little guide to building Large Language Models in 2024. Collection: resources mentioned by @thomwolf in https://x.com/Thom_Wolf/status/1773340316835131757; 19 items, updated Apr 1.
- RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation. Paper, arXiv:2403.05313, published Mar 8.
- Simple and Scalable Strategies to Continually Pre-train Large Language Models. Paper, arXiv:2403.08763, published Mar 13.