Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kevin1020 's Collections
RAG
Prompting
Inference Acceleration
LLM Agents
Code Generation
Efficient Tuning
Token Compression
Efficient VLM via Image Token Compression
VLM
Long Context
Reasoning
Visualizations
Forward tuning
PEFT
ViT
Modular
Benchmarks
Efficient LLM

RAG

updated Oct 16, 2024
Upvote
1

  • Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

    Paper • 2310.11511 • Published Oct 17, 2023 • 75

  • REST: Retrieval-Based Speculative Decoding

    Paper • 2311.08252 • Published Nov 14, 2023

  • Active Retrieval Augmented Generation

    Paper • 2305.06983 • Published May 11, 2023 • 3

  • Retrieval-Augmented Generation for Large Language Models: A Survey

    Paper • 2312.10997 • Published Dec 18, 2023 • 10

  • RAFT: Adapting Language Model to Domain Specific RAG

    Paper • 2403.10131 • Published Mar 15, 2024 • 67

  • Larimar: Large Language Models with Episodic Memory Control

    Paper • 2403.11901 • Published Mar 18, 2024 • 32

  • OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs

    Paper • 2409.05152 • Published Sep 8, 2024 • 31

  • MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery

    Paper • 2409.05591 • Published Sep 9, 2024 • 30

  • VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

    Paper • 2410.10594 • Published Oct 14, 2024 • 24
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs