10086 14 204

Tien Dung

tiendung

tiendung

AI & ML interests

None yet

Recent Activity

updated a Space 12 days ago

Symato/tomtat

liked a dataset about 1 month ago

microsoft/orca-agentinstruct-1M-v1

updated a collection about 1 month ago

RAG

View all activity

Articles

Ưu tiên có thể diễn giải thông qua Mô hình Phần thưởng Đa mục tiêu và Hỗn hợp Chuyên gia

Sep 29

• 1

Organizations

tiendung's activity

updated a Space 12 days ago

Running

📉

Tóm Tắt dot AI

Tóm tắt và chat với các nội dung từ các links được cung cấp

liked a dataset about 1 month ago

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1 • 1.05M • 13.1k • 404

updated a collection about 1 month ago

RAG

Collection

liked a model about 1 month ago

5CD-AI/ColVintern-1B-v1

Feature Extraction • Updated Nov 14 • 295 • 6

liked a model about 2 months ago

ltg/gpt-bert-babylm-base

Updated Nov 11 • 6.45k • 6

liked 3 datasets about 2 months ago

upvoted a paper about 2 months ago

Unifying Multimodal Retrieval via Document Screenshot Embedding

Paper • 2406.11251 • Published Jun 17 • 9

liked a model about 2 months ago

MrLight/dse-qwen2-2b-mrl-v1

Updated Oct 4 • 15.1k • 38

liked a Space about 2 months ago

Running

📉

Tóm Tắt dot AI

Tóm tắt và chat với các nội dung từ các links được cung cấp

updated a collection about 2 months ago

RAG

Collection

liked a dataset about 2 months ago

5CD-AI/Vietnamese-THUIR-T2Ranking-gg-translated

Viewer • Updated Jun 5 • 361M • 127 • 19

upvoted a collection about 2 months ago

new architecture

Collection

20 items • Updated 8 days ago • 3

upvoted a paper about 2 months ago

GPT or BERT: why not both?

Paper • 2410.24159 • Published Oct 31 • 14

reacted to singhsidhukuldeep's post with 👀 about 2 months ago

Post

2098

Exciting Research Alert: Revolutionizing Dense Passage Retrieval with Entailment Tuning!

The good folks at HKUST have developed a novel approach that significantly improves information retrieval by leveraging natural language inference.

The entailment tuning approach consists of several key steps to enhance dense passage retrieval performance.

Data Preparation
- Convert questions into existence claims using rule-based transformations.
- Combine retrieval data with NLI data from SNLI and MNLI datasets.
- Unify the format of both data types using a consistent prompting framework.

Entailment Tuning Process
- Initialize the model using pre-trained language models like BERT or RoBERTa.
- Apply aggressive masking (β=0.8) specifically to the hypothesis components while preserving premise information.
- Train the model to predict the masked hypothesis tokens from the premise content.
- Run the training for 10 epochs using 8 GPUs, taking approximately 1.5-3.5 hours.

Training Arguments for Entailment Tuning (Yes! They Shared Them)
- Use a learning rate of 2e-5 with 100 warmup steps.
- Set batch size to 128.
- Apply weight decay of 0.01.
- Utilize the Adam optimizer with beta values (0.9, 0.999).
- Maintain maximum gradient norm at 1.0.

Deployment
- Index passages using FAISS for efficient retrieval.
- Shard vector store across multiple GPUs.
- Enable sub-millisecond retrieval of the top-100 passages per query.

Integration with Existing Systems
- Insert entailment tuning between pre-training and fine-tuning stages.
- Maintain compatibility with current dense retrieval methods.
- Preserve existing contrastive learning approaches during fine-tuning.

Simple, intuitive, and effective!

This advancement significantly improves the quality of retrieved passages for question-answering systems and retrieval-augmented generation tasks.