- RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
  Paper • 2409.10516 • Published • 41
- Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse
  Paper • 2409.11242 • Published • 7
- Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models
  Paper • 2409.11136 • Published • 23
- On the Diagram of Thought
  Paper • 2409.10038 • Published • 14
Tongyao PRO
tyzhu
AI & ML interests
Natural Language Processing
Organizations
None yet
Collections
1
Models
231
tyzhu/tiny_LLaMA_1b_8k_intradm4_proweb_8k_iter-200000-ckpt-step-100000_hf • Updated • 41
tyzhu/tiny_LLaMA_1b_8k_intradm1_proweb_8k_iter-200000-ckpt-step-100000_hf • Updated • 42
tyzhu/tiny_LLaMA_1b_32k_cc_32k_iter-100000-ckpt-step-100000_hf • Updated • 93
tyzhu/tiny_LLaMA_1b_32k_dm2_cc_32k_iter-100000-ckpt-step-100000_hf • Updated • 61
tyzhu/tiny_LLaMA_120M_2k_cc_repeat_2k_iter-060000-ckpt-step-15000_hf • Updated
tyzhu/tiny_LLaMA_120M_8k_cc_8k_iter-400000-ckpt-step-100000_hf • Updated • 12
tyzhu/tiny_LLaMA_1b_8k_intramask_cc_8k_iter-480000-ckpt-step-60000_hf • Text Generation • Updated • 71
tyzhu/tiny_LLaMA_1b_8k_intramask_cc_8k_iter-320000-ckpt-step-40000_hf • Text Generation • Updated • 197
tyzhu/tiny_LLaMA_1b_8k_cc_8k_iter-400000-ckpt-step-50000_hf • Text Generation • Updated • 112
tyzhu/tiny_LLaMA_1b_2k_cc_2k_iter-400000-ckpt-step-50000_hf • Updated • 15
Datasets
821
tyzhu/arc_c_tr • Viewer • Updated • 2.32k • 5
tyzhu/arc_e_tr • Viewer • Updated • 9.82k • 8
tyzhu/hellaswag_tr • Viewer • Updated • 17.5k • 33
tyzhu/tpo • Viewer • Updated • 269 • 60
tyzhu/quality • Viewer • Updated • 173 • 86
tyzhu/the-stack-py • Viewer • Updated • 16.3M • 145 • 1
tyzhu/pystack_clean • Viewer • Updated • 9.44M • 59
tyzhu/benchmark_root_jan7 • Preview • Updated • 1
tyzhu/id_cc_pool • Viewer • Updated • 72.5M • 486
tyzhu/proweb • Viewer • Updated • 46.3M • 37