gupta-tanish/Ultrafeedback-llama3-8b-Instruct-kmeans-selection-4vs4 Viewer • Updated about 6 hours ago • 61.3k
gupta-tanish/Ultrafeedback-llama3-8b-Instruct-kmeans-selection-4vs4 Viewer • Updated about 6 hours ago • 61.3k
gupta-tanish/Ultrafeedback-llama3-8b-Instruct-top4vsbottom4-selection Viewer • Updated about 9 hours ago • 60.8k
gupta-tanish/Ultrafeedback-llama3-8b-Instruct-top4vsbottom4-selection Viewer • Updated about 9 hours ago • 60.8k
gupta-tanish/verified-q-alignment-dynamic-preference-data-cur-score Viewer • Updated about 13 hours ago • 10.1k • 26
gupta-tanish/verified-q-alignment-dynamic-preference-data-cur-score Viewer • Updated about 13 hours ago • 10.1k • 26