arxiv:2412.04626
Suyuchen Wang
sheryc
AI & ML interests
Playing with LLMs
Recent Activity
liked
a dataset
7 days ago
ServiceNow/BigDocs-Bench
authored
a paper
7 days ago
BigDocs: An Open and Permissively-Licensed Dataset for Training
Multimodal Models on Document and Code Tasks
Organizations
models
None public yet
datasets
20
sheryc/instruct-concat-100-tokenized-llama3
Viewer
•
Updated
•
16k
•
36
sheryc/instruct-concat-100-tokenized-llama1-2
Viewer
•
Updated
•
16k
•
33
sheryc/wiki40b_it_test_1k_instances_processed
Viewer
•
Updated
•
1k
•
39
sheryc/wiki40b_it_test_1k_instances_processed_keep_title
Viewer
•
Updated
•
1k
•
33
sheryc/wiki40b_en_test_1k_instances_processed
Viewer
•
Updated
•
1k
•
33
sheryc/wiki40b_en_test_1k_instances_processed_keep_title
Viewer
•
Updated
•
1k
•
35
sheryc/wiki40b_fr_test_1k_instances_processed
Viewer
•
Updated
•
1k
•
36
sheryc/wiki40b_ja_test_1k_instances_processed_keep_title
Viewer
•
Updated
•
1k
•
37
sheryc/wiki40b_ko_test_1k_instances_processed_keep_title
Viewer
•
Updated
•
1k
•
32
sheryc/wiki40b_fr_test_1k_instances_processed_keep_title
Viewer
•
Updated
•
1k
•
35