arxiv:2402.13991
Szymon Tworkowski
syzymon
AI & ML interests
Language models, theorem proving and much more!
Recent Activity
authored a paper • 10 days ago • Magnushammer: A Transformer-based Approach to Premise Selection
authored a paper • 10 days ago • Structured Packing in LLM Training Improves Long Context Utilization
authored a paper • 10 days ago • Hierarchical Transformers Are More Efficient Language Models
Organizations
None yet
Models (5)
syzymon/long_llama_code_7b_instruct • Text Generation • Updated • 26 • 11
syzymon/long_llama_code_7b • Text Generation • Updated • 12 • 31
syzymon/long_llama_3b • Text Generation • Updated • 84 • 120
syzymon/long_llama_3b_instruct • Text Generation • Updated • 48 • 25
syzymon/long_llama_3b_v1_1 • Text Generation • Updated • 55 • 10
Datasets
None public yet