aisingapore/sea-lion-3b
AI Singapore
Text Generation
Transformers
Safetensors
11 languages
mpt
custom_code
text-generation-inference
Inference Endpoints
arxiv: 2101.09635
License: mit
4 contributors
History: 38 commits
Latest commit: b4203f4 by dotw, "add link to SEA-LION GitHub repo" (about 1 year ago)
File                      Size           Last commit                       Updated
.gitattributes            1.52 kB        initial commit                    about 1 year ago
LICENSE                   1.06 kB        Update LICENSE                    about 1 year ago
README.md                 4.78 kB        add link to SEA-LION GitHub repo  about 1 year ago
adapt_tokenizer.py        1.72 kB        Add 3B model files                about 1 year ago
attention.py              21.6 kB        Add 3B model files                about 1 year ago
blocks.py                 2.84 kB        Add 3B model files                about 1 year ago
config.json               1.27 kB        Add 3B model files                about 1 year ago
configuration_mpt.py      11 kB          Add 3B model files                about 1 year ago
custom_embedding.py       292 Bytes      Add 3B model files                about 1 year ago
fc.py                     167 Bytes      Add 3B model files                about 1 year ago
ffn.py                    1.75 kB        Add 3B model files                about 1 year ago
flash_attn_triton.py      28.2 kB        Add 3B model files                about 1 year ago
generation_config.json    91 Bytes       Add 3B model files                about 1 year ago
hf_prefixlm_converter.py  27.6 kB        Add 3B model files                about 1 year ago
meta_init_context.py      3.96 kB        Add 3B model files                about 1 year ago
model.safetensors         6.36 GB (LFS)  Add 3B model files                about 1 year ago
modeling_mpt.py           24.2 kB        Add 3B model files                about 1 year ago
norm.py                   3.12 kB        Add 3B model files                about 1 year ago
param_init_fns.py         11.9 kB        Add 3B model files                about 1 year ago
special_tokens_map.json   59 Bytes       Add 3B model files                about 1 year ago
tokenization_SEA_BPE.py   7.8 kB         Add 3B model files                about 1 year ago
tokenizer.model           4.57 MB (LFS)  Add 3B model files                about 1 year ago
tokenizer_config.json     795 Bytes      Add 3B model files                about 1 year ago