mainsafetensorsbf16Public
superstar/albedo-qwen3-4b-v1
sha256:5fd839b885de236f2270cfab70d4ed7c6f55883e32a4cfa75863c31d50134eb8·Indexed 1d ago
Parameters
4.0B
Total size
7.5 GB
Files
13
Quantization
BF16
No README on this version
Push a README.md with the model files to see it rendered here.
Model architecture
config.json- Architecture
- Qwen3ForCausalLM
- Model type
- qwen3
- Hidden size
- 2,560
- Layers
- 36
- Attention heads
- 32 (8 kv)
- FFN size
- 9,728
- Vocab size
- 151,936
- Context window
- 40K
- Dtype
- bfloat16
Files
13 itemsmodel-00002-of-00003.safetensors
f45022a8e265
3.7 GB
safetensors
model-00001-of-00003.safetensors
f33efc0d32c6
3.7 GB
safetensors
model-00003-of-00003.safetensors
eb46497707d4
95.0 MB
safetensors
tokenizer.json
7c58a2ddb530
10.9 MB
vocab.json
ca10d7e9fb3e
2.6 MB
merges.txt
8831e4f1a044
1.6 MB
model.safetensors.index.json
5b36dbd79cdd
32.1 KB
tokenizer_config.json
e50f6e0b1e23
9.3 KB
chat_template.jinja
c945a4a87850
4.0 KB
config.json
829b9758aedb
809 B
added_tokens.json
c0284b582e14
707 B
generation_config.json
3b101cd89b14
188 B
special_tokens_map.json
b15f1f34e32d
175 B