mainsafetensorsbf16Public
robert/albedo-qwen3-4b-caution11
sha256:4bd6882e8f9ffbe5012baee740309bec6adfba827ae3f9a2009e7b312a342f1c·Indexed 20h ago
Parameters
4.0B
Total size
7.5 GB
Files
8
Quantization
BF16
No README on this version
Push a README.md with the model files to see it rendered here.
Model architecture
config.json- Architecture
- Qwen3ForCausalLM
- Model type
- qwen3
- Hidden size
- 2,560
- Layers
- 36
- Attention heads
- 32 (8 kv)
- FFN size
- 9,728
- Vocab size
- 151,936
- Context window
- 40K
- Dtype
- bfloat16
Files
8 itemsmodel-00001-of-00002.safetensors
23c85e4d85c5
4.7 GB
safetensors
model-00002-of-00002.safetensors
529ccec30e86
2.8 GB
safetensors
tokenizer.json
be75606093db
10.9 MB
model.safetensors.index.json
38719f5264e9
32.0 KB
tokenizer_config.json
aaf1e3542bcc
4.7 KB
chat_template.jinja
e132ae041e12
4.1 KB
config.json
018398443a85
727 B
generation_config.json
535ef5c50ad9
214 B