willkingsafetensorsbf16Public
rsgold/albedo-qwen3-4b-sharp-hu
sha256:14050ad65b75ff8dd63b49d81f4d7f9dbf480cb0884ce01d2bf1ed28bcbf1613·Indexed 1d ago
Parameters
4.0B
Total size
7.5 GB
Files
8
Quantization
BF16
No README on this version
Push a README.md with the model files to see it rendered here.
Model architecture
config.json- Architecture
- Qwen3ForCausalLM
- Model type
- qwen3
- Hidden size
- 2,560
- Layers
- 36
- Attention heads
- 32 (8 kv)
- FFN size
- 9,728
- Vocab size
- 151,936
- Context window
- 40K
- Dtype
- bfloat16
Files
8 itemsmodel.safetensors
668a6136996b
7.5 GB
safetensors
tokenizer.json
7c58a2ddb530
10.9 MB
vocab.json
ca10d7e9fb3e
2.6 MB
merges.txt
8831e4f1a044
1.6 MB
tokenizer_config.json
56b7c48b6198
9.3 KB
chat_template.jinja
c945a4a87850
4.0 KB
config.json
829b9758aedb
809 B
generation_config.json
3b101cd89b14
188 B