shallow/teutonic-q3-4b-5dnsrzl6-bfm-v14
Parameters
4.4B
Total size
8.2 GB
Files
15
Quantization
BF16
No README on this version
Push a README.md with the model files to see it rendered here.
Model architecture
config.json- Architecture
- Qwen3ForCausalLM
- Model type
- qwen3
- Hidden size
- 2,560
- Layers
- 36
- Attention heads
- 32 (8 kv)
- FFN size
- 9,728
- Vocab size
- 151,936
- Context window
- 40K
Files
15 itemsmodel-00003-of-00010.safetensors
bb51c0264c19
942.6 MB
safetensors
model-00008-of-00010.safetensors
c8460b5affd7
942.6 MB
safetensors
model-00007-of-00010.safetensors
d65813570102
937.6 MB
safetensors
model-00002-of-00010.safetensors
5642976e6aeb
937.6 MB
safetensors
model-00001-of-00010.safetensors
c248344c9418
934.4 MB
safetensors
model-00005-of-00010.safetensors
b49f3d62829b
915.1 MB
safetensors
model-00006-of-00010.safetensors
119a80ef8d63
915.1 MB
safetensors
model-00004-of-00010.safetensors
6e7cdf36695b
910.1 MB
safetensors
model-00010-of-00010.safetensors
4fa30d1db36a
741.9 MB
safetensors
model-00009-of-00010.safetensors
c58896dad5a7
237.5 MB
safetensors
tokenizer.json
be75606093db
10.9 MB
model.safetensors.index.json
bb6418a64342
32.1 KB
config.json
9430f79acea2
1.6 KB
tokenizer_config.json
2e3bd3cb5055
693 B
generation_config.json
b77461537176
213 B