shallow/teutonic-q3-4b-5dnsrzl6-bfm-v11
Parameters: 4.0B
Total size: 7.5 GB
Files: 14
Quantization: BF16
No README on this version
Model architecture
config.json
- Architecture: Qwen3ForCausalLM
- Model type: qwen3
- Hidden size: 2,560
- Layers: 36
- Attention heads: 32 (8 KV)
- FFN size: 9,728
- Vocab size: 151,936
- Context window: 40K
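As a sanity check on the figures above, the following is a minimal sketch that estimates the parameter count from the listed config values. Several inputs are assumptions that do not appear in this listing: a per-head width of 128 (`HEAD_DIM`), tied input/output embeddings (`TIED_EMBEDDINGS`), bias-free projections, a gated MLP with three weight matrices, and one normalization weight vector per norm. The real values would need to be confirmed against config.json.

```python
# Values taken from the architecture listing above.
HIDDEN = 2560
LAYERS = 36
Q_HEADS = 32
KV_HEADS = 8
FFN = 9728
VOCAB = 151_936

# Assumptions, NOT shown in the listing; confirm against config.json.
HEAD_DIM = 128          # assumed per-head width (not HIDDEN // Q_HEADS)
TIED_EMBEDDINGS = True  # assumed: output head reuses the embedding matrix


def estimate_parameters() -> int:
    """Rough parameter count for a decoder-only transformer with
    grouped-query attention and a gated (three-matrix) MLP, no biases."""
    embed = VOCAB * HIDDEN

    # Attention: query and output projections use all query heads,
    # key and value projections use only the KV heads.
    q_proj = HIDDEN * Q_HEADS * HEAD_DIM
    kv_proj = 2 * HIDDEN * KV_HEADS * HEAD_DIM
    o_proj = Q_HEADS * HEAD_DIM * HIDDEN
    qk_norm = 2 * HEAD_DIM  # assumed per-head query/key norm weights
    attention = q_proj + kv_proj + o_proj + qk_norm

    mlp = 3 * HIDDEN * FFN      # gate, up, and down projections
    layer_norms = 2 * HIDDEN    # one norm before attention, one before MLP
    per_layer = attention + mlp + layer_norms

    total = embed + LAYERS * per_layer + HIDDEN  # plus the final norm
    if not TIED_EMBEDDINGS:
        total += VOCAB * HIDDEN  # separate output projection
    return total


if __name__ == "__main__":
    n = estimate_parameters()
    print(f"{n:,} parameters (~{n / 1e9:.2f}B)")
```

Under these assumptions the estimate lands close to the 4.0B listed for this model. With untied embeddings it would be roughly 0.39B higher, so the listed figure is at least consistent with the tied-embedding assumption.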
Files (14 items)
model-00003-of-00009.safetensors  196c9ee91d62  942.6 MB  safetensors
model-00008-of-00009.safetensors  4a0691439235  942.6 MB  safetensors
model-00007-of-00009.safetensors  13df44224a8d  937.6 MB  safetensors
model-00002-of-00009.safetensors  86e4b360b4e9  937.6 MB  safetensors
model-00001-of-00009.safetensors  dd643de4d46e  934.4 MB  safetensors
model-00005-of-00009.safetensors  51d82da63c0f  915.1 MB  safetensors
model-00006-of-00009.safetensors  8a10904c8d0a  915.1 MB  safetensors
model-00004-of-00009.safetensors  182fc6ae986e  910.1 MB  safetensors
model-00009-of-00009.safetensors  efe6355d427b  237.5 MB  safetensors
tokenizer.json                    be75606093db  10.9 MB
model.safetensors.index.json      842f6de59568  32.0 KB
config.json                       ec1361bd4006  1.6 KB
tokenizer_config.json             5f007d04324a  696 B
generation_config.json            1c9ab72f98c3  213 B
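The listed shard sizes can be cross-checked against the stated total size and parameter count. The sketch below sums the nine safetensors shards and converts the total to a parameter estimate at 2 bytes per BF16 weight. It assumes the listing uses binary units (1 MB = 1024 × 1024 bytes), which is a guess based on the shard total divided by 1024 coming out near the stated 7.5 GB; it also ignores the small safetensors header in each shard.

```python
# Shard sizes in MB, copied from the file listing above.
SHARD_SIZES_MB = {
    "model-00001-of-00009.safetensors": 934.4,
    "model-00002-of-00009.safetensors": 937.6,
    "model-00003-of-00009.safetensors": 942.6,
    "model-00004-of-00009.safetensors": 910.1,
    "model-00005-of-00009.safetensors": 915.1,
    "model-00006-of-00009.safetensors": 915.1,
    "model-00007-of-00009.safetensors": 937.6,
    "model-00008-of-00009.safetensors": 942.6,
    "model-00009-of-00009.safetensors": 237.5,
}

BYTES_PER_PARAM = 2          # BF16 stores each weight in 16 bits
BYTES_PER_MB = 1024 * 1024   # assumption: the listing uses binary units


def total_shard_mb(sizes: dict[str, float] = SHARD_SIZES_MB) -> float:
    """Sum of all weight shard sizes in MB."""
    return sum(sizes.values())


def estimate_parameters(total_mb: float, bytes_per_mb: int = BYTES_PER_MB) -> float:
    """Approximate parameter count implied by the on-disk weight size."""
    return total_mb * bytes_per_mb / BYTES_PER_PARAM


if __name__ == "__main__":
    total = total_shard_mb()
    print(f"shards: {total:.1f} MB (~{total / 1024:.1f} GB)")
    print(f"implied parameters: ~{estimate_parameters(total) / 1e9:.2f}B")
```

If the listing actually uses decimal megabytes, passing `bytes_per_mb=1_000_000` gives a somewhat lower estimate, so the result should be treated as approximate either way.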