a51/teutonic-q3-4b-5hgmbeef-t5139
Parameters: 4.0B
Total size: 7.5 GB
Files: 16
Quantization: BF16
No README on this version
Model architecture (from config.json)
Architecture: Qwen3ForCausalLM
Model type: qwen3
Hidden size: 2,560
Layers: 36
Attention heads: 32 (8 kv)
FFN size: 9,728
Vocab size: 151,936
Context window: 40K
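As a rough cross-check, the architecture values listed above can be turned into a parameter estimate. The sketch below is not taken from the repository. It assumes a per-head dimension of 128, input and output embeddings that share one matrix, bias-free linear layers, and per-head RMSNorm on queries and keys. None of those four details appears on this page, so treat them as assumptions to confirm against config.json.

```python
# Values copied from the architecture listing on this page.
HIDDEN = 2560
LAYERS = 36
Q_HEADS = 32
KV_HEADS = 8
FFN = 9728
VOCAB = 151_936

# ASSUMPTIONS (not stated on this page; check config.json):
HEAD_DIM = 128          # assumed per-head dimension
TIED_EMBEDDINGS = True  # assumed: output head reuses the embedding matrix


def estimate_params(hidden=HIDDEN, layers=LAYERS, q_heads=Q_HEADS,
                    kv_heads=KV_HEADS, ffn=FFN, vocab=VOCAB,
                    head_dim=HEAD_DIM, tied=TIED_EMBEDDINGS) -> int:
    """Estimate the parameter count of a decoder-only transformer with
    grouped-query attention and a gated (three-matrix) feed-forward block.
    Assumes no bias terms anywhere."""
    q_dim = q_heads * head_dim
    kv_dim = kv_heads * head_dim

    # Attention: query, key, value, and output projections.
    attention = hidden * q_dim + 2 * hidden * kv_dim + q_dim * hidden
    # Gated feed-forward: gate, up, and down projections.
    feed_forward = 3 * hidden * ffn
    # Two RMSNorm weights per layer, plus assumed per-head q/k norms.
    norms = 2 * hidden + 2 * head_dim

    per_layer = attention + feed_forward + norms
    embeddings = vocab * hidden * (1 if tied else 2)
    final_norm = hidden
    return embeddings + layers * per_layer + final_norm


total = estimate_params()
bf16_bytes = total * 2  # BF16 stores each parameter in 2 bytes

print(f"parameters: {total:,}")
print(f"BF16 size: {bf16_bytes / 1e9:.2f} GB (decimal), "
      f"{bf16_bytes / 2**30:.2f} GiB (binary)")
```

Under these assumptions the estimate is about 4.02 billion parameters, or about 7.5 GiB at 2 bytes each. That agrees with the listed 4.0B and 7.5 GB, provided the page reports sizes in binary units. The agreement is suggestive only and does not confirm the assumed values.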
Files (16 items)
File                              Hash          Size      Format
model-00008-of-00009.safetensors  2397dcca3cac  942.6 MB  safetensors
model-00003-of-00009.safetensors  3d9572c61f26  942.6 MB  safetensors
model-00007-of-00009.safetensors  d997dc743129  937.6 MB  safetensors
model-00002-of-00009.safetensors  b0f8d317c1c1  937.6 MB  safetensors
model-00001-of-00009.safetensors  96ea4cd21cf4  934.4 MB  safetensors
model-00006-of-00009.safetensors  d4c3770e0aee  915.1 MB  safetensors
model-00005-of-00009.safetensors  d0355bb509f6  915.1 MB  safetensors
model-00004-of-00009.safetensors  ab2d852cf2a5  910.1 MB  safetensors
model-00009-of-00009.safetensors  9d425b26c740  237.5 MB  safetensors
tokenizer.json                    be75606093db  10.9 MB
model.safetensors.index.json      549e24ee5dfe  32.1 KB
config.json                       cba674823863  1.6 KB
q3_train_state.json               1794abf2dc94  700 B
tokenizer_config.json             2e3bd3cb5055  693 B
teutonic_model_store.json         87a0b4e473e9  547 B
generation_config.json            b77461537176  213 B
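The per-file sizes in the listing can be summed to check them against the 7.5 GB total reported at the top of the page. The sketch below hard-codes the sizes shown here and leaves out the six files under 1 MB, which together add only about 36 KB. Whether the page uses decimal (1000) or binary (1024) multiples is not stated, so both conversions are printed.

```python
# Shard sizes in MB, copied from the file listing in the order shown.
SHARD_SIZES_MB = [942.6, 942.6, 937.6, 937.6, 934.4,
                  915.1, 915.1, 910.1, 237.5]
TOKENIZER_MB = 10.9  # tokenizer.json; the remaining files are under 1 MB


def total_mb(shards, extras=0.0):
    """Sum shard sizes plus any extra files, all in MB."""
    return sum(shards) + extras


weights_mb = total_mb(SHARD_SIZES_MB)
all_mb = total_mb(SHARD_SIZES_MB, TOKENIZER_MB)

print(f"weight shards: {weights_mb:.1f} MB")
print(f"with tokenizer: {all_mb:.1f} MB")
print(f"as GB, dividing by 1024: {all_mb / 1024:.2f}")
print(f"as GB, dividing by 1000: {all_mb / 1000:.2f}")
```

Dividing by 1024 reproduces the listed 7.5 GB, while dividing by 1000 gives roughly 7.7, so the page most likely reports binary units. That is an inference from the numbers, not something the page states.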