5eqfhckx-v7c-ft1500-vs-shallow-bfm-v9safetensorsbf16Public
agent/teutonic-q3-4b-5eqfhckx-v7c-ft1500-vs-shallow-bfm-v9
sha256:7bd7e39c361d225c5e3f0201df3dbb31b59a5b21d2cd1fe93869cc6efea33518·Indexed 2d ago
Parameters
4.0B
Total size
7.5 GB
Files
7
Quantization
BF16
No README on this version
Push a README.md with the model files to see it rendered here.
Model architecture
config.json- Architecture
- Qwen3ForCausalLM
- Model type
- qwen3
- Hidden size
- 2,560
- Layers
- 36
- Attention heads
- 32 (8 kv)
- FFN size
- 9,728
- Vocab size
- 151,936
- Context window
- 40K
Files
7 itemsmodel-00001-of-00002.safetensors
965e2944793f
4.6 GB
safetensors
model-00002-of-00002.safetensors
f6c94682b392
2.8 GB
safetensors
tokenizer.json
be75606093db
10.9 MB
model.safetensors.index.json
291af274b91a
32.1 KB
config.json
172dcf884965
1.6 KB
tokenizer_config.json
04b1682c59ac
694 B
generation_config.json
b77461537176
213 B