mainsafetensorsbf16Public
tryeverything/teutonic-q3-4b-5g4snsl1-try-v3
sha256:724d2b01101e85e7a826a71762fcb54b92bf6b3d66ca9aaf323a771539be2b7e·Indexed 2h ago
Parameters
4.0B
Total size
7.5 GB
Files
6
Quantization
BF16
No README on this version
Push a README.md with the model files to see it rendered here.
Model architecture
config.json- Architecture
- Qwen3ForCausalLM
- Model type
- qwen3
- Hidden size
- 2,560
- Layers
- 36
- Attention heads
- 32 (8 kv)
- FFN size
- 9,728
- Vocab size
- 151,936
- Context window
- 40K
Files
6 itemsmodel.safetensors
a40f747aca03
7.5 GB
safetensors
tokenizer.json
be75606093db
10.9 MB
merges.txt
8831e4f1a044
1.6 MB
config.json
9430f79acea2
1.6 KB
tokenizer_config.json
2e3bd3cb5055
693 B
generation_config.json
b77461537176
213 B