5fcms2sh-HeavyBall-PSGD
safetensors · bf16 · Public
teutonic-lucas/teutonic-q3-4b-5fcms2sh-heavyball-psgd
sha256:6beb50efc74beea8391905aa1e786f25ac773fe1ad6beda838b33c5c97911cbd · Indexed 1d ago
Parameters: 4.4B
Total size: 8.2 GB
Files: 7
Quantization: BF16
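As a rough consistency check on the figures above, the sketch below estimates checkpoint size from the parameter count and storage dtype. It assumes 2 bytes per BF16 parameter and ignores the safetensors header; the function name `checkpoint_size_bytes` is hypothetical. The listed 8.2 GB lines up with 4.4B BF16 parameters only if the page reports binary gigabytes (GiB), which is an inference and not something the listing states.

```python
# Bytes per parameter for common storage dtypes.
BYTES_PER_PARAM = {"bf16": 2, "fp16": 2, "fp32": 4}


def checkpoint_size_bytes(num_params: float, dtype: str = "bf16") -> float:
    """Estimate raw tensor bytes; file headers and metadata are ignored."""
    return num_params * BYTES_PER_PARAM[dtype]


size = checkpoint_size_bytes(4.4e9, "bf16")
# Report both unit conventions, since the listing does not say which it uses.
print(f"{size / 1e9:.1f} GB (decimal), {size / 2**30:.1f} GiB (binary)")
```

With the rounded 4.4B figure this gives about 8.8 decimal GB, or about 8.2 GiB.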
No README on this version
Model architecture
config.json
- Architecture: Qwen3ForCausalLM
- Model type: qwen3
- Hidden size: 2,560
- Layers: 36
- Attention heads: 32 (8 KV heads)
- FFN size: 9,728
- Vocab size: 151,936
- Context window: 40K
- Dtype: bfloat16
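The architecture fields above are enough to approximate the parameter count. The following is a minimal sketch, not the registry's own calculation. Several inputs are assumptions that the listing does not state: a per-head dimension of 128 (`head_dim`), untied input and output embeddings (`tie_embeddings=False`), no attention biases, per-head query/key norms, and a three-matrix gated MLP. The names `DecoderConfig` and `count_params` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class DecoderConfig:
    # Values taken from the listing above.
    hidden_size: int = 2560
    num_layers: int = 36
    num_heads: int = 32
    num_kv_heads: int = 8
    ffn_size: int = 9728
    vocab_size: int = 151_936
    # ASSUMPTIONS: neither value appears in the listing.
    head_dim: int = 128
    tie_embeddings: bool = False


def count_params(cfg: DecoderConfig) -> int:
    """Count weights for a bias-free decoder with grouped-query attention."""
    q_proj = cfg.hidden_size * cfg.num_heads * cfg.head_dim
    kv_proj = 2 * cfg.hidden_size * cfg.num_kv_heads * cfg.head_dim
    o_proj = cfg.num_heads * cfg.head_dim * cfg.hidden_size
    qk_norm = 2 * cfg.head_dim                  # assumed per-head q/k norms
    mlp = 3 * cfg.hidden_size * cfg.ffn_size    # gate, up, and down matrices
    layer_norms = 2 * cfg.hidden_size           # pre-attention and pre-MLP
    per_layer = q_proj + kv_proj + o_proj + qk_norm + mlp + layer_norms

    embed = cfg.vocab_size * cfg.hidden_size
    lm_head = 0 if cfg.tie_embeddings else embed
    final_norm = cfg.hidden_size
    return cfg.num_layers * per_layer + embed + lm_head + final_norm


total = count_params(DecoderConfig())
print(f"{total / 1e9:.2f}B parameters")
```

Under these assumptions the count is 4,411,424,256, which rounds to the listed 4.4B. Setting `tie_embeddings=True` drops it to roughly 4.02B, so the listed figure is more consistent with separately stored embedding and output matrices.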
Files
7 items
- model.safetensors · 70f3f50822cb · 8.2 GB · safetensors
- tokenizer.json · c0382117ea32 · 6.7 MB
- vocab.json · ca10d7e9fb3e · 2.6 MB
- merges.txt · 8831e4f1a044 · 1.6 MB
- tokenizer_config.json · 3c04ed3ca964 · 9.5 KB
- config.json · b8ec40a6e0ce · 726 B
- generation_config.json · 8c970692323e · 138 B
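A downloaded file can be compared against the short identifier shown next to it. The sketch below assumes each 12-character identifier is the leading hex digits of the file's SHA-256 digest; the listing does not say which hash produces these identifiers, so a mismatch may only mean the assumption is wrong. The helper names `sha256_hex` and `matches_listed_id` are hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_hex(path: Path, chunk_size: int = 1 << 20) -> str:
    """Stream the file in chunks so multi-gigabyte weights fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_id(path: Path, listed_id: str) -> bool:
    """ASSUMPTION: listed_id is a prefix of the file's SHA-256 hex digest."""
    return sha256_hex(path).startswith(listed_id.lower())


if __name__ == "__main__":
    target = Path("config.json")  # local copy of a listed file, if present
    if target.exists():
        print(matches_listed_id(target, "b8ec40a6e0ce"))
```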