Add SFT training script and run Qwen3-0.6B-Base fine-tune

Train Qwen3-0.6B-Base (596M params) on 36K folksy proverb pairs
using full SFT with HuggingFace TRL. 3 epochs, 11 min on RTX 4090.

Results: train_loss=0.954, eval_loss=1.032, test_loss=1.031
Model checkpoint at folksy-model/final/ (not committed — 1.2 GB)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
John McCardle 2026-03-31 22:07:23 -04:00
commit 02daa7bb97
4 changed files with 919 additions and 0 deletions

2
.gitignore vendored
View file

@ -1 +1,3 @@
*__pycache__
.venv/
folksy-model/