Add SFT training script and run Qwen3-0.6B-Base fine-tune

Train Qwen3-0.6B-Base (596M params) on 36K folksy proverb pairs
using full SFT with HuggingFace TRL. 3 epochs, 11 min on RTX 4090.

Results: train_loss=0.954, eval_loss=1.032, test_loss=1.031
Model checkpoint at folksy-model/final/ (not committed — 1.2 GB)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

This commit is contained in:

John McCardle

2026-03-31 22:07:23 -04:00

parent 9298c425bc

commit 02daa7bb97

4 changed files with 919 additions and 0 deletions

2

.gitignore vendored

View file

 @ -1 +1,3 @@
 *__pycache__
 .venv/
 folksy-model/

Rows
Columns

Add SFT training script and run Qwen3-0.6B-Base fine-tune

2 .gitignore vendored Unescape Escape View file

2

.gitignore vendored

View file