Train Qwen3-0.6B-Base on folksy proverb pairs with full SFT

Fine-tune Qwen3-0.6B-Base (596M params) on 36K folksy proverb pairs
using full SFT with HuggingFace TRL. 3 epochs, 11 minutes on an RTX 4090.

Results: train_loss=0.954, eval_loss=1.032, test_loss=1.031

Model checkpoint at folksy-model/final/ (not committed; 1.2 GB)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
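A minimal sketch of the training run described above, using TRL's `SFTTrainer`. The dataset path `data/proverbs.jsonl`, the `input`/`output` field names, the batch size, and the eval split are assumptions for illustration; only the base model, 3 epochs, full SFT, and the `folksy-model/final` output path come from the commit.

```python
def to_messages(example):
    # Convert one {"input": ..., "output": ...} proverb pair (field names
    # are an assumption) into the chat format TRL's SFTTrainer accepts.
    return {
        "messages": [
            {"role": "user", "content": example["input"]},
            {"role": "assistant", "content": example["output"]},
        ]
    }

def main():
    from datasets import load_dataset
    from trl import SFTConfig, SFTTrainer

    # Hypothetical JSONL file of proverb pairs; held-out split for eval.
    dataset = load_dataset("json", data_files="data/proverbs.jsonl", split="train")
    dataset = dataset.map(to_messages, remove_columns=dataset.column_names)
    splits = dataset.train_test_split(test_size=0.05, seed=42)

    trainer = SFTTrainer(
        model="Qwen/Qwen3-0.6B-Base",
        train_dataset=splits["train"],
        eval_dataset=splits["test"],
        args=SFTConfig(
            output_dir="folksy-model",
            num_train_epochs=3,              # from the commit message
            per_device_train_batch_size=8,   # assumption; fits on an RTX 4090
            bf16=True,
        ),
    )
    trainer.train()
    trainer.save_model("folksy-model/final")

if __name__ == "__main__":
    main()
```

Full SFT (no PEFT config passed) updates all 596M parameters, which is why the resulting checkpoint is ~1.2 GB in bf16.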
.gitignore:

*__pycache__
.venv/
folksy-model/