Efficient AI Lecture 14: LLM Post-Training
PEFT is one of the most practical ideas in LLM post-training.
Instead of updating the whole model, train a tiny targeted part:
- Adapters: small inserted modules
- Prompt tuning: soft prompt vectors
- Prefix tuning: KV prefixes
- LoRA: low-rank updates
- QLoRA: 4-bit base + LoRA
-Bit-Delta: 1-bit fine-tune delta
Small changes, big adaptation.
My note:
显示更多