Pretraining LLM
InstructGPT Finetuning
Reinforcement Learning with Human Feedback (RLHF)
Last updated 12 months ago