Pretraining LLM
InstructGPT Finetuning
Reinforcement Learning with Human Feedback (RLHF)
Last updated 2 years ago
Was this helpful?