Pretraining LLM
InstructGPT Finetuning
Reinforcement Learning with Human Feedback (RLHF)
Last updated 1 year ago