ChatGPT: Optimizing Language Models for Dialogue
Aligning Language Models to Follow Instructions
WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing
Aligning Language Models to Follow Instructions
Learning from Human Preferences
Proximal Policy Optimization
https://gpt.Chatapi.art/?
Building safer dialogue agents
https://jmcdonnell.substack.com/p/the-near-future-of-ai-is-action-driven