TECHBYTES:ChatGPT: Optimizing Language Models for Dialogue - OpenAI
OPEN AI"zzz CHAT GPT -3 ChatGPT (Generative Pre-trained Transformer) is a chatbot launched by OpenAI in November 2022. It builds on his OpenAI's GPT 3.5 family of large-scale language models and is fine-tuned with both supervised and reinforcement learning techniques. ChatGPT was launched as a prototype on November 30, 2022 and quickly gained attention for its detailed and clear answers in many knowledge areas. Its uneven factual accuracy is recognized as a significant shortcoming. Training ChatGPT has been improved based on his GPT-3.5 using supervised learning and reinforcement learning. Both approaches used human trainers to improve model performance. In supervised learning, the model was provided with a conversation in which the trainer acted as both the user and the AI assistant. In the reinforcement step, the human trainer first sorted the responses the model made in previous conversations. These rankings were used to create a further refined 'reward model' usi