Posts

Showing posts from December 27, 2022

TECHBYTES:ChatGPT: Optimizing Language Models for Dialogue - OpenAI

Image
OPEN AI"zzz CHAT GPT -3 ChatGPT (Generative Pre-trained Transformer) is a chatbot launched by OpenAI in November 2022. It builds on his OpenAI's GPT 3.5 family of large-scale language models and is fine-tuned with both supervised and reinforcement learning techniques. ChatGPT was launched as a prototype on November 30, 2022 and quickly gained attention for its detailed and clear answers in many knowledge areas. Its uneven factual accuracy is recognized as a significant shortcoming. Training ChatGPT has been improved based on his GPT-3.5 using supervised learning and reinforcement learning. Both approaches used human trainers to improve model performance. In supervised learning, the model was provided with a conversation in which the trainer acted as both the user and the AI ​​assistant. In the reinforcement step, the human trainer first sorted the responses the model made in previous conversations. These rankings were used to create a further refined 'reward model' usi