What is ChatGPT, How to use ChatGPT?

 

ChatGPT (Chat Generative Pre-trained Transformer) is a chatbot launched by OpenAI in November 2022. It is built on top of OpenAI’s GPT-3.5 family of large language models and is fine-tuned with both supervised and reinforcement learning techniques.

ChatGPT was launched as a prototype on November 30, 2022, and quickly garnered attention for its detailed responses and articulate answers across many domains of knowledge. Its uneven factual accuracy was identified as a significant drawback.

 

Jailbreaks

ChatGPT was trained to reject prompts that may violate its content policy. However, some users managed to bypass these restrictions through techniques such as prompt engineering. These jailbreaks allowed users to prompt ChatGPT into producing outputs that others may deem offensive, inappropriate, or socially harmful. Methods used to bypass ChatGPT’s filter include:

  1. Asking it to continue a statement in a fake interview.
  2. Providing instructions to disable the chat filter.
  3. Prompting it to decrypt a message containing instructions and then follow them.
  4. Telling it to act as a computer and output its display as ASCII art.

 

ChatGPT

This is a free research preview.

🔬
Our goal is to get external feedback in order to improve our systems and make them safer.
🚨
While we have safeguards in place, the system may occasionally generate incorrect or misleading information and produce offensive or biased content. It is not intended to give advice.

How we collect data

🦾
Conversations may be reviewed by our AI trainers to improve our systems.
🔐
Please don’t share any sensitive information in your conversations.

We’d love your feedback!

👍
This system is optimized for dialogue. Let us know if a particular response was good or unhelpful.
💬
Share your feedback in our Discord server.

 

ChatGPT – Examples

ChatGPT – Capabilities

  • Remembers what user said earlier in the conversation
  • Allows user to provide follow-up corrections
  • Trained to decline inappropriate requests

ChatGPT – Limitations

  • May occasionally generate incorrect information
  • May occasionally produce harmful instructions or biased content
  • Limited knowledge of world and events after 2021

 

 

CLICK HERE TO START USING ChatGPT – https://chat.openai.com/chat

 

 

Training ChatGPT

Image: OpenAI CEO Sam Altman

ChatGPT was fine-tuned on top of GPT-3.5 using supervised learning as well as reinforcement learning. Both approaches used human trainers to improve the model’s performance. In the supervised step, the model was given conversations in which the trainers played both sides: the user and the AI assistant. In the reinforcement step, human trainers first ranked responses that the model had generated in earlier conversations. These rankings were used to train ‘reward models’, against which the model was further fine-tuned over several iterations of Proximal Policy Optimization (PPO). PPO algorithms are a cheaper alternative to trust region policy optimization algorithms: they avoid many of the computationally expensive operations and run faster. The models were trained in collaboration with Microsoft on its Azure supercomputing infrastructure.
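The two reinforcement-learning ingredients described above can be illustrated with a small sketch. It is not OpenAI's implementation: the pairwise logistic loss is a common way to learn from human rankings and is assumed here, and the function names, the example scores, and the default clipping value epsilon=0.2 are all illustrative choices rather than details given in this article.

```python
import math
from itertools import combinations


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def pairwise_ranking_loss(scores_best_to_worst):
    """Average -log(sigmoid(better - worse)) over every ranked pair.

    `scores_best_to_worst` holds hypothetical reward-model scores for
    several responses, listed in the order a human trainer ranked them.
    The loss is small when higher-ranked responses get higher scores.
    """
    if len(scores_best_to_worst) < 2:
        raise ValueError("need at least two ranked responses")
    # combinations() keeps list order, so the first item of each pair
    # is always the response the trainer ranked higher.
    pairs = list(combinations(scores_best_to_worst, 2))
    total = sum(softplus(worse - better) for better, worse in pairs)
    return total / len(pairs)


def ppo_clipped_objective(ratio, advantage, epsilon=0.2):
    """PPO clipped surrogate term for one sample (to be maximized).

    `ratio` is new_policy_prob / old_policy_prob for the sampled action
    and `advantage` says how much better than expected the action was.
    Clipping the ratio keeps a single update from moving the policy far.
    """
    clipped_ratio = max(1.0 - epsilon, min(ratio, 1.0 + epsilon))
    return min(ratio * advantage, clipped_ratio * advantage)


if __name__ == "__main__":
    # Scores that agree with the human ranking give a lower loss
    # than the same scores in reverse order.
    print(pairwise_ranking_loss([2.0, 1.0, 0.0]))
    print(pairwise_ranking_loss([0.0, 1.0, 2.0]))
    # A large ratio with a positive advantage is capped at 1 + epsilon.
    print(ppo_clipped_objective(ratio=1.5, advantage=1.0))
```

The clipping step is what makes PPO cheap: it limits how far each update can move the policy using only a min and a clamp, whereas trust region policy optimization enforces a similar limit through a constrained optimization problem.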

In addition, OpenAI continues to gather data from ChatGPT users that could be used to further train and fine-tune ChatGPT. Users can upvote or downvote the responses they receive from ChatGPT; after voting, they can also fill out a text field with additional feedback.
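As a rough illustration of what such feedback might look like as data, here is a minimal sketch. The Feedback record, its field names, and the net_votes helper are hypothetical and are not taken from OpenAI's systems; they only mirror the upvote/downvote plus free-text comment described above.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Feedback:
    """One user's reaction to one response (hypothetical schema)."""
    response_id: str   # made-up identifier for the rated response
    upvote: bool       # True for an upvote, False for a downvote
    comment: str = ""  # optional free-text feedback


def net_votes(items: List[Feedback]) -> Dict[str, int]:
    """Return upvotes minus downvotes for each response."""
    totals: Dict[str, int] = defaultdict(int)
    for item in items:
        totals[item.response_id] += 1 if item.upvote else -1
    return dict(totals)


if __name__ == "__main__":
    collected = [
        Feedback("r1", upvote=True),
        Feedback("r1", upvote=False, comment="The date is wrong."),
        Feedback("r2", upvote=True, comment="Clear explanation."),
    ]
    print(net_votes(collected))
```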

 

January 3, 2023 8:49 PM