Tagged “chatgpt”

TIL the Waluigi Effect in LLMs Mar 29, 2023 chatgpt ai
The Waluigi Effect: After you train an LLM to satisfy a desirable property 'P', it becomes easier to elicit the exact opposite of property 'P' from the chatbot.
TIL ChatGPT prompts Jan 13, 2023 ai chatgpt machinelearning
Don’t ask it to write an essay about how human error causes catastrophes. The AI will come up with a boring and straightforward piece that does the minimum possible to satisfy your simple demand.