A technique where AI models are refined based on human evaluations of their outputs. Like having a focus group constantly reviewing your work, except they’re training a system that might eventually write the great AI novel.
Synonyms:
Reinforcement Learning from Human Feedback