TAAFT
Free mode
100% free
Freemium
Free Trial
Deals
Create tool

Reinforcement Learning from Human Feedback

[ˌriːɪnˈfɔːsmənt ˈlɜːnɪŋ frəm ˈhjuːmən ˈfiːdbæk]
Ethics & Safety
Last updated: December 9, 2024

Definition

Training AI systems using human evaluations of their outputs

Detailed Explanation

Machine learning technique that incorporates human feedback to improve AI model behavior and align it with human preferences

Use Cases

Language model training Content moderation systems Autonomous system behavior refinement

Related Terms