grounding accuracy

GPT-5.2: A Shift in Evaluative Personality

GPT-5.2 has shifted its focus towards evaluative personality, making it highly distinguishable with a classification accuracy of 97.9%, compared to Claude's family at 83.9%. Interestingly, GPT-5.2 is more stringent on hallucinations and faithfulness, areas where Claude previously excelled, indicating OpenAI's emphasis on grounding accuracy. This has resulted in GPT-5.2 being more aligned with models like Sonnet and Opus 4.5 in terms of strictness, whereas GPT-4.1 is more lenient, similar to Gemini-3-Pro. The changes reflect a strategic move by OpenAI to enhance the reliability and accuracy of their models, which is crucial for applications requiring high trust in AI outputs.
Read Full Article
Read Full Article: GPT-5.2: A Shift in Evaluative Personality

Posted on

Jan 1, 2026

by

AIGeekery

in

Commentary, Deep Dives

Topics: AI models, AI development, AI reliability