ChatGPT 5.2 Pro
1h ago
p95 latency jumped to ~14s for short prompts.
OpenAI
Daily drift snapshot against a 21-day baseline with auto + human signals.
Last run Jan 13, 2026 (2h ago)
7-day drift
AUTO DUMB INDEX
48
Normal
vs baseline +3
Why it moved
Latency up
med
TTFT slower
Delta +4
Instruction slips
low
Format misses
Delta +2
Accuracy steady
low
No major drift
Delta -1
Baseline window: 21 days
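
A "vs baseline" delta against a 21-day window can be sketched as below. This is only an illustration: the page does not say how the baseline is aggregated, so the trailing mean, the function name `baseline_delta`, and the sample history are all assumptions.

```python
from statistics import mean


def baseline_delta(daily_scores, window=21):
    """Compare the latest daily score with the mean of the preceding `window` days.

    `daily_scores` is ordered oldest first. Using a plain mean as the baseline
    is an assumption; the dashboard does not document its aggregation.
    """
    if len(daily_scores) < window + 1:
        raise ValueError("need at least window + 1 daily scores")
    latest = daily_scores[-1]
    # The baseline excludes the latest day so it cannot dilute its own delta.
    baseline = mean(daily_scores[-(window + 1):-1])
    return latest, baseline, latest - baseline


# Hypothetical history: 21 flat baseline days followed by today's reading.
history = [45] * 21 + [48]
latest, baseline, delta = baseline_delta(history)
print(f"index {latest}, vs baseline {delta:+}")
```

Excluding the latest day from the window keeps a sudden jump from being partly absorbed into the baseline it is compared against.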
Accuracy
Objective tasks solved correctly.
38%
+1 vs baseline
Reasoning robustness
Consistency across prompt variations.
34%
0 vs baseline
Instruction following
Format and constraint compliance.
31%
+2 vs baseline
Hallucination risk
Confident wrong answers on known items.
40%
+1 vs baseline
Refusal anomaly
Unexpected refusals on safe prompts.
29%
-1 vs baseline
Latency
p50/p95 response time drift.
44%
+4 vs baseline
Variance
Run-to-run stability.
28%
-2 vs baseline
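
The "p50/p95 response time drift" on the Latency card could be computed along these lines. It is a rough sketch using the nearest-rank percentile definition; the page does not state which percentile method it uses, and the names `percentile` and `latency_drift` plus the sample values are hypothetical.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample with at least p% of values at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


def latency_drift(baseline_samples, current_samples):
    """Return the change in p50 and p95 between two sets of latency samples."""
    return {
        f"p{p}": percentile(current_samples, p) - percentile(baseline_samples, p)
        for p in (50, 95)
    }


# Hypothetical latencies in seconds, not values from the dashboard.
baseline = list(range(1, 21))           # 1..20
current = [2 * x for x in baseline]     # every request twice as slow
print(latency_drift(baseline, current))
```

Tracking p95 alongside p50 matters because a tail regression can appear while the median barely moves.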
Eval suite
Tier 0
Sanity checks
78
+1 today
12 tasks
Tier 1
Factual QA
73
-1 today
20 tasks
Tier 2
Reasoning + math
69
-2 today
18 tasks
Tier 3
Coding
71
0 today
12 tasks
Tier 4
Instruction stress
66
-1 today
10 tasks
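
One way to roll the five tiers into a single number is a task-weighted mean of the tier scores. The page does not say whether or how it aggregates tiers, so treat this as an assumption for illustration; `TIERS` copies the scores and task counts listed above, and `weighted_score` is a hypothetical helper.

```python
# (tier name, score, task count) as listed in the eval suite above.
TIERS = [
    ("Sanity checks", 78, 12),
    ("Factual QA", 73, 20),
    ("Reasoning + math", 69, 18),
    ("Coding", 71, 12),
    ("Instruction stress", 66, 10),
]


def weighted_score(tiers):
    """Mean of tier scores weighted by the number of tasks in each tier."""
    total_tasks = sum(tasks for _, _, tasks in tiers)
    if total_tasks == 0:
        raise ValueError("no tasks to weight by")
    return sum(score * tasks for _, score, tasks in tiers) / total_tasks


print(round(weighted_score(TIERS), 1))  # 71.5
```

Weighting by task count keeps a small tier, such as the 10-task instruction stress set, from moving the aggregate as much as the 20-task factual QA set.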
Community
Top categories today