ChatGPT o1-mini is silently getting nerfed again (With evidence)
I wish they would be more transparent about model outages. Here's a Codeforces benchmark as evidence:
Codeforces problem | Rating | o1-mini 21/12/2024 | o1-mini 18/01/2025 | Chat link |
---|---|---|---|---|
2049A | 800 | Accepted | Wrong Answer | https://chatgpt.com/share/678b41b8-715c-800e-8d24-d45b89f5b677 |
2049B | 1300 | Accepted | Wrong Answer | https://chatgpt.com/share/678b4579-0234-800e-8944-504c564e0443 |
2049C | 1500 | Accepted | Wrong Answer | https://chatgpt.com/share/678b45be-094c-800e-9306-457e9afb6100 |
2049D | 1900 | Accepted | Wrong Answer | https://chatgpt.com/share/678b44b8-cef0-800e-97f9-16ea45040f7d |
For reference, o1-mini is supposed to have a 1650 rating, but it gave up thinking after 7 seconds and couldn't even solve an 800-rated problem.
But what if you don't care about the Codeforces benchmark? Well, o1-mini used `cout << endl;` instead of `cout << '\n';`, which means they're falling back to ChatGPT 4o, and you're essentially paying $20 a month for nothing.
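For anyone wondering why `endl` vs `'\n'` matters at all: both write a newline, but `std::endl` also flushes the stream every time, which is why competitive programmers usually avoid it. Here's a rough sketch that counts the flushes. `CountingBuf`, `Result`, and `print_lines` are names I made up for illustration, nothing from the chats above.

```cpp
#include <cassert>
#include <ostream>
#include <sstream>
#include <string>

// Hypothetical helper: a string buffer that counts how many times
// the stream asks it to flush (sync) its contents.
class CountingBuf : public std::stringbuf {
public:
    int flushes = 0;

protected:
    int sync() override {
        ++flushes;
        return 0;  // 0 signals success
    }
};

// What was written and how many flushes it took.
struct Result {
    std::string text;
    int flushes;
};

// Prints 1..n, one number per line, ending each line either with
// std::endl (newline + flush) or with '\n' (newline only).
Result print_lines(int n, bool use_endl) {
    CountingBuf buf;
    std::ostream out(&buf);
    for (int i = 1; i <= n; ++i) {
        if (use_endl) {
            out << i << std::endl;
        } else {
            out << i << '\n';
        }
    }
    return Result{buf.str(), buf.flushes};
}
```

Calling `print_lines(5, true)` and `print_lines(5, false)` gives identical text, but the first one flushes once per line and the second doesn't flush at all. On real stdout with a few hundred thousand lines of output, those extra flushes can be the difference between passing and hitting the time limit.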