Same codebase, same sort of prompt, same scale. I was already on the fence. Models like Qwen, Kimi, or GLM5 already go a very long way while being vastly cheaper, and the new OpenAI models feel equivalent but with higher limits.
This is getting to the point where the right harness makes a bigger difference than the right model. I've been experimenting with some planner-executor-reviewer setup in opencode, and I'm starting to feel like multiple smaller models working together are netting me better results.
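For anyone wondering what I mean by planner-executor-reviewer, here is a rough sketch of the loop. This is not opencode's API; `Model`, `run_pipeline`, and the prompt wording are all made up for illustration, and each "model" is just a function that takes a prompt and returns text.

```python
from typing import Callable

# Hypothetical stand-in for any model call: prompt in, text out.
Model = Callable[[str], str]


def run_pipeline(
    task: str,
    planner: Model,
    executor: Model,
    reviewer: Model,
    max_rounds: int = 3,
) -> str:
    """Plan once, then let executor and reviewer iterate until approval."""
    plan = planner(f"Break this task into steps:\n{task}")
    feedback = ""
    result = ""
    for _ in range(max_rounds):
        # The executor sees the plan plus whatever the reviewer objected to.
        result = executor(
            f"Task: {task}\nPlan: {plan}\nReviewer feedback: {feedback}"
        )
        verdict = reviewer(
            f"Task: {task}\nResult: {result}\nReply APPROVE or list problems."
        )
        if verdict.strip().upper().startswith("APPROVE"):
            return result
        feedback = verdict
    # Out of rounds: return the last attempt even though it was not approved.
    return result
```

The point of the split is that each role can be a different, cheaper model, and the reviewer only has to judge a result rather than produce one.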
I use Codex more and more, because token usage is a black box, and I think that over the next couple of months we will see the usual three-tier model evolve: free, normal, luxury.
2027 will be the year of token regulation by administrations worldwide. Until then, take care not to get ripped off at the luxury level.
According to the GLM documentation at least, you can edit your .env (~/.claude/settings.json) with an API key from the GLM Settings page [0].
{
  "env": {
    "ANTHROPIC_AUTH_TOKEN": "your_zai_api_key",
    "ANTHROPIC_BASE_URL": "https://api.z.ai/api/anthropic",
    "API_TIMEOUT_MS": "3000000"
  }
}
Whilst you need to generate an API Key, you are still using the 'GLM Coding Plan' as per 'Methods for Using the GLM Coding Plan in Claude Code' documentation.
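If you already have a settings.json and don't want to clobber it, a small script can merge the three keys into the existing "env" object. This is only a sketch: the helper names `merge_env` and `update_settings_file` are mine, the key is a placeholder, and I'm assuming the file is plain JSON with a top-level "env" object as in the snippet from the docs.

```python
import json
from pathlib import Path

# Values copied from the snippet in this comment; the token is a placeholder.
GLM_ENV = {
    "ANTHROPIC_AUTH_TOKEN": "your_zai_api_key",
    "ANTHROPIC_BASE_URL": "https://api.z.ai/api/anthropic",
    "API_TIMEOUT_MS": "3000000",
}


def merge_env(settings: dict, env: dict) -> dict:
    """Return a copy of settings with env merged into its "env" object."""
    merged = dict(settings)
    merged["env"] = {**settings.get("env", {}), **env}
    return merged


def update_settings_file(path: Path, env: dict) -> None:
    """Read path if it exists, merge env, and write it back as JSON."""
    current = json.loads(path.read_text()) if path.exists() else {}
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(merge_env(current, env), indent=2) + "\n")
```

You would call it as `update_settings_file(Path.home() / ".claude" / "settings.json", GLM_ENV)`. Back up the file first; other keys in it are preserved, but existing values for these three env keys are overwritten.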
First, this has been going on for a week now and Anthropic support was gaslighting people, saying nothing has changed. Well, obviously it has, they said it now. Additionally, Anthropic saying weekly limits are unaffected is also a lie. I exhausted 37% of my weekly on the first day when this started on Monday. This has never happened before. So this is two lies in a row.
But now the bigger issue - what does Anthropic expect me to do when I'm out of quota on day 1 of the week? Emphasis on me having a lot of free time without access to Claude.
There can only be 3 options:
1. I'll buy more $200 subscriptions from a provider that I've caught lying and gaslighting me.
2. I'll only work 1 day in a week and accomplish nothing.
3. I'll explore other models / providers to see if I can fill the gap.
The right choice is very clear. But can't Anthropic see that their only moat is that the ecosystem was built around Claude Code, because they were first to do this subscription stuff and their harness didn't get in the way too much?
Once everyone tightens limits, the competition shifts from model quality to pricing strategy and usage control.
They aren't far off - they burn a tiny fraction of the cash of OAI and achieve similar ARR despite this - but as they tighten the belt it's inevitable that companies like OAI come in and offer more subsidized (unsustainable) inference to get people to switch. They will inevitably do the same "rug pull".
It'll be interesting to see how this plays out.
If you aren't planning a local LLM strategy, you're surely tying your lifeline to anchors.
Additionally, this was practically predicted and expected by so many people, the second the off-hours increase was announced.
Shoddy company
The longer this goes on the more it becomes clear Google is going to be the last one standing.
Users who aren't using their quota will gradually disappear when that 7% starts being loud in the other direction.