I can't see why anyone still chooses Claude. Codex outperforms it in most respects, and its quotas are about ten times larger. A $100 Codex plan gets me through the whole week with 6–12 hours of coding per day.
I've found GPT 5.5 pretty solid, but I keep getting impressed by Opus. It's tracked down some insane stuff while I look away during a meeting. 5.5 is way closer to Anthropic than previous OpenAI models were, IMO.
These things are so tricky because everyone has a seemingly conflicting experience. Part of the fun I guess!
I've never actually run into the issues that people talk about online, like Claude suddenly getting dumb or running out of usage. So there's just not a lot of incentive for me to shop around. I've used Amp a bit, and it's quite nice, but a bit more expensive without the subsidized subscription.
Are you using Opus? Sonnet remains as useful as it was, while Opus's efficacy has declined and its token burn rate has climbed over the last 4 months.
Same here. Works every time. Never ran into usage limits either.
Claude is the only AI coding tool I've found worth a damn. Without it I'd just do everything by hand save for a few bash scripts or whatever.
One reason might be that Claude Opus 4.7 thinking benchmarks better on Arena Coding at https://arena.ai/leaderboard/text/coding ... hopefully that effectively assesses correctness. It doesn't account for reliability though.
You get a discount for paying for a full year on Teams, and Enterprise can involve contractual obligations. It's a lot of effort to get buy-in to change providers and to shift an entire organization. The winds change frequently in this space, and the pain needs to reach a certain level before it's worth rolling the dice.
Corporate policies and agreements. In large corporations, using external non-approved models with proprietary source code is a good way to have significant career issues.
Claude Max 20x gives me unlimited (for my level of usage) Opus 4.7 - how much money do I have to pay OpenAI for that?
I think it's impossible to say that Codex x.y.z is better than Sonnet x.y.z. I've used many high-end models, and they're all just good.
Claude is significantly better at Rust in my experience, and Rust is my favorite language to emit from LLMs.
Opus 4.7 + Rust is a killer combo.
because my shard isn’t erroring
I use Codex when Claude Code is down, and I only began using Claude when ChatGPT was down
Yes, Codex is very fast, but I'm going back to Claude for now.
Corporate reasons. AWS hasn't opened codex models to everyone yet.
Because of marketing and vibes mostly.
Heck I prefer DeepSeek to both of those.
Wow, I'm really surprised. I tried DeepSeek (their best model, through the official API). It's extremely cheap, but it's clearly not as good at programming as Opus 4.7. It seems nowhere near as good at making high-level design choices. DeepSeek also seems to get stuck in whack-a-mole fixing loops much more than Opus. I stopped it at one point and asked Opus to solve the problem it was trying to solve, and it saw the solution immediately.
I was running DeepSeek through Claude Code's agent harness. Maybe it works better through a different tool?
I've given V4 Pro some curly things and was impressed at how it figured them out. I agree high-level design is not its forte. But it sat in a loop and doggedly debugged a crazy dependency issue over the course of 15 minutes until it reached the right answer, which impressed me.
You tried v4?
Yeah, v4.
I would have been much more impressed with v4 about 6 months ago. But I've been spoiled by opus 4.7. Deepseek isn't at the same level.
Interestingly, I had the same experience, and weirdly it's in part because it is clearly less intelligent. It's more of a mechanistic tool that just does what I ask (but is still very smart and very competent about it) and less intent on winning a Nobel Prize with each answer. Turns out I actually like that.
Sonnet is also throwing an overloaded error.
My systems are hitting their exponential-backoff retries, so this might not get better on its own; the retries themselves pile load back onto an already overloaded service.
> {'type': 'error', 'error': {'details': None, 'type': 'overloaded_error', 'message': 'Overloaded'}, 'request_id': 'req_ ...
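For context on that feedback loop, here's a minimal sketch (mine, not Anthropic's recommended client behavior) of retrying on an overloaded_error payload like the one quoted above using exponential backoff with full jitter. The `send` callable, retry counts, and timings are assumptions for illustration:

    import random
    import time

    def call_with_backoff(send, max_retries=6, base=1.0, cap=60.0):
        # Retries send() on overloaded errors, sleeping for an exponentially
        # growing, fully jittered delay so many clients don't all retry in
        # lockstep and re-overload the service.
        for attempt in range(max_retries):
            resp = send()  # assumed to return a parsed JSON dict like the payload above
            err = resp.get("error", {}) if resp.get("type") == "error" else {}
            if err.get("type") == "overloaded_error":
                delay = random.uniform(0, min(cap, base * 2 ** attempt))  # full jitter
                time.sleep(delay)
                continue
            return resp
        raise RuntimeError("still overloaded after retries")

Full jitter (a uniformly random delay up to the cap) rather than a fixed exponential schedule is the usual way to break the "retries overload things again" cycle.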
I can see a weird spike in my cache hit-rate a few minutes before, so this might actually be some extra caching they have thrown in.
https://status.claude.com/
They're having quite the day for devrel...
Do they need a waiting list, or what?
Sonnet is giving an overloaded message as well.
I thought the deal with xai was supposed to solve this? Is this basically the adding lanes paradox?
You're assuming the elevated error rates are due to the system being overloaded. We have no evidence this is actually the case. It's much more likely due to a simple misconfiguration or a failing router or something.
So, all those CEOs who moved all their remaining engineers to be dependent on a cloud service, to the extent that there's no local development capability, are gonna apologize, right?
In a year or two, when AI tool costs go from 5M per year to 15M per year... and even then, maybe not.