I’ve used it at Meta. It’s very bad, if they released it in its current state it would be laughed at. I imagine they need to improve quality massively before it’s viable to release.
Occam’s razor tells me it’s probably because it’s not good. Perhaps running a company like survivor in a pressure cooker is not an effective management strategy.
Completely forgot that Meta was doing AI (and certainly spending billions doing so). They've got a lot of money, but are far behind on experience, talent, technology, and infrastructure.
The article makes it unclear if they are building a new model or if it's just the API. But I am guessing it's the API.
So it's "release to developers" rather than "new AI model". They cannot ship the API.
I would assume you would just provide an OpenAI compatible endpoint or two? But maybe they are not doing it that way.
Who knows what they are doing though. Maybe Meta has some kind of global API mesh thing and they can't quite make it work with vLLM or Sglang or something. Maybe they are building out a whole metered cloud IaaS for AI from scratch and that's just how long it takes. Maybe it's not technical complexity and just one of the managers is a problem.
Maybe they are delaying the API release until another more competitive model finishes training and testing.
If they release a model comparable to OpenAI / Antropic, will there be any reason left for 1T valuation of other companies? At that point, it will simply become Revenue proportional to Gigawatts available. Whoever got the energy wins.
DeepSeek and friends already exists, yet $1T valuations still exist. I think we are nearing a point where inference and cost metrics become the primary optimization for a while. Both capacity and costs are going to drive it from the demand side. I've personally moved to open weights already, now setting up vendor calls to make them available at work.
Talking with OpenCode and Fireworks, appreciate any recommendations that have SOC-2 and the like
If you spend more than 1 minute on Facebook, you realize what they are potentially training their data on, and it is not good. Their advertising algorithm is very good, I'll give them that.
I’ve used it at Meta. It’s very bad, if they released it in its current state it would be laughed at. I imagine they need to improve quality massively before it’s viable to release.
This is what I suspected. Wang was a generationally bad hire.
He has Meta SWEs making $250k+/year labeling data in AAI. He has exactly one move and it's this: https://i.imgflip.com/atotpp.jpg
But didn't Zuck say it will replace junior to mid level engineers on Joe Rogan podcast or something ?
Occam’s razor tells me it’s probably because it’s not good. Perhaps running a company like survivor in a pressure cooker is not an effective management strategy.
Also when you finally make it better, the others make theirs even better and you are still behind.
Completely forgot that Meta was doing AI (and certainly spending billions doing so). They've got a lot of money, but are far behind on experience, talent, technology, and infrastructure.
The article makes it unclear if they are building a new model or if it's just the API. But I am guessing it's the API.
So it's "release to developers" rather than "new AI model". They cannot ship the API.
I would assume you would just provide an OpenAI compatible endpoint or two? But maybe they are not doing it that way.
Who knows what they are doing though. Maybe Meta has some kind of global API mesh thing and they can't quite make it work with vLLM or Sglang or something. Maybe they are building out a whole metered cloud IaaS for AI from scratch and that's just how long it takes. Maybe it's not technical complexity and just one of the managers is a problem.
Maybe they are delaying the API release until another more competitive model finishes training and testing.
API server is not hard problem and not make sense for indefinite postpone. I think the more likely explanation is model quality.
Too bad for Meta, and very sad day Llama.
If they release a model comparable to OpenAI / Antropic, will there be any reason left for 1T valuation of other companies? At that point, it will simply become Revenue proportional to Gigawatts available. Whoever got the energy wins.
DeepSeek and friends already exists, yet $1T valuations still exist. I think we are nearing a point where inference and cost metrics become the primary optimization for a while. Both capacity and costs are going to drive it from the demand side. I've personally moved to open weights already, now setting up vendor calls to make them available at work.
Talking with OpenCode and Fireworks, appreciate any recommendations that have SOC-2 and the like
CloudFlare host the best models (Deepseek/Kimi)
*best open models
If you spend more than 1 minute on Facebook, you realize what they are potentially training their data on, and it is not good. Their advertising algorithm is very good, I'll give them that.
https://archive.is/ia01T