Author name: Kelly Newman


HBO Max is “way underpriced,” Warner Bros. Discovery CEO says

Consumers in America would pay twice as much 10 years ago for content. People were spending, on average, $55 for content 10 years ago, and the quality of the content, the amount of content that we’re getting, the spend is 10 or 12 fold and they’re paying dramatically less. I think we want a good deal for consumers, but I think over time, there’s real opportunity, particularly for us, in that quality area, to raise price.

A question of quality

Zaslav is arguing that the quality of the shows and movies on HBO Max warrants an eventual price bump. But, in general, viewers find streaming services are getting less impressive. A Q4 2024 report from TiVo found that the percentage of people who think the streaming services that they use have “moderate to very good quality” has been declining since Q4 2021.

Bar graph from TiVo’s Q4 2024 Video Trends report. Credit: TiVo

Research also points to people being at their limit when it comes to TV spending. Hub Entertainment Research’s latest “Monetizing Video” study, released last month, found that for consumers, low prices “by far still matters most to the value of a TV service.”

Meanwhile, niche streaming services have been gaining in popularity as streaming subscribers grow bored with the libraries of mainstream streaming platforms and/or feel like they’ve already seen the best of what those services have to offer. Antenna, a research firm focused on consumer subscription services, reported this month that specialty streaming service subscriptions increased 12 percent year over year in 2025 thus far and grew 22 percent in the first half of 2024.

Zaslav would likely claim that HBO Max is an outlier when it comes to streaming library dissatisfaction. Although WBD’s streaming business (which includes Discovery+) turned a $293 million profit and grew subscriber-related revenue (which includes ad revenues) in its most recent earnings report, investors would likely be unhappy if the company rested on its financial laurels. WBD has one of the most profitable streaming businesses, but it still trails far behind Netflix, which posted an operating income of $3.8 billion in its most recent earnings.

Still, increasing prices is rarely welcomed by customers. With many other options for streaming these days (including free ones), HBO Max will have to do more to convince people that it is worth the extra money than merely making the claim.



Yes, AI Continues To Make Rapid Progress, Including Towards AGI

That does not mean AI will successfully make it all the way to AGI and superintelligence, or that it will make it there soon or on any given time frame.

It does mean that AI progress, while it could easily have been even faster, has still been historically lightning fast. It has exceeded almost all expectations from more than a few years ago. And it means we cannot disregard the possibility of High Weirdness and profound transformation happening within a few years.

GPT-5 had a botched rollout and was only an incremental improvement over o3, o3-Pro and other existing OpenAI models, but was very much on trend and a very large improvement over the original GPT-4. Nor would one disappointing model from one lab have meant that major further progress must be years away.

Imminent AGI (in the central senses in which the term AGI is used, where imminent means years rather than decades) remains a very real possibility.

Part of this is covering in full Gary Marcus’s latest editorial in The New York Times, since that is the paper of record read by many in government. I felt that piece was in many places highly misleading to the typical Times reader.

Imagine if someone said ‘you told me in 1906 that there was increasing imminent risk of a great power conflict, and now it’s 1911 and there has been no war, so your fever dream of a war to end all wars is finally fading.’ Or saying that you were warned in November 2019 that Covid was likely coming, and now it’s February 2020 and no one you know has it, so it was a false alarm. That’s what these claims sound like to me.

I have to keep emphasizing this because it now seems to be an official White House position, with prominent White House official Sriram Krishnan going so far as to say on Twitter that AGI any time soon has been ‘disproven,’ and David Sacks spending his time ranting and repeating Nvidia talking points almost verbatim.

When pressed, there is often a remarkably narrow window in which ‘imminent’ AGI is dismissed as ‘proven wrong.’ But this is still used as a reason to structure public policy and one’s other decisions in life as if AGI definitely won’t happen for decades, which is Obvious Nonsense.

Sriram Krishnan: I’ll write about this separately but think this notion of imminent AGI has been a distraction and harmful and now effectively proven wrong.

Prinz: “Imminent AGI” was apparently “proven wrong” because OpenAI chose to name a cheap/fast model “GPT-5” instead of o3 (could have been done 4 months earlier) or the general reasoning model that won gold on both the IMO and the IOI (could have been done 4 months later).

Rob Miles: I’m a bit confused by all the argument about GPT-5, the truth seems pretty mundane: It was over-hyped, they kind of messed up the launch, and the model is good, a reasonable improvement, basically in line with the projected trend of performance over time.

Not much of an update.

To clarify a little, the projected trend GPT-5 fits with is pretty nuts, and the world is on track to be radically transformed if it continues to hold. Probably we’re going to have a really wild time over the next few years, and GPT-5 doesn’t update that much in either direction.

Rob Miles is correct here as far as I can tell.

If imminent means ‘within the next six months’ or maybe up to a year, I think Sriram’s perspective is reasonable, because of what GPT-5 tells us about what OpenAI is cooking. For sensible values of imminent that are more relevant to policy and action, Sriram Krishnan is wrong, in an ‘I sincerely hope he is engaging in rhetoric rather than being genuinely confused about this, or his imminent only means the next year or at most two’ way.

I am confused how he can be sincerely mistaken given how deep he is into these issues, and I hope he shares his reasons so we can quickly clear this up, because this is a crazy thing to actually believe. I do look forward to Sriram providing a full explanation as to why he believes this. So far we have only heard ‘GPT-5.’

Not only is imminent AGI not disproven, there are continuing important claims that it is likely. Here is some clarity on Anthropic’s continued position, as of August 31.

Prinz: Jack, I assume no changes to Anthropic’s view that transformative AI will arrive by the end of next year?

Jack Clark: I continue to think things are pretty well on track for the sort of powerful AI system defined in machines of loving grace – buildable end of 2026, running many copies 2027. Of course, there are many reasons this could not occur, but lots of progress so far.

Anthropic’s valuation has certainly been on a rocket ship exponential.

Do I agree that we are on track to meet that timeline? No. I do not. I would be very surprised to see it go down that fast, and I am surprised that Jack Clark has not updated based on, if nothing else, previous projections by Anthropic CEO Dario Amodei falling short. I do think it cannot be ruled out. If it does happen, I do not think you have any right to be outraged at the universe for it.

It is certainly true that Dario Amodei’s early prediction of AI writing most of the code, as in 90% of all code within 3-6 months after March 11, has been proven definitively false. This was not a good prediction, because the previous generation definitely wasn’t ready, and even if it had been, that’s not how diffusion works. In practice it’s more like 40% of all code generated by AI, and 20%-25% of what goes into production.

Which is still a lot, but a lot less than 90%.

Here’s what I said at the time about Dario’s prediction:

Zvi Mowshowitz (AI #107): Dario Amodei says AI will be writing 90% of the code in 6 months and almost all the code in 12 months. I am with Arthur B here, I expect a lot of progress and change very soon but I would still take the other side of that bet. The catch is: I don’t see the benefit to Anthropic of running the hype machine in overdrive on this, at this time, unless Dario actually believed it.

I continue to be confused why he said it, it’s highly unstrategic to hype this way. I can only assume on reflection this was an error about diffusion speed more than it was an error about capabilities? On reflection yes I was correctly betting ‘no’ but that was an easy call. I dock myself more points on net here, for hedging too much and not expressing the proper level of skepticism. So yes, this should push you towards putting less weight on Anthropic’s projections, although primarily on the diffusion front.

As always, remember that projections of future progress include the possibility, nay the inevitability, of discovering new methods. We are not projecting ‘what if the AI labs all keep ramming their heads against the same wall whether or not it works.’

Ethan Mollick: 60 years of exponential growth in chip density was achieved not through one breakthrough or technology, but a series of problems solved and new paradigms explored as old ones hit limits.

I don’t think current AI has hit a wall, but even if it does, there are many paths forward now.

Paul Graham: One of the things that strikes me when talking to AI insiders is how they believe both that they need several new discoveries to get to AGI, and also that such discoveries will be forthcoming, based on the past rate.

My talks with AI insiders also say we will need new discoveries, and we definitely will need new major discoveries in alignment. But it’s not clear how big those new discoveries need to be in order to get there.

I agree with Ryan Greenblatt that precise timelines for AGI don’t matter that much in terms of actionable information, but big jumps in the chance of things going crazy within a few years can matter a lot more. This is similar to questions of p(doom), where as long as you are in the Leike Zone of a 10%-90% chance of disaster, you mostly want to react in the same ways, but outside that range you start to see big changes in what makes sense.

Ryan Greenblatt: Pretty short timelines (<10 years) seem likely enough to warrant strong action and it's hard to very confidently rule out things going crazy in <3 years.

While I do spend some time discussing AGI timelines (and I’ve written some posts about it recently), I don’t think moderate quantitative differences in AGI timelines matter that much for deciding what to do. For instance, having a 15-year median rather than a 6-year median doesn’t make that big of a difference. That said, I do think that moderate differences in the chance of very short timelines (i.e., less than 3 years) matter more: going from a 20% chance to a 50% chance of full AI R&D automation within 3 years should potentially make a substantial difference to strategy.

Additionally, my guess is that the most productive way to engage with discussion around timelines is mostly to not care much about resolving disagreements, but then when there appears to be a large chance that timelines are very short (e.g., >25% in <2 years) it's worthwhile to try hard to argue for this. I think takeoff speeds are much more important to argue about when making the case for AI risk.

I do think that having somewhat precise views is helpful for some people in doing relatively precise prioritization within people already working on safety, but this seems pretty niche.

Given that I don’t think timelines are that important, why have I been writing about this topic? This is due to a mixture of: I find it relatively quick and easy to write about timelines, my commentary is relevant to the probability of very short timelines (which I do think is important as discussed above), a bunch of people seem interested in timelines regardless, and I do think timelines matter some.

Consider reflecting on whether you’re overly fixated on details of timelines.

Jason Calacanis of the All-In Podcast (where he is alongside AI Czar David Sacks) has a bold prediction, if you believe that his words have or are intended to have meaning. Which is an open question.

Jason: Before 2030 you’re going to see Amazon, which has massively invested in [AI], replace all factory workers and all drivers … It will be 100% robotic, which means all of those workers are going away. Every Amazon worker. UPS, gone. FedEx, gone.

Aaron Slodov: hi @Jason how much money can i bet you to take the other side of the factory worker prediction?

Jason (responding to video of himself saying the above): In 2035 this will not be controversial take — it will be reality.

Hard, soul-crushing labor is going away over the next decade. We will be deep in that transition in 2030, when humanoid robots are as common as bicycles.

Notice the goalpost move of ‘deep in that transition’ in 2030 versus saying full replacement by 2030, without seeming to understand there is any contradiction.

These are two very different predictions. The original ‘by 2030’ prediction is Obvious Nonsense unless you expect superintelligence and a singularity, probably involving us all dying. There’s almost zero chance otherwise. Technology does not diffuse that fast.

Plugging 2035 into the 2030 prediction is also absurd, if we take the prediction literally. No, you’re not going to have zero workers at Amazon, UPS and FedEx within ten years unless we’ve not only solved robotics and AGI, we’ve also diffused those technologies at full scale. In which case, again, that’s a singularity.

I am curious what his co-podcaster David Sacks or Sriram Krishnan would say here. Would they dismiss Jason’s confident prediction as already proven false? If not, how can one be confident that AGI is far? Very obviously you can’t have one without the other.

GPT-5 is not a good reason to dismiss AGI, and to be safe I will once again go into why, and why we are making rapid progress towards AGI.

GPT-5 and GPT-4 were both major leaps in benchmarks from the previous generation.

The differences are dramatic, and the time frame between releases was similar.

The actual big difference? That there was only one incremental release between GPT-3 and GPT-4, GPT-3.5, with little outside competition. Whereas between GPT-4 and GPT-5 we saw many updates. At OpenAI alone we saw GPT-4o, and o1, and o3, plus updates that didn’t involve number changes, and at various points Anthropic’s Claude and Google’s Gemini were plausibly on top. Our frog got boiled slowly.

Epoch AI: However, one major difference between these generations is release cadence. OpenAI released relatively few major updates between GPT-3 and GPT-4 (most notably GPT-3.5). By contrast, frontier AI labs released many intermediate models between GPT-4 and 5. This may have muted the sense of a single dramatic leap by spreading capability gains over many releases.

Benchmarks can be misleading, especially as we saturate essentially all of them often well ahead of predicted schedules, but the overall picture is not. The mundane utility and user experience jumps across all use cases are similarly dramatic. The original GPT-4 was a modest aid to coding, GPT-5 and Opus 4.1 transform how it is done. Most of the queries I make with GPT-5-Thinking or GPT-5-Pro would not be worth bothering to give to the original GPT-4, or providing the context would not even be possible. So many different features have been improved or added.

This idea, frequently pushed by David Sacks among others, that everyone’s models are about the same and aren’t improving? These claims simply are not true. Observant regular users are not about to be locked into one model or ecosystem.

Everyone’s models are constantly improving. No one would seriously consider using models from the start of the year for anything but highly esoteric purposes.

The competition is closer than one would have expected. There are three major labs, OpenAI, Anthropic and Google, that each have unique advantages and disadvantages. At various times each has had the best model, and yes currently it is wise to mix up your usage depending on your particular use case.

Those paying attention are always ready to switch models. I’ve switched primary models several times this year alone, usually switching to a model from a different lab, and tested many others as well. And indeed we must switch models often either way, as it is expected that everyone’s models will change on the order of every few months, in ways that break the same things that would break if you swapped GPT-5 for Opus or Gemini or vice versa, all of which one notes typically run on three distinct sets of chips (Nvidia for GPT-5, Amazon Trainium for Anthropic and Google TPUs for Gemini) but we barely notice.

Most people notice AI progress much better when it impacts their use cases.

If you are not coding, and not doing interesting math, and instead asking simple things that do not require that much intelligence to answer correctly, then upgrading the AI’s intelligence is not going to improve your satisfaction levels much.

Jack Clark: Five years ago the frontier of LLM math/science capabilities was 3 digit multiplication for GPT-3. Now, frontier LLM math/science capabilities are evaluated through condensed matter physics questions. Anyone who thinks AI is slowing down is fatally miscalibrated.

David Shapiro: As I’ve said before, AI is “slowing down” insofar as most people are not smart enough to benefit from the gains from here on out.

Once you see this framing, you see the contrast everywhere.

Patrick McKenzie: I think a lot of gap between people who “get” LLMs and people who don’t is that some people understand current capabilities to be a floor and some people understand them to be either a ceiling or close enough to a ceiling.

And even if you explain “Look this is *obviously* a floor” some people in group two will deploy folk reasoning about technology to say “I mean technology decays in effectiveness all the time.” (This is not considered an insane POV in all circles.)

And there are some arguments which are persuasive to… people who rate social pressure higher than received evidence of their senses… that technology does actually frequently regress.

For example, “Remember how fast websites were 20 years ago before programmers crufted them up with ads and JavaScript? Now your much more powerful chip can barely keep up. Therefore, technological stagnation and backwards decay is quite common.”

Some people would rate that as a powerful argument. Look, it came directly from someone who knew a related shibboleth, like “JavaScript”, and it gestures in the direction of at least one truth in the observable universe.

Oh the joys of being occasionally called in as the Geek Whisperer for credentialed institutions where group two is high status, and having to titrate how truthful I am about their worldview to get message across.

As in, it’s basically this graph but for AI.

Here’s another variant of this foolishness, note the correlation to ‘hitting a wall’:

Prem Kumar Aparanji: It’s not merely the DL “hitting a wall” (as @GaryMarcus put it & everybody’s latched on) now as predicted, even the #AI data centres required for all the training, fine-tuning, inferencing of these #GenAI models are also now predicted to be hitting a wall soon.

Quotes from Futurism: For context, Kupperman notes that Netflix brings in just $39 billion in annual revenue from its 300 million subscribers. If AI companies charged Netflix prices for their software, they’d need to field over 3.69 billion paying customers to make a standard profit on data center spending alone — almost half the people on the planet.

“Simply put, at the current trajectory, we’re going to hit a wall, and soon,” he fretted. “There just isn’t enough revenue and there never can be enough revenue. The world just doesn’t have the ability to pay for this much AI.”

Prinz: Let’s assume that AI labs can charge as much as Netflix per month (they currently charge more) and that they’ll never have any enterprise revenue (they already do) and that they won’t be able to get commissions from LLM product recommendations (will happen this year) and that they aren’t investing in biotech companies powered by AI that will soon have drugs in human trial (they already have). How will they ever possibly be profitable?
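As a sanity check on the quoted Futurism numbers, here is a quick back-of-the-envelope sketch in Python. Only the $39 billion revenue, 300 million subscribers, and 3.69 billion customer figures come from the quote; the per-subscriber price, the implied revenue total, and the ~8.1 billion world population figure are derived or assumed here.

```python
# Back-of-the-envelope check of Kupperman's arithmetic as quoted above.
# The revenue, subscriber, and customer counts come from the quote; the
# world population figure (~8.1B) is an assumption used for the last line.

netflix_revenue = 39e9        # Netflix annual revenue, USD
netflix_subs = 300e6          # Netflix subscribers
required_customers = 3.69e9   # Kupperman's claimed customer count
world_population = 8.1e9      # assumption, roughly current

revenue_per_sub = netflix_revenue / netflix_subs        # ~$130 per year
implied_revenue = required_customers * revenue_per_sub  # ~$480B per year

print(f"Revenue per subscriber: ${revenue_per_sub:,.0f}/year")
print(f"Implied AI revenue needed: ${implied_revenue / 1e9:,.0f}B/year")
print(f"Share of world population: {required_customers / world_population:.0%}")
```

At Netflix’s ~$130 per subscriber per year, 3.69 billion customers works out to roughly $480 billion in annual revenue and about 46 percent of the world’s population, which matches the “almost half the people on the planet” framing.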

Gary Marcus wrote a guest opinion essay in The New York Times. Things didn’t go great.

That starts with the false title (as always, not entirely up to the author, and it looks like it started out as a better one), dripping with unearned condescension, ‘The Fever Dream of Imminent ‘Superintelligence’ Is Finally Breaking,’ and the opening paragraph in which he claims Altman implied GPT-5 would be AGI.

Here is the lead:

GPT-5, OpenAI’s latest artificial intelligence system, was supposed to be a game changer, the culmination of billions of dollars of investment and nearly three years of work. Sam Altman, the company’s chief executive, implied that GPT-5 could be tantamount to artificial general intelligence, or A.G.I. — A.I. that is as smart and as flexible as any human expert.

Instead, as I have written, the model fell short. Within hours of its release, critics found all kinds of baffling errors: It failed some simple math questions, couldn’t count reliably and sometimes provided absurd answers to old riddles. Like its predecessors, the A.I. model still hallucinates (though at a lower rate) and is plagued by questions around its reliability. Although some people have been impressed, few saw it as a quantum leap, and nobody believed it was A.G.I. Many users asked for the old model back.

GPT-5 is a step forward but nowhere near the A.I. revolution many had expected. That is bad news for the companies and investors who placed substantial bets on the technology.

Did you notice the stock market move in AI stocks, as those bets fell down to Earth when GPT-5 was revealed? No? Neither did I.

The argument above is highly misleading on many fronts.

  1. GPT-5 is not AGI, but this was entirely unsurprising – expectations were set too high, but nothing like that high. Yes, Altman teased that it was possible AGI could arrive relatively soon, but at no point did Altman claim that GPT-5 would be AGI, or that AGI would arrive in 2025. Approximately zero people had median estimates of AGI in 2025 or earlier, although there are some that have estimated the end of 2026, in particular Anthropic (they via Jack Clark continue to say ‘powerful’ AI buildable by end of 2026, not AGI arriving 2026).

  2. The claim that it ‘couldn’t count reliably’ is especially misleading. Of course GPT-5 can count reliably. The evidence here is a single adversarial example. For all practical purposes, if you ask GPT-5 to count something, it will count that thing.

  3. Old riddles is highly misleading. If you give it an actual old riddle it will nail it. What GPT-5 and other models get wrong are, again, adversarial examples that do not exist ‘in the wild’ but are crafted to pattern match well-known other riddles while having a different answer. Why should we care?

  4. GPT-5 still is not fully reliable but this is framed as it being still highly unreliable, when in most circumstances this is not the case. Yes, if you need many 9s of reliability LLMs are not yet for you, but neither are humans.

  5. AI valuations and stocks continue to rise, not fall.

  6. Yes, the fact that OpenAI chose to have GPT-5 not be a scaled up model does tell us that directly scaling up model size alone has ‘lost steam’ in relative terms due to the associated costs, but this is not news; o1 and o3 (and GPT-4.5) tell us this as well. We are now working primarily on scaling and improving in other ways, but very much there are still plans to scale up more in the future. In the context of all the other facts quoted about scaled up models, it seems likely to mislead many readers not to mention that GPT-5 is not a scaled up model.

  7. Claims here are about failures of GPT-5-Auto or GPT-5-Base, whereas the ‘scaled up’ version of GPT-5 is GPT-5-Pro or at least GPT-5-Thinking.

  8. Gary Marcus clarifies that his actual position is on the order of 8-15 years to AGI, with 2029 being ‘awfully unlikely.’ Which is a highly reasonable timeline, but that seems pretty imminent. That’s crazy soon. That’s something I would want to be betting on heavily, and preparing for at great cost. AGI that soon would be the most important thing happening in the world right now, if likely true.

    1. The article does not give any particular timeline, and does not imply we will never get to AGI, but I very much doubt those reading the post would come away with the impression that things strictly smarter than people are only about 10 years away. I mean, yowsers, right?

The fact about ‘many users asked for the old model back’ is true, but lacking the important context that what users wanted was the old personality, so it risks giving an uninformed user the wrong impression.

To Gary’s credit, he then does hedge, as I included in the quote, acknowledging GPT-5 is indeed a good model representing a step forward. Except then:

And it demands a rethink of government policies and investments that were built on wildly overinflated expectations.

Um, no? No it doesn’t. That’s silly.

The current strategy of merely making A.I. bigger is deeply flawed — scientifically, economically and politically. Many things, from regulation to research strategy, must be rethought.

As many now see, GPT-5 shows decisively that scaling has lost steam.

Again, no? That’s not the strategy. Not ‘merely’ doing that. Indeed, a lot of the reason GPT-5 was so relatively unimpressive was GPT-5 was not scaled up so much. It was instead optimized for compute efficiency. There is no reason to have to rethink much of anything in response to a model that, as explained above, was pretty much exactly on the relevant trend lines.

I do appreciate this:

Gary Marcus: However, as I warned in a 2022 essay, “Deep Learning Is Hitting a Wall,” so-called scaling laws aren’t physical laws of the universe like gravity but hypotheses based on historical trends.

As in, the ‘hitting the wall’ claim was back in 2022. How did that turn out? Look at GPT-5, look at what we had available in 2022, and tell me we ‘hit a wall.’

What does ‘imminent’ superintelligence mean in this context?

Gary Marcus (NYT): The chances of A.G.I.’s arrival by 2027 now seem remote.

Notice the subtle goalpost move, as AGI ‘by 2027’ means AGI 2026. These people are gloating, in advance, that someone predicted a possibility of privately developed AGI in 2027 (with a median in 2028, in the AI 2027 scenario OpenBrain tells the government but does not release its AGI right away to the public) and then AGI will have not arrived, to the public, in 2026.

According to my sources (Opus 4.1 and GPT-5 Thinking) even ‘remote’ still means on the order of 2% chance in the next 16 months, implying an 8%-25% chance in 5 years. I don’t agree, but even if one did, that’s hardly something one can safely rule out.
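To make that extrapolation concrete, here is a minimal sketch of one way the numbers hang together, assuming a ~2% chance per 16-month window; the method is my reconstruction, not anything stated by Marcus or the models. A constant hazard rate lands near the 8% floor, while a per-window chance that roughly doubles each period lands near the 25% ceiling.

```python
import math

# Reconstruction (an assumption, not a stated method): treat "remote" as a
# ~2% chance of AGI in the next 16 months and extend to 5 years (60 months).

p_16mo = 0.02
months = 60

# Constant hazard: the same 2% chance in every 16-month window.
p_constant = 1 - (1 - p_16mo) ** (months / 16)
print(f"Constant hazard over 5 years: {p_constant:.1%}")  # ~7.3%

# Rising hazard: the per-window chance doubles each period (4 windows ~ 64 months).
p_survive = 1.0
for k in range(math.ceil(months / 16)):
    p_survive *= 1 - min(p_16mo * 2**k, 1.0)
print(f"Doubling hazard over 5 years: {1 - p_survive:.1%}")  # ~27%
```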

But then, there’s this interaction on Twitter that clarifies what Gary Marcus meant:

Gary Marcus: Anyone who thinks AGI is impossible: wrong.

Anyone who thinks AGI is imminent: just as wrong.

It’s not that complicated.

Peter Wildeford: what if I think AGI is 4-15 years away?

Gary Marcus: 8-15 and we might reach an agreement. 4 still seems awfully unlikely to me. Too many core cognitive problems aren’t really being addressed, and solutions may take a while to roll out once we find the basic insights we are lacking.

But it’s a fair question.

That’s a highly reasonable position one can take. Awfully unlikely (but thus possible) in four years, likely in 8-15, median timeline of 2036 or so.

Notice that on the timescale of history, 8-15 years until likely AGI, the most important development in the history of history if and when it happens, seems actually kind of imminent and important? That should demand an aggressive policy response focused on what we are going to do when we get there, not be treated as a reason to dismiss all this.

Imagine saying, in 2015, ‘I think AGI is far away, we’re talking 18-25 years’ and anticipating the looks you would get.

The rest of the essay is a mix of policy suggestions and research direction suggestions. If indeed he is right about research directions, of which I am skeptical, we would still expect to see rapid progress soon as the labs realize this and pivot.

A common tactic among LLM doubters, which was one of the strategies used in the NYT editorial, is to show a counterexample, where a model fails a particular query, and say ‘model can’t do [X]’ or the classic Colin Fraser line of ‘yep it’s dumb.’

Here’s a chef’s kiss example I saw on Monday morning: a model flubbing a variant of the famous ‘strawberry’ letter-counting question.

I mean, that’s very funny, but it is rather obvious how it happened with the strawberries thing all over Twitter and thus the training data, and it tells us very little about overall performance.

In such situations, we have to differentiate between different procedures, the same as in any other scientific experiment. As in:

Did you try to make it fail, or try to set it up to succeed? Did you choose an adversarial or a typical example? Did you get this the first time you tried it or did you go looking for a failure? Are you saying it ‘can’t [X]’ because it can’t ever do [X], because it can’t ever do [X] out of the box, it can’t reliably do [X], or it can’t perfectly do [X], etc?

If you conflate ‘I can elicit wrong answers on [X] if I try’ with ‘it can’t do [X]’ then the typical reader will have a very poor picture.

Daniel Litt (responding to NYT article by Gary Marcus that says ‘[GPT-5] failed some simple math questions, couldn’t count reliably’): While it’s true one can elicit poor performance on basic math question from frontier models like GPT-5, IMO this kind of thing (in NYTimes) is likely to mislead readers about their math capabilities.

Derya Unutmaz: AI misinformation at the NYT is at its peak. What a piece of crap “newspaper” it has become. It’s not even worth mentioning the author of this article-but y’all can guess. Meanwhile, just last night I posted a biological method invented by GPT-5 Pro, & I have so much more coming!

Ethan Mollick: This is disappointing. Purposefully underselling what models can do is a really bad idea. It is possible to point out that AI is flawed without saying it can’t do math or count – it just isn’t true.

People need to be realistic about capabilities of models to make good decisions.

I think the urge to criticize companies for hype blends into a desire to deeply undersell what models are capable of. Cherry-picking errors is a good way of showing odd limitations to an overenthusiastic Twitter crowd, but not a good way of making people aware that AI is a real factor.

Shakeel: The NYT have published a long piece by Gary Marcus on why GPT-5 shows scaling doesn’t work anymore. At no point does the piece mention that GPT-5 is not a scaled up model.

[He highlights the line from the post, ‘As many now see, GPT-5 shows decisively that scaling has lost steam.’]

Tracing Woods: Gary Marcus is a great demonstration of the power of finding a niche and sticking to it

He had the foresight to set himself up as an “AI is faltering” guy well in advance of the technology advancing faster than virtually anyone predicted, and now he’s the go-to

The thing I find most impressive about Gary Marcus is the way he accurately predicted AI would scale up to an IMO gold performance and then hit a wall (upcoming).

Gary Marcus was not happy about these responses, and doubled down on ‘but you implied it would be scaled up, no takesies backsies.’

Gary Marcus (replying to Shakeel directly): this is intellectually dishonest, at BEST it is at least as big as 4.5, which was intended as 5, which was significantly larger than 4. it is surely scaled up compared to 4, which is what i compared it to.

Shakeel: we know categorically that it is not an OOM scale up vs. GPT-4, so … no. And there’s a ton of evidence that it’s smaller than 4.5.

Gary Marcus (QTing Shakeel): intellectually dishonest reply to my nytimes article.

openai implied repeatedly that GPT-5 was a scaled up model. it is surely scaled up relative to GPT-4.

it is possible – openAI has been closed mouth – that it is same size as 4.5 but 4.5 itself was surely scaled relative to 4, which is what i was comparing with.

amazing that after years of discussion of scaling the new reply is to claim 5 wasn’t scaled at all.

Note that if it wasn’t, contra all the PR, that’s even more reason to think that OpenAI knows damn well that is time for leaning on (neuro)symbolic tools and that scaling has reached diminishing returns.

JB: It can’t really be the same in parameter count as gpt4.5, they really struggled serving that and it was much more expensive on the API to use

Gary Marcus: so a company valued at $300b that’s raised 10s of billions didn’t have the money to scale anymore even though their whole business plan was scaling? what does that tell you?

I am confused how one can claim Shakeel is being intellectually dishonest. His statement is flat out true. Yes, of course the decision not to scale up further tells us something.

It tells me that they want to scale how much they serve the model and how much they do reasoning at inference time, and that this was the most economical solution for them at the time. JB is right that very, very obviously GPT-4.5 is a bigger model than GPT-5 and it is crazy to not realize this.

A post like this would be incomplete if I failed to address superforecasters.

I’ve been over this several times before, where superforecasters reliably have crazy slow projections for progress and even crazier predictions that when we do make minds smarter than ourselves that is almost certainly not an existential risk.

My coverage of this started way back in AI #14 and AI #9 regarding existential risk estimates, including Tetlock’s response to AI 2027. One common theme in such timeline projections is predicting Nothing Ever Happens even when this particular something has already happened.

Now that the dust settled on models getting IMO Gold in 2025, it is a good time to look back on the fact that domain experts expected less progress in math than we got, and superforecasters expected a lot less, across the board.

Forecasting Research Institute: Respondents—especially superforecasters—underestimated AI progress.

Participants predicted the state-of-the-art accuracy of ML models on the MATH, MMLU, and QuaLITY benchmarks by June 2025.

Domain experts assigned probabilities of 21.4%, 25%, and 43.5% to the achieved outcomes. Superforecasters assigned even lower probabilities: just 9.3%, 7.2%, and 20.1% respectively.

The International Mathematical Olympiad results were even more surprising. AI systems achieved gold-level performance at the IMO in July 2025. Superforecasters assigned this outcome just a 2.3% probability. Domain experts put it at 8.6%.

Garrison Lovely: This makes Yudkowsky and Paul Christiano’s predictions of IMO gold by 2025 look even more prescient (they also predicted it a ~year before this survey was conducted).

Note that even Yudkowsky and Christiano had only modest probability that the IMO would fall as early as 2025.

Andrew Critch: Yeah sorry forecasting fam, ya gotta learn some AI if you wanna forecast anything, because AI affects everything and if ya don’t understand it ya forecast it wrong.

Or, as I put it back in the unrelated-to-AI post Rock is Strong:

Everybody wants a rock. It’s easy to see why. If all you want is an almost always right answer, there are places where they almost always work.

The security guard has an easy to interpret rock because all it has to do is say “NO ROBBERY.” The doctor’s rock is easy too, “YOU’RE FINE, GO HOME.” This one is different, and doesn’t win the competitions even if we agree it’s cheating on tail risks. It’s not a coherent world model.

Still, on the desk of the best superforecaster is a rock that says “NOTHING EVER CHANGES OR IS INTERESTING” as a reminder not to get overexcited, and to not assign super high probabilities to weird things that seem right to them.

Thus:

Daniel Eth: In 2022, superforecasters gave only a 2.3% chance of an AI system achieving an IMO gold by 2025. Yet this wound up happening. AI progress keeps being underestimated by superforecasters.

I feel like superforecasters are underperforming in AI (in this case even compared to domain experts) because two reference classes are clashing:

• steady ~exponential increase in AI

• nothing ever happens.

And for some reason, superforecasters are reaching for the second.

Hindsight is hindsight, and yes you will get a 98th percentile result 2% of the time. But I think at 2.3% for 2025 IMO Gold, you are not serious people.

That doesn’t mean that being serious people was the wise play here. The incentives might well have been to follow the ‘nothing ever happens’ rock. We still have to realize this, as we can indeed smell what the rock is cooking.

A wide range of potential paths of AI progress are possible. There are a lot of data points that should impact the distribution of outcomes, and one must not overreact to any one development. One should especially not overreact to not being blown away by progress for a span of a few months. Consider your baseline that’s causing that.

My timelines for hitting various milestones, including various definitions of AGI, involve a lot of uncertainty. I think not having a lot of uncertainty is a mistake.

I especially think saying either ‘AGI almost certainly won’t happen within 5 years’ or ‘AGI almost certainly will happen within 15 years,’ would be a large mistake. There are so many different unknowns involved.

I can see treating full AGI in 2026 as effectively a Can’t Happen. I don’t think you can extend that even to 2027, although I would lay large odds against it hitting that early.

A wide range of medians seem reasonable to me. I can see defending a median as early as 2028, or one that extends to 2040 or beyond if you think it is likely that anything remotely like current approaches cannot get there. I have not put a lot of effort into picking my own number since the exact value currently lacks high value of information. If you put a gun to my head for a typical AGI definition I’d pick 2031, but with no ‘right to be surprised’ if it showed up in 2028 or didn’t show up for a while. Consider the 2031 number loosely held.

To close out, consider once again: Even if we agreed with Gary Marcus and said 8-15 years, with median 2036? Take a step back and realize how soon and crazy that is.




Judge: Anthropic’s $1.5B settlement is being shoved “down the throat of authors”

At a hearing Monday, US district judge William Alsup blasted a proposed $1.5 billion settlement over Anthropic’s rampant piracy of books to train AI.

The proposed settlement comes in a case where Anthropic could have owed more than $1 trillion in damages after Alsup certified a class that included up to 7 million claimants whose works were illegally downloaded by the AI company.

Instead, critics fear Anthropic will get off cheaply, striking a deal with authors suing that covers less than 500,000 works and paying a small fraction of its total valuation (currently $183 billion) to get away with the massive theft. Defector noted that the settlement doesn’t even require Anthropic to admit wrongdoing, while the company continues raising billions based on models trained on authors’ works. Most recently, Anthropic raised $13 billion in a funding round, making back about 10 times the proposed settlement amount after announcing the deal.

Alsup expressed grave concerns that lawyers rushed the deal, which he said now risks being shoved “down the throat of authors,” Bloomberg Law reported.

In an order, Alsup clarified why he thought the proposed settlement was a chaotic mess. The judge said he was “disappointed that counsel have left important questions to be answered in the future,” seeking approval for the settlement despite the Works List, the Class List, the Claim Form, and the process for notification, allocation, and dispute resolution all remaining unresolved.

Denying preliminary approval of the settlement, Alsup suggested that the agreement is “nowhere close to complete,” forcing Anthropic and authors’ lawyers to “recalibrate” the largest publicly reported copyright class-action settlement ever inked, Bloomberg reported.

Of particular concern, the settlement failed to outline how disbursements would be managed for works with multiple claimants, Alsup noted. Until all these details are ironed out, Alsup intends to withhold approval, the order said.

One big change the judge wants to see is the addition of instructions requiring “anyone with copyright ownership” to opt in, with the consequence that the work won’t be covered if even one rights holder opts out, Bloomberg reported. There should also be instruction that any disputes over ownership or submitted claims should be settled in state court, Alsup said.



Switch modder owes Nintendo $2 million after representing himself in court

Daly’s pro se legal representation in the case was notable for its use of several novel affirmative defenses, including arguments that Nintendo’s “alleged copyrights are invalid,” that Nintendo “does not have standing to bring suit,” and that Nintendo “procured a contract [with Daly] through fraudulent means.” For the record, the judgment in this case reasserts that Nintendo “owns valid copyrights in works protected by the TPMs, including Nintendo games and the Nintendo Switch operating system.”

In addition to $2 million in damages, Daly is specifically barred from “obtaining, possessing, accessing, or using” any DRM circumvention device or hacked console, with or without the intent to sell it. The judgment also bars Daly from publishing or “linking to” any website with instructions for hacking consoles and from “reverse engineering” any Nintendo consoles or games. Control of Daly’s ModdedHardware.com domain name will also be transferred to Nintendo.

Nintendo’s latest legal victory comes years after a $4.5 million plea deal with Gary “GaryOPA” Bowser, one of the leaders behind Team Xecuter and its SX line of Switch hacking devices. Bowser also served 14 months of a 40-month prison sentence in that case and said last year that he will likely be paying Nintendo back for the rest of his life.



Geoengineering will not save humankind from climate change

A team of the world’s best ice and climate researchers studied a handful of recently publicized engineering concepts for protecting Earth’s polar ice caps and found that none of them are likely to work.

Their peer-reviewed research, published Tuesday, shows some of the untested ideas, such as dispersing particles in the atmosphere to dim sunlight or trying to refreeze ice sheets with pumped water, could have unintended and dangerous consequences.

The various speculative notions that have been floated, mainly via public relations efforts, include things such as spreading reflective particles over newly formed sea ice to promote its persistence and growth; building giant ocean-bottom sea walls or curtains to deflect warmer streams of water away from ice shelves; pumping water from the base of glaciers to the surface to refreeze it, and even intentionally polluting the upper atmosphere with sulfur-based or other reflective particles to dim sunlight.

Research shows the particle-based sunlight-dimming concept could shift rainfall patterns like seasonal monsoons critical for agriculture in some areas, and also intensify regional heat, precipitation, and drought extremes. And the authors of the new paper wrote that some of the mechanical interventions to preserve ice would likely disrupt regional ocean ecosystems, including the marine food chain, from tiny krill to giant whales.

Lead author Martin Siegert, a glaciologist at the University of Exeter, said that to provide a comprehensive view of the challenges, the new paper included 40 authors with expertise in fields including oceanography, marine biology, glaciology, and atmospheric science.

The paper counters a promotional geoengineering narrative with science-based evidence showing the difficulties and unintended consequences of some of the aspirational ventures, he said. Most geoengineering ideas are climate Band-Aids at best. They only address symptoms, he added, but don’t tackle the root cause of the problem—greenhouse gas emissions.



F1 in Italy: Look what happens when the downforce comes off

That was enough to allow Piastri past. However, the team instructed the championship leader to slow down and relinquish the position to Norris. It was a team mistake, not a driver mistake, and McLaren is doing everything in its power to ensure the eventual champion gets there because of their driving and not some external factor. Piastri didn’t sound exactly happy on the radio. But F1 is a team sport, and racing drivers are employees—when your boss gives you an order, it’s wise to do what they ask and argue about it after the fact, if continued employment is one of your goals.

Oscar Piastri (L) and Lando Norris (R) have a very 21st century relationship. Jakub Porzycki/NurPhoto via Getty Images

For many, a slow pit stop is just one of those things bestowed by the racing gods, and even Verstappen pointed that out when informed by his engineer of the change in positions behind him. After the race, Norris seemed a little embarrassed to have been given the place back, but the emerging consensus from former drivers was that, since Norris had been asked about pit stop priority, and had been undercut anyway, that was sufficient to excuse the request.

McLaren’s approach to handling its drivers is markedly different from the all-out war we saw when Lewis Hamilton and Fernando Alonso raced for it in 2007. Then, neither went home with the big trophy at the end of the year—their infighting allowed Kimi Raikkonen to take the title for Ferrari instead.

That won’t happen this year; either Norris or Piastri will be crowned champion at the end of the season, with the other having to wait at least another year. The pair have even been asked how they want the team to celebrate in the event the other driver wins—a sensitivity that feels refreshingly new for Formula 1.

Formula 1 heads to Azerbaijan in two weeks for another low-downforce race. Can we expect another Verstappen victory?



All 54 lost clickwheel iPod games have now been preserved for posterity

Last year, we reported on the efforts of classic iPod fans to preserve playable copies of the downloadable clickwheel games that Apple sold for a brief period in the late ’00s. The community was working to get around Apple’s onerous FairPlay DRM by having people who still owned original copies of those (now unavailable) games sync their accounts to a single iTunes installation via a coordinated Virtual Machine. That “master library” would then be able to provide playable copies of those games to any number of iPods in perpetuity.

At the time, the community was still searching for iPod owners with syncable copies of the last few titles needed for their library. With today’s addition of Real Soccer 2009 to the project, though, all 54 official iPod clickwheel games are now available together in an easily accessible format for what is likely the first time.

All at once, then slowly

GitHub user Olsro, the originator of the iPod Clickwheel Games Preservation Project, tells Ars that he lucked into contact with three people who had large iPod game libraries in the first month or so after the project’s launch last October. That includes one YouTuber who had purchased and maintained copies of 39 distinct games, even repurchasing some of the upgraded versions Apple sold separately for later iPod models.

Ars’ story on the project shook out a few more iPod owners with syncable iPod game libraries, and subsequent updates in the following days left just a handful of titles unpreserved. But that’s when the project stalled, Olsro said, with months wasted on false leads and technical issues that hampered the effort to get a complete library.

“I’ve put a lot of time into coaching people that [had problems] transferring the files and authorizing the account once with me on the [Virtual Machine],” Olsro told Ars. “But I kept motivation to continue coaching anyone else coming to me (by mail/Discord) and making regular posts to increase awareness until I could find finally someone that could, this time, go with me through all the steps of the preservation process,” he added on Reddit.

Getting this working copy of Real Soccer 2009 was an “especially cursed” process, Olsro said. Credit: Olsro / Reddit

Getting working access to the final unpreserved game, Real Soccer 2009, was “especially cursed,” Olsro tells Ars. “Multiple [people] came to me during this summer and all attempts failed until a new one from yesterday,” he said. “I even had a situation when someone had an iPod Nano 5G with a playable copy of Real Soccer, but the drive was appearing empty in the Windows Explorer. He tried recovery tools & the iPod NAND just corrupted itself, asking for recovery…”



What to expect (and not expect) from yet another September Apple event


An all-new iPhone variant, plus a long list of useful (if predictable) upgrades.

Apple’s next product announcement is coming soon. Credit: Apple

Apple’s next product event is happening on September 9, and while the company hasn’t technically dropped any hints about what’s coming, anyone with a working memory and a sense of object permanence can tell you that an Apple event in the month of September means next-generation iPhones.

Apple’s flagship phones have changed in mostly subtle ways since 2022’s iPhone 14 Pro added the Dynamic Island and 2023’s refreshes switched from Lightning to USB-C. Chips get gradually faster, cameras get gradually better, but Apple hasn’t done a seismic iPhone X-style rethinking of its phones since, well, 2017’s iPhone X.

The rumor mill thinks that Apple is working on a foldable iPhone—and such a device would certainly benefit from years of investment in the iPad—but if it’s coming, it probably won’t be this year. That doesn’t mean Apple is totally done iterating on the iPhone X-style design, though. Let’s run down what the most reliable rumors have said we’re getting.

The iPhone 17

Last year’s iPhone 16 Pro bumped the screen sizes from 6.1 and 6.7 inches to 6.3 and 6.9 inches. This year’s iPhone 17 will allegedly get a 6.3-inch screen with a high-refresh-rate ProMotion panel, but the iPhone Plus is said to be going away. Credit: Apple

Apple’s vanilla one-size-fits-most iPhone is always the centerpiece of the lineup, and this year’s iteration is expected to bring the typical batch of gradual iterative upgrades.

The screen will supposedly be the biggest beneficiary, upgrading from 6.1 inches to 6.3 inches (the same size as the current iPhone 16 Pro) and adding a high-refresh-rate ProMotion screen that has typically been reserved for the Pro phones. Apple is always careful not to add too many “Pro”-level features to the entry-level iPhones, but this one is probably overdue—even less-expensive Android phones like the Pixel 9a often ship with 90 Hz or 120 Hz screens at this point. It’s not clear whether that will also enable the always-on display feature that has also historically been exclusive to the iPhone Pro, but the fluidity upgrade will be nice regardless.

Aside from that, there aren’t many specific improvements we’ve seen reported on, but there are plenty we can comfortably guess at. Improved front- and rear-facing cameras and a new Apple A19-series chip with at least the 8GB of RAM needed to support Apple Intelligence are both pretty safe bets.

But there’s one thing we supposedly won’t get, which is a new large-sized iPhone Plus. That brings us to our next rumor.

The “iPhone Air”

For the last few years, every new iPhone launch has actually brought us four iPhones—a regular iPhone in two different sizes and an iPhone Pro with a better camera, better screen, faster chip, and other improvements in a regular size and a large size.

It’s the second size of the regular iPhone that has apparently given Apple some trouble. It made a couple of generations of “iPhone mini,” an attempt to address a small-but-vocal contingent of Phones Are Just Too Big These Days people that apparently didn’t sell well enough to continue making. That was replaced by the iPhone Plus, aimed at people who wanted a bigger screen but who weren’t ready to pay for an iPhone Pro Max.

The Plus phones at least gave the iPhone lineup a nice symmetry—two tiers of phone, with a regular one and a big one at each tier—but rumors suggest that the Plus phone is also going away this year. Like the iPhone mini before it, it apparently just wasn’t selling well enough to be worth the continued effort.

That brings us to this year’s fourth iPhone: Apple is supposedly planning to release an “iPhone Air,” which will weigh less than the regular iPhone and is said to be 5.5 or 6 mm thick, depending on who you ask (the iPhone 16 is 7.8 mm).

A 6.3-inch ProMotion display and A19-series chip are also expected to be a part of the iPhone Air, but rather than try to squeeze every feature of the iPhone 17 into a thinner phone, it sounds like the iPhone 17 Air will cater to people who are willing to give a few things up in the interest of getting a thinner and lighter device. It will reportedly have worse battery life than the regular iPhone and just a single-lens camera setup (though the 48 MP sensors Apple has switched to in recent iPhones make it easier than it used to be to “fake” optical zoom features).

We don’t know anything about the pricing for any of these phones, but Bloomberg’s Mark Gurman suggests that the iPhone Air will be positioned between the regular iPhone and the iPhone Pro—more like the iPad lineup, where the Air is the mid-tier choice, and less like the Mac, where the Air is the entry-level laptop.

iPhone 17 Pro

Apple’s Pro iPhones are generally “the regular iPhone, but more,” and sometimes they’re “what all iPhones will look like in a couple of years, but available right now for people who will pay more for it.” The new ones seem set to continue in that vein.

The most radical change will apparently be on the back—Apple is said to be switching to an even larger camera array that stretches across the entire top-rear section of the phone, an arrangement you’ll occasionally see in some high-end Android phones (Google’s Pixel 10 is one). That larger camera bump will likely enable a few upgrades, including a switch from a 12 MP sensor for the telephoto zoom lens to a 48 MP sensor. And it will also be part of a more comprehensive metal-and-glass body that’s more of a departure from the glass-backed-slab design Apple has been using since the iPhone 12.

A 48MP telephoto sensor could increase the amount of pseudo-optical zoom that the iPhone can offer. The main iPhones will condense a 48 MP photo down to 12 MP when you’re in the regular shooting mode, binning pixels to improve image quality. For zoomed-in photos, it can just take a 12 MP section out of the middle of the 48 MP image—you lose the benefit of pixel binning, but you’re still getting a “native resolution” photo without blurry digital zoom. With a better sensor, Apple could do exactly the same thing with the telephoto lens.
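As an illustration of the binning-versus-cropping tradeoff described above, the two modes look something like this; this is a toy sketch operating on a grayscale array, not Apple’s actual imaging pipeline.

```python
import numpy as np

def bin_2x2(sensor: np.ndarray) -> np.ndarray:
    """Average each 2x2 block of pixels: a 48 MP frame becomes 12 MP."""
    h, w = sensor.shape
    return sensor.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def crop_center(sensor: np.ndarray) -> np.ndarray:
    """Take the middle quarter of the frame: 12 MP at native resolution."""
    h, w = sensor.shape
    return sensor[h // 4 : 3 * h // 4, w // 4 : 3 * w // 4]

# Toy stand-in for a 48 MP sensor (dimensions scaled down for the example).
frame = np.random.rand(800, 600)
print(bin_2x2(frame).shape)      # (400, 300): quarter the pixels, lower noise
print(crop_center(frame).shape)  # (400, 300): same pixel count, 2x tighter view
```

Either path yields a 12 MP image; binning trades resolution for noise performance, while the center crop trades noise performance for a zoomed-in, native-resolution view.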

Apple reportedly isn’t planning any changes to screen size this year—still 6.3 inches for the regular Pro and 6.9 inches for the Max. The Pro phones are said to be getting new “A19 Pro” series chips that are superior to the regular A19 processors (though in what way, exactly, we don’t yet know), and the new models could also shrink the amount of screen space dedicated to the Dynamic Island.

New Apple Watches

The Apple Watch Series 10 from 2024. Credit: Apple

New iPhone announcements are usually paired with new Apple Watch announcements, though if anything, the Watch has changed even less than the iPhone has over the last few years.

The Apple Watch Series 11 won’t be getting a screen size increase—the Series 10 bumped things up a smidge just last year, from 41 and 45 mm to 42 and 46 mm. But the screen will apparently have a higher maximum brightness—always useful for outdoor visibility—and there will be a modestly improved Apple S11 chip on the inside.

The entry-level Apple Watch SE is also apparently due for an upgrade. The current second-generation SE still uses an Apple S8 chip, and Apple Watch Series 4-era 40 and 44 mm screens that don’t support always-on operation. In other words, there’s plenty that Apple could upgrade here without cannibalizing sales of the mainstream Series 11 watch.

Finally, after missing out on an update last year, Apple also reportedly plans to deliver a new Apple Watch Ultra, with the larger 46 mm screen from the Series 10/11 watches and the same updated S11 chip as the regular Apple Watch. The current Apple Watch Ultra 2 already has a brighter screen than the Series 10—3,000 nits, up from 2,000—so it’s not clear whether the Apple Watch Ultra 3’s screen would also get brighter or if the Series 11’s screen is just getting a brightness boost to match what the Ultra can do.

Smart home, TV, and audio

Though iPhones and Apple Watches are usually a lock for a September event, other products and accessory updates are also possible.

Of these, the most high-profile is probably a refresh for the Apple TV 4K streaming box, which would be its first update in three years. Rumors suggest that the main upgrade for a new model would be an Apple A17 Pro chip, introduced for the iPhone 15 Pro and also used in the iPad mini 7. The A17 Pro is paired with 8GB of RAM, which makes it Apple’s smallest and cheapest chip that’s capable of Apple Intelligence. Apple hasn’t done anything with Apple Intelligence on the Apple TV yet, but that’s partly because none of the existing hardware is capable of running it.

Also in the “possible but not guaranteed” column: new high-end AirPods Pro, the first-ever internal update to 2020’s HomePod Mini speaker, a new AirTag location tracker, and a straightforward internals-only refresh of the Vision Pro headset. Any, all, or none of these could break cover at the event next week, but Gurman claims they’re all “coming soon.”

New software updates

Devices running Apple’s latest beta operating systems. Credit: Apple

We know most of what there is to know about iOS 26, iPadOS 26, macOS 26, and Apple’s other software updates this year, thanks to a three-month-old WWDC presentation and months of public beta testing. There might be a feature or two exclusive to the newest iPhones, but that sort of thing is usually camera-related and usually pretty minor.

The main thing to expect will be release dates for the final versions of all of the updates. Apple usually releases a near-final release candidate build on the day of the presentation, gives developers a week or so to finalize and submit their updated apps for App Review, and then releases the updates after that. Expect to see them rolled out to everyone sometime the week of September 15th (though an earlier release is always a possibility).

What’s probably not happening

We’d be surprised to see anything related to the Mac or the iPad at the event next week, even though several models are in a window where the timing is about right for an Apple M5 refresh.

Macs and iPads have shared the stage with the iPhone before, but in recent years, Apple has held these refreshes back for another, smaller event in October or November. If Apple has new MacBook Pro or iPad Pro models slated for 2025, we’d expect to see them in a month or two.


Andrew is a Senior Technology Reporter at Ars Technica, with a focus on consumer tech including computer hardware and in-depth reviews of operating systems like Windows and macOS. Andrew lives in Philadelphia and co-hosts a weekly book podcast called Overdue.



COVID vaccine locations vanish from Google Maps due to supposed “technical issue”

Results for the flu vaccine appear in Maps, but not COVID. The only working COVID results are hundreds of miles away. Credit: Ryan Whitwam

Ars reached out to Google for an explanation, receiving a cryptic and somewhat unsatisfying reply. “Showing accurate information on Maps is a top priority,” says a Google spokesperson. “We’re working to fix this technical issue.”

So far, we are not aware of other Maps searches that have been similarly affected. Google has yet to respond to further questions on the nature of the apparent glitch, which has wiped out COVID vaccine information in Maps while continuing to return results for other medical services and immunizations.

The sudden erosion of federal support for routine vaccinations lurks in the background of this bizarre issue. When the Trump administration decided to rename the Gulf of Mexico, Google was widely hectored for its decision to quickly show “Gulf of America” on its maps, aligning with the administration’s preferred nomenclature. With anti-vaccine actions ramping up at the federal level, it is tempting to see a similar, nefarious purpose behind these disappearing results.

At present, we have no evidence that the change in Google’s search results was intentional or targeted specifically at COVID immunization—indeed, making that change in such a ham-fisted way would be inadvisable. It does seem like an ill-timed and unusually specific “technical issue,” though. If Google provides further details on the missing search results, we’ll post an update.



New AI model turns photos into explorable 3D worlds, with caveats

Training with automated data pipeline

Voyager builds on Tencent’s earlier HunyuanWorld 1.0, released in July. Voyager is also part of Tencent’s broader “Hunyuan” ecosystem, which includes the Hunyuan3D-2 model for text-to-3D generation and the previously covered HunyuanVideo for video synthesis.

To train Voyager, researchers developed software that automatically analyzes existing videos, estimating camera motion and calculating depth for every frame—eliminating the need for humans to manually label thousands of hours of footage. The system processed over 100,000 video clips from both real-world recordings and the aforementioned Unreal Engine renders.

A diagram of the Voyager world creation pipeline. Credit: Tencent
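Conceptually, the annotation loop described above is simple, even if the models doing the estimation are not. Here’s a purely hypothetical Python sketch—`estimate_camera_pose` and `estimate_depth` are stand-ins for real structure-from-motion and monocular depth estimators, not Tencent’s actual code:

```python
from dataclasses import dataclass

import numpy as np

@dataclass
class LabeledFrame:
    image: np.ndarray        # the raw video frame
    camera_pose: np.ndarray  # hypothetical 6-DoF pose (translation + rotation)
    depth: np.ndarray        # hypothetical per-pixel depth map

def estimate_camera_pose(frame: np.ndarray) -> np.ndarray:
    # Stand-in for a real visual-odometry/structure-from-motion estimator.
    return np.zeros(6)

def estimate_depth(frame: np.ndarray) -> np.ndarray:
    # Stand-in for a real monocular depth model.
    return np.ones(frame.shape[:2])

def label_clip(frames: list[np.ndarray]) -> list[LabeledFrame]:
    """Turn a raw video clip into supervised training data, no human labeling."""
    return [
        LabeledFrame(f, estimate_camera_pose(f), estimate_depth(f))
        for f in frames
    ]
```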

The model demands serious computing power to run, requiring at least 60GB of GPU memory for 540p resolution, though Tencent recommends 80GB for better results. Tencent published the model weights on Hugging Face and included code that works with both single and multi-GPU setups.

The model comes with notable licensing restrictions. Like other Hunyuan models from Tencent, the license prohibits usage in the European Union, the United Kingdom, and South Korea. Additionally, commercial deployments serving over 100 million monthly active users require separate licensing from Tencent.

On the WorldScore benchmark developed by Stanford University researchers, Voyager reportedly achieved the highest overall score of 77.62, compared to 72.69 for WonderWorld and 62.15 for CogVideoX-I2V. The model reportedly excelled in object control (66.92), style consistency (84.89), and subjective quality (71.09), though it placed second in camera control (85.95) behind WonderWorld’s 92.98. WorldScore evaluates world generation approaches across multiple criteria, including 3D consistency and content alignment.

While these self-reported benchmark results seem promising, wider deployment still faces challenges given the computational muscle required. For developers who need faster processing, the system supports parallel inference across multiple GPUs using the xDiT framework. Running on eight GPUs delivers processing speeds 6.69 times faster than a single-GPU setup.
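As a quick sanity check on that scaling claim, here’s a back-of-the-envelope calculation using only the numbers reported above:

```python
gpus = 8
speedup = 6.69  # reported speedup over a single GPU with xDiT parallel inference
efficiency = speedup / gpus
print(f"Parallel efficiency on {gpus} GPUs: {efficiency:.0%}")  # ~84%
```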

Given the processing power required and the limitations in generating long, coherent “worlds,” it may be a while before we see real-time interactive experiences using a similar technique. But as we’ve seen so far with experiments like Google’s Genie, we’re potentially witnessing very early steps into a new interactive, generative art form.



Former NASA chief says United States likely to lose second lunar space race

The hearing, titled “There’s a Bad Moon on the Rise: Why Congress and NASA Must Thwart China in the Space Race,” had no witnesses who disagreed with this viewpoint. They included Allen Cutler, CEO of the Coalition for Deep Space Exploration, the chief lobbying organization for SLS, Orion, and Gateway; Jim Bridenstine, former NASA Administrator who now leads government operations for United Launch Alliance; Mike Gold of Redwire, a Gateway contractor; and Lt. General John Shaw, former Space Command official.

The hearing, held before the Senate Commerce, Science, and Transportation Committee that Cruz chairs, included the usual mishmash of parochial politics, lobbying for traditional space, back-slapping, and fawning—at one point, Gold, a Star Trek fan, went so far as to assert that Cruz is the “Captain Kirk” of the US Senate.

Beyond this, however, there was a fair amount of teeth-gnashing over the fact that the United States faces a serious threat from China, which appears to be on course to put humans on the Moon before NASA can return there with the Artemis Program. China aims to land astronauts at the lunar South Pole before 2030.

NASA likely to lose “race”

Bridenstine, who oversaw the creation of the Artemis Program half a decade ago, put it most bluntly: “Unless something changes, it is highly unlikely the United States will beat China’s projected timeline to the Moon’s surface,” he said.

Bridenstine and others on the panel criticized the complex nature of SpaceX’s Starship-based lunar lander, which NASA selected in April 2021 as a means to get astronauts down to the lunar surface and back. The proposal relies on Starship being refueled in low-Earth orbit by multiple Starship tanker launches.



Trump’s move of SPACECOM to Alabama has little to do with national security


The Pentagon says the move will save money, but acknowledges risk to military readiness.

President Donald Trump speaks to the media in the Oval Office at the White House on September 2, 2025 in Washington, DC. Credit: Alex Wong/Getty Images

President Donald Trump announced Tuesday that US Space Command will be relocated from Colorado to Alabama, returning to the Pentagon’s plans for the command’s headquarters from the final days of Trump’s first term in the White House.

The headquarters will move to the Army’s Redstone Arsenal in Huntsville, Alabama. Trump made the announcement in the Oval Office, flanked by Republican members of the Alabama congressional delegation.

The move will “help America defend and dominate the high frontier,” Trump said. It also marks another twist on a contentious issue that has pitted Colorado and Alabama against one another in a fight for the right to be home to the permanent headquarters of Space Command (SPACECOM), a unified combatant command responsible for carrying out military operations in space.

Space Command is separate from the Space Force and is made up of personnel from all branches of the armed services. The Space Force, on the other hand, is charged with supplying personnel and technology for use by multiple combatant commands. The newest armed service, established in 2019 during President Trump’s first term, is part of the Department of the Air Force, which also had the authority for recommending where to base Space Command’s permanent headquarters.

“US Space Command stands ready to carry out the direction of the president following today’s announcement of Huntsville, Alabama, as the command’s permanent headquarters location,” SPACECOM wrote on its official X account.

Military officials in the first Trump administration considered potential sites in Colorado, Florida, Nebraska, New Mexico, and Texas before the Air Force recommended basing Space Command in Huntsville, Alabama, on January 13, 2021, a week before Trump left office.

Members of Colorado’s congressional delegation protested the decision, suggesting the recommendation was political. Trump won a larger share of votes in Alabama in 2016, 2020, and 2024 than in any of the other states in contention. On average, a higher percentage of Colorado’s citizens cast their votes against Trump than in the other five states vying for Space Command’s permanent headquarters.

Trump’s reasons

Trump cited three reasons Tuesday for basing Space Command in Alabama. He noted Redstone Arsenal’s proximity to other government and industrial space facilities, the persistence of Alabama officials in luring the headquarters away from Colorado, and Colorado’s use of mail-in voting, a policy that has drawn Trump’s ire but is wholly unrelated to military space matters.

“That played a big factor, also,” Trump said of Colorado’s mail-in voting law.

None of the reasons for the relocation that Trump mentioned in his remarks on Tuesday explained why Alabama is a better place for Space Command’s headquarters than Colorado, although the Air Force has pointed to cost savings as a rationale for the move.

A Government Accountability Office (GAO) investigation concluded in 2022 that the Air Force did not follow “best practices” in formulating its recommendation to place Space Command at Redstone Arsenal, leading to “significant shortfalls in its transparency and credibility.”

A separate report in 2022 from the Pentagon’s own inspector general concluded the Air Force’s basing decision process was “reasonable” and complied with military policy and federal law, but criticized the decision-makers’ record-keeping.

Former President Joe Biden’s secretary of the Air Force, Frank Kendall, stood by the recommendation in 2023 to relocate Space Command to Alabama, citing an estimated $426 million in cost savings due to lower construction and personnel costs in Huntsville relative to Colorado Springs. Since then, however, Space Command has achieved full operational capability at Peterson Space Force Base, Colorado.

Now-retired Army Gen. James Dickinson raised concerns about moving Space Command from Colorado to Alabama. Credit: US Space Force/Tech. Sgt. Luke Kitterman

Army Gen. James Dickinson, head of Space Command from 2020 until 2023, favored keeping the headquarters in Colorado, according to a separate inspector general report released earlier this year.

“Mission success is highly dependent on human capital and infrastructure,” Dickinson wrote in a 2023 memorandum to the secretary of the Air Force. “There is risk that most of the 1,000 civilians, contractors, and reservists will not relocate to another location.”

One division chief within Space Command’s plans and policy directorate told the Pentagon’s inspector general in May 2024 that they feared losing 90 percent of their civilian workforce if the Air Force announced a relocation. A representative of another directorate told the inspector general’s office that they could say “with certainty” only one of 25 civilian employees in their division would move to a new headquarters location.

Officials at Redstone Arsenal and information technology experts at Space Command concluded it would take three to four years to construct temporary facilities in Huntsville with the same capacity, connectivity, and security as those already in use in Colorado Springs, according to the DoD inspector general.

Tension under Biden

Essentially, the inspector general reported, officials at the Pentagon made cost savings their top consideration in where to garrison Space Command. Leaders at Space Command prioritized military readiness.

President Biden decided in July 2023 that Space Command’s headquarters would remain in Colorado Springs. The decision, according to the Pentagon’s press secretary at the time, would “ensure peak readiness in the space domain for our nation during a critical period.” Alabama lawmakers decried Biden’s decision in favor of Colorado, claiming it, too, was politically motivated.

Space Command reached full operational capability at its headquarters at Peterson Space Force Base, Colorado, two years ahead of schedule in December 2023. At the time, command leaders said they could only declare full operational capability once a permanent headquarters had been selected.

Now, a year and a half later, the Trump administration will uproot the headquarters and move it more than 1,000 miles to Alabama. But it hasn’t been smooth sailing for Space Command in Colorado, either.

A new report by the GAO published in May said Space Command faced “ongoing personnel, facilities, and communications challenges” at Peterson, despite the command’s declaration of full operational capability. Space Command officials told the GAO the command’s posture at Peterson is “not sustainable long term and new military construction would be needed” in Colorado Springs.

Space Command was originally established in 1985. The George W. Bush administration later transferred responsibility for military space activities to the US Strategic Command, as part of a post-9/11 reorganization of the military’s command structure. President Trump reestablished Space Command in 2019, months before Congress passed legislation to make the Space Force the nation’s newest military branch.

Throughout its existence, Space Command has been headquartered at Peterson Space Force Base in Colorado Springs. But now, Pentagon officials say the growing importance of military space operations and potentially space warfare requires Space Command to occupy a larger headquarters than the existing facility at Peterson.

Peterson Space Force Base is also the headquarters of North American Aerospace Defense Command, or NORAD, US Northern Command, and Space Operations Command, all of which work closely with Space Command. Space Command officials told the GAO there were benefits in being co-located with operational space missions and centers, where engineers and operators control some of the military’s most important spacecraft in orbit.

Several large space companies also have significant operations or headquarters in the Denver metro area, including Lockheed Martin, United Launch Alliance, BAE Systems, and Sierra Space.

In Alabama, ULA and Blue Origin operate rocket and engine factories near Huntsville. NASA’s Marshall Space Flight Center and the Army’s Space and Missile Defense Command are located at Redstone Arsenal itself.

The headquarters building at Peterson Space Force Base, Colorado. Credit: US Space Force/Keefer Patterson

Colorado’s congressional delegation—six Democrats and four Republicans—issued a joint statement Tuesday expressing their disappointment in Trump’s decision.

“Today’s decision to move US Space Command’s headquarters out of Colorado and to Alabama will directly harm our state and the nation,” the delegation said in a statement. “We are united in fighting to reverse this decision. Bottom line—moving Space Command headquarters weakens our national security at the worst possible time.”

The relocation of Space Command headquarters is estimated to bring about 1,600 direct jobs to Huntsville, Alabama. The area surrounding the headquarters will also derive indirect economic benefits, something Colorado lawmakers said they fear will come at the expense of businesses and workers in Colorado Springs.

“Being prepared for any threats should be the nation’s top priority; a crucial part of that is keeping in place what is already fully operational,” the Colorado lawmakers wrote. “Moving Space Command would not result in any additional operational capabilities than what we have up and running in Colorado Springs now. Colorado Springs is the appropriate home for US Space Command, and we will take the necessary action to keep it there.”

Alabama’s senators and representatives celebrated Trump’s announcement Tuesday.

“The Air Force originally selected Huntsville in 2021 based 100 percent on merit as the best choice,” said Rep. Robert Aderholt (R-Alabama). “President Biden reversed that decision based on politics. This wrong has been righted and Space Command will take its place among Huntsville’s world-renowned space, aeronautics, and defense leaders.”

Democratic Colorado Gov. Jared Polis said in a statement that the Trump administration should provide “full transparency” and the “full details of this poor decision.”

“We hope other vital military units and missions are retained and expanded in Colorado Springs. Colorado remains an ideal location for future missions, including Golden Dome,” Polis said, referring to the Pentagon’s proposed homeland missile defense system.


Stephen Clark is a space reporter at Ars Technica, covering private space companies and the world’s space agencies. Stephen writes about the nexus of technology, science, policy, and business on and off the planet.
