Author name: DJ Henderson

critical-citrixbleed-2-vulnerability-has-been-under-active-exploit-for-weeks

Critical CitrixBleed 2 vulnerability has been under active exploit for weeks

A critical vulnerability allowing hackers to bypass multifactor authentication in network management devices made by Citrix has been actively exploited for more than a month, researchers said. The finding is at odds with advisories from the vendor saying there is no evidence of in-the-wild exploitation.

Tracked as CVE-2025-5777, the vulnerability shares similarities with CVE-2023-4966, a security flaw nicknamed CitrixBleed, which led to the compromise of 20,000 Citrix devices two years ago. The list of Citrix customers hacked in the CitrixBleed exploitation spree included Boeing, Australian shipping company DP World, Commercial Bank of China, and the Allen & Overy law firm. A Comcast network was also breached, allowing threat actors to steal password data and other sensitive information belonging to 36 million Xfinity customers.

Giving attackers a head start

Both CVE-2025-5777 and CVE-2023-4966 reside in Citrix’s NetScaler Application Delivery Controller and NetScaler Gateway, which provide load balancing and single sign-on in enterprise networks, respectively. The vulnerability causes vulnerable devices to leak—or “bleed”—small chunks of memory contents after receiving modified requests sent over the Internet.

By repeatedly sending the same requests, hackers can piece together enough data to reconstruct credentials. The original CitrixBleed had a severity rating of 9.8. CitrixBleed 2 has a severity rating of 9.2.

Citrix disclosed the newer vulnerability and released a security patch for it on June 17. In an update published nine days later, Citrix said it was “currently unaware of any evidence of exploitation.” The company has provided no updates since then.

Researchers, however, say that they have found evidence that CitrixBleed 2, as the newer vulnerability is being called, has been actively exploited for weeks. Security firm Greynoise said Monday that a search through its honeypot logs found exploitation as early as July 1. On Tuesday, independent researcher Kevin Beaumont said telemetry from those same honeypot logs indicates that CitrixBleed 2 has been exploited since at least June 23, three days before Citrix said it had no evidence of such attacks.

Citrix’s failure to disclose active exploitation is only one of the details researchers say was missing from the advisories. Last week, security firm watchTowr published a post titled “How Much More Must We Bleed? – Citrix NetScaler Memory Disclosure (CitrixBleed 2 CVE-2025-5777).” It criticized Citrix for withholding indicators that customers could use to determine if their networks were under attack. On Monday, fellow security firm Horizon3.ai said much the same thing. Company researchers wrote:

Critical CitrixBleed 2 vulnerability has been under active exploit for weeks Read More »

on-alpha-school

On Alpha School

The epic 18k word writeup on Austin’s flagship Alpha School is excellent. It is long, but given the blog you’re reading now, if you have interest in such topics I’d strongly consider reading the whole thing.

One must always take such claims and reports with copious salt. But in terms of the core claims about what is happening and why it is happening, I find this mostly credible. I don’t know how far it can scale but I suspect quite far. None of this involves anything surprising, and none of it even involves much use of generative AI.

Rui Ma here gives a shorter summary and offers takeaways compatible with mine.

  1. What Is It?

  2. What It Isn’t.

  3. Intrinsic Versus Extrinsic Motivation.

  4. High Versus Low Structure Learners.

  5. I’ve Got a Theory.

  6. Is This Really The True Objection?

This is essentially goal factoring that combines several known effective techniques.

In particular:

  1. Spaced repetition and mastery, require full learning without letting things drop.

  2. Immediate problem sets with immediate feedback and explanation.

  3. Tracking clicks and eye focus and providing feedback on that too.

  4. Gamified reward systems for atomic learning actions, paid prizes.

  5. 1-on-1 attention upon request, 5-to-1 overall student-teacher ratio.

  6. Short bursts with breaks.

  7. Flexibility on what kids do when within academics and freedom to push ahead.

  8. Not wasting time on things you don’t care about, getting rid of bad methods.

  9. Within that framework, find reasonable educational software, use it.

  10. Afternoon projects and tasks always involve concrete and measurable output. ‘Check charts’ give bigger missions to do that gate ability to advance grade levels, to get kids used to longer harder things and developing agency.

You get all the academics in during the morning, and advance much faster than normal. Then you have the afternoon left to do whatever you want, including filling in any ‘soft’ skills you decide were important. You don’t try to do it all in some sort of all-purpose Swiss-army-knife lecture classroom and pretend it’s not pre-Guttenberg.

Most time is spent learning on a computer, watching videos and doing problems, but if you ever need help you can ask for a teacher, and if you ever struggle they bring one in for you. There’s still a lot of human attention going into all of this.

Does it work for academics? This ia a very skin-in-the-game way to assert that it does, and reports all say that it does, regardless of exactly how well:

The school’s “100% Money Back guarantee” is that every student who attends will be in the top 1% academically and win at least one national academic competition (for kids who start in kindergarten they guarantee 1350+ SAT and 5s on APs by 8th grade).

You can and should worry that they are effectively teaching to various tests or focusing narrowly on subsets of academics, but regular school does that a lot too, the entire afternoon is free for other things, and also there is a fixed amount that you can present good results via test optimization. You can’t get years worth of extra results. MAP Growth Speed findings work the same way, at some point it can’t be faked.

Spaced repetition works wonders as does ensuring mastery, and being able to customize for what each individual child actually needs right now. A giant portion of time spent in school is obviously wasted by failure to reinforce or by teaching things that aren’t relevant or by simple boredom and so on. Immediate feedback is huge.

Selection effects also are important here, on various levels, but I think these results are bigger than that can account for, by a lot.

Is all of this optimal? Not even close, but it doesn’t have to be. The baseline it is up against is downright pathetic.

This include most expensive private schools, like the horror story example that is the first section of this review. They work hard for the right to pay tens of thousands a year to get a peer group of others that did the same, and in exchange get a school entirely uninterested in teaching their student anything that can be verified or that the patents value. When they challenge the school, the school threatens to kick them out in return.

That’s the standard you’re up against.

So there is no reason that the core of this wouldn’t work, or wouldn’t scale, once a given student gets to the point they can participate. Performance would fall off somewhat as you lose other advantages, like selection and buy-in and the ability to bid higher for a mostly better distribution of teachers, but all of that seems easily survivable.

The only part whose effectiveness seems non-obvious and the system might fail to scale is the incentive program, the gamified rewards, and the possibility that this would fail as motivation for a lot of or even most students. I’ll tackle that in a bit.

It would work and scale even better if you incorporate generative AI. Certainly most of the time that one is ‘stuck’ in these situations, generative AI can help you a lot in becoming unstuck, or letting kids follow their curiosity in various ways. You can (if we can’t find a better answer) disable or monitor AI use during assessments.

This isn’t a way to save money or hire fewer teachers, but I notice this is weird at least for the morning portion? Shouldn’t it be that, if they want that?

If a pattern of stumbles appears the system will automatically task the student to book a “coaching call” with a remote teacher (most of these teachers seem to be based in Brazil). Kids can also choose to self-book calls with the “coaches” at any time.

Today she booked it at 11: 10 and had the call at 11: 15, but she said once it took her two days to get the meeting. I asked her how often she has a call and she said less than once a day, but more than once a week.

Thus, the remote teachers can’t possibly be that large a part of the 5:1 ratio, and presumably are not expensive. This also points to a potential improvement, since an in-person tutoring session would be more effective when possible. The physically present teacher should be able to handle a lot of kids at once during academic time if they are all on their computers.

Thus the 5:1 ratio must be coming from the afternoon activities, which is cool but presumably optional. The system works without it. The actual marginal costs here for an additional student that matter should be quite low.

It also isn’t aristocratic tutoring. I am very confident that aristocratic tutoring, as in constant 1-on-1 attention from a variety of experts, is the most effective strategy available if you have the resources to do it and you combine it with other best practices like spaced repetition. This is an attempt to get a lot of the benefits of that without the associated extremely high costs. I would also expect incorporating generative AI to help move us further in this direction.

What are you giving up from the ‘traditional’ school experience?

From what I can tell you are mostly giving up ‘classes,’ meaning lectures where a group of kids sit in desks and listen to someone talk with some amount of interaction. Which, again, seems like an obvious terrible way to accomplish essentially anything? If you think that the interactions within that setting are somehow socially important or incidentally teach other skills other than how to sit still, obey and be bored for extended periods, in a way that can’t be made up for several times over with a free afternoon for other things, I notice I am confused why you would think this.

If you do think the main goal of school is to learn to sit still and be bored and quiet and obey, well, okay then, Alpha School should not be your top choice, but I am confused why you would want that in 2025.

It also is not a way to avoid screen time, since the academics are done on a device. If you think that this is intrinsically important, that is an issue. My model of ‘screen time’ is that it depends what it is for and how it works, at least once you’re into primary school, so it is not an issue.

It also isn’t a way to ensure that all children learn ‘equally’ or ‘equitably,’ to prevent kids from learning too much too fast (oh no?!) or learn someone’s preferred ideological beliefs. Again, different goals. If those are your goals, then Alpha School is not for you.

However, even if you did in theory want to ensure equal or equitable learning outcomes, as in you actively did not want kids to learn, then this is still great news. Because this lets everyone learn faster, ensuring everyone gets to the same target. Then, if some kids might learn too much, you can make them stop learning. Also, check your uniforms. I think there might be skulls on them.

They sell a home school version of Alpha School for on the order of $10k/year. It does not work as well. The post attributes this difference mostly to the lack of AlphaBucks. As in, everything about this being at a school mostly doesn’t matter, except for there being an adult to watch the kid, and for the AlphaBucks.

The secret ingredient is not crime. It is AlphaBucks, paid for good performance.

Which is, for mostly bad reasons, less popular than crime.

Alpha schools have their own in-house currency. Alpha has “Alpha bucks”; GT School has “GT bucks”. My understanding is that they work a little differently on each campus, but the overall philosophy is the same. This review will focus on the details of the GT system since it is what I know best.

If the students complete their 2-hour learning “minimums” each day they earn about 10 GT Bucks. They get additional bonuses for every lesson they complete beyond their minimums. They also get a bonus if they finish their minimums within the scheduled time (vs going home and doing them later), additional bonuses if the entire class completes their minimums during the allotted time, and weekly bonuses for hitting longer term targets.

They only get credit if they both complete their lessons AND get 80% or higher on the problem sets within the lesson. If they get 79% they still move on (with the questions they missed coming back later for review), but they don’t get the GT bucks associated with the lesson (this stops gaming where the kids rush through the lessons just to get “bucks”)

A GT buck is worth 10-cents. So if they are really pushing a kid could be earning roughly $2 per day.

Fryer paid kids to read books, GT pays kids to do lessons.

Once a kid has earned a collection of GT bucks they can spend those bucks at the GT-store. The Alpha store has a wide selection of offerings. The GT store, because it is a much smaller school, is more like a catalog.

The kids are then described as using various strategies. Some spend as they go. Others save up for big purchases, or save indefinitely.

All reports are that it worked.

We tried getting the kids to work on it for about an hour per day, but it was a fight every time. It was the same content they would be doing at GT, but without the GT structure, and it did not work.

But once the kids started at GT, those same iXL lessons became a game for them. I remember taking the kids to the park one day after school. They asked me, “Instead of playing can you set up a hotspot so we can do a few more lessons? I want to earn more GT-Bucks!”.

Was it bad that they were being bribed to do lessons? 76% of Americans would think so. But it definitely worked.

My middle daughter – who is the most driven by money – has completed more than two full grades of school in ~20-weeks (60% of the school year), and shows no signs of slowing down.

I believe the reports. My experience with my own children, and my own experience both now and as a child, and as a game designer, and everything else I have seen, lines up with this.

I’ve seen it work with screen time. I’ve seen it work with ‘reasonable requests.’ I’ve seen it work with daily login bonuses, including when the prize is nothing except a message. I’ve seen it work with essentially anything, no matter how little objective value is involved. Gamification works when you set your variables correctly. Everyone loves a prize and recorded incremental progress.

Another objection is that you need peer groups as part of motivation. Well, Alpha School still has that, you can absolutely compare what you are doing to others, talk to peers and so on. I don’t see the problem here.

The better objection is the idea that extrinsic motivation will destroy intrinsic motivation. Once you pay them to learn, the theory goes, they won’t want to learn except to get paid. That is a highly reasonable theory. There is a long literature of extrinsic motivation crowding out intrinsic. The article cites other literature saying that paying can lead to building up habits and intrinsic motivation over time, and that the program seems to work.

I want to specifically address the objection that some learners are ‘high structure,’ and therefore need the very classrooms that bore the rest of us to tears and waste large portions of our childhood, but which somehow it would still supposedly be wrong to free the ‘low structure’ learners from too early.

Alpha School very obviously provides a different but clearly very high structure. If what students need is structure, a firm hand, a particular thing to do next, and to be kept on track? Very obviously that is not going to be where this project falls apart.

The standard theory, as I understand it, is that the reason for undermining motivation is when the reward undermines locus of control, and the reward you offer is now seen as the reason for the behavior, and that implementation details matter a lot.

I notice that gamification of rewards helps retain locus of control. The kid is the one in charge of getting and allocating the rewards, so they feel in control of the process.

I also notice myself thinking about it in this way, too:

  1. Extrinsic motivation to do [X] destroys inherent intrinsic motivation to do [X].

  2. Extrinsic motivation to do [X] does not destroy motivation to do [X] to get [Y].

Or, in English:

  1. If I pay you to do something inherently fun it will become less inherently fun.

  2. If I pay you to do useful things where you see their value, you develop habits and learn they are useful. So you will keep doing it even after I stop paying you.

Why? Because the brain is not stupid, and this is not all about crowding out or locus of control, although all three things are closely related.

If I pay you to do something that you previously did because it was fun, then you are now doing it in ways and quantities that are not fun. You break the association. So the brain learns that the activity is not fun, on top of the locus of control issue, and the habit is that you do this because you have to and it isn’t fun. Your motivation dies.

If I pay you to do something because it works, then you do it because you are paid, but then you notice that it works (or even that it is fun because I set it up to be fun and then paid you to do it that way), and that this is also a good reason. You learn to do it for two reasons, and you notice that you’re doing it because of the results not only because of the payments. Then, when I take the money away, you’ll know how to do it, you’ll have the habit of doing it and it paying off, and thus you’ll want to keep doing it.

I also noticed, upon asking for research reports on the question, that what Alpha School is doing mirrors all of the ‘get the implementation details right’ results from the literature:

  1. Rewarding fundamental behaviors works better than rewarding test performance.

  2. Rewards work well for drill-style efforts, and are destructive for fun activities.

  3. Immediate rewards outperform delayed rewards, note that they give the AlphaBucks on the spot even though the cashed in reward may be delayed.

  4. Tying rewards to specific competence standards enhances intrinsic motivation.

  5. When rewards provide information rather than controlling behavior, they enhance motivation. The implementation details do this here.

  6. Competence support demands appropriate challenges, clear success criteria, and informational feedback.

  7. Autonomy-supportive delivery is crucial for any reward system. Here the child determines how to cash out the reward, and what order to do activities in.

Then on top of that we have the gamification aspects. So there are still implementation dangers, but this seems like very clearly good design.

This reinforces that we have every reason to expect the AlphaBucks system, as described, to work, even though other incentive systems sometimes backfire.

Paying has a bad rap partly for silly moralistic reasons, and largely because most of the time such systems get implemented poorly. In particular, the most common place we pay people to do things is jobs and work, and there we often implement in a way that destroys motivation, especially via paying for time or other billables. That’s bad design. AlphaBucks is good design.

It keeps becoming increasingly obvious that we can make massive Pareto improvements over classical school. This is the most glaring example. The only big disadvantage that actually matters is that it remains expensive, but that will improve over time, and for what you get it is already a fantastic deal.

Marginal costs for the active ingredients should be low, including for the homeschool package where there seem like clear paths to fix the motivational issues (as in, to introduce AlphaBucks, likely via creating virtual pods of associated students, which also helps with other things and seems like an obvious play once you scale enough for this.)

One can contrast this positive vision with the extensive defense of the current school system that was the next review in the ACX contest, where it is explained that all of you people thinking schools look like the kids sit around all day not learning don’t have the proper context to understand why it all actually makes sense. Because school, you see, isn’t designed to maximize learning, it is designed to maximize motivation, whereas ‘individualized learning has failed’ because it is not motivating without other students.

Here’s their own actual positive case:

What if we were brutally honest when a family enrolls their child in school? Here’s what we would say:

  1. If your child is a no-structure learner, they will be bored here. They will probably learn some things, but they will often sit in lessons where they know everything the teacher is teaching, and they’ll spend a lot of their time sitting around waiting for other students to catch up.

  2. If your child is a low-structure learner, they will still often be bored as our school isn’t very efficient, but the structure and routine will ensure they get a basic level of literacy and numeracy. Maybe they’ll like school, probably because of gym class and being around their friends, maybe they won’t, but they’ll learn some things.

  3. That said, the school you pick doesn’t matter too much. Your child will learn about as much anywhere else. If your child is a high-structure learner, they will need a lot of very structured teaching.

  4. Our teachers vary widely: some are good at providing that structure, others aren’t. Your child will gradually fall behind, and will perpetually feel a bit dumb and a bit slow compared to everyone else. But we will do our best to keep them moving along with their peers because that’s the best idea we have to motivate them.

  5. Hopefully, with some help, they’ll graduate high school on time. There’s a risk they just won’t have the skills, or they’ll be discouraged by constantly feeling dumb and just give up.

  6. Oh, and we aren’t very good at understanding what causes students to be motivated. It’s absolutely correlated with socioeconomic status, so it would be helpful if you’re rich, but there’s a lot of variability and plenty of rich kids need that structure too.

That’s the case from the person who thinks school is designed properly? That’s what you want to do with childhood?

Burn. It. With. Fire.

(Or, ideally, if we keep our heads, reenact Cool Guys Don’t Look At Explosions.)

What good is the hypothesis that school is designed to maximize motivation? It can help us understand all sorts of phenomena.

I often hear an argument from homeschoolers that they can accomplish in two hours a day (or some other small amount of time) what schools do in seven or eight. I don’t doubt that at all. Schools aren’t particularly efficient at facilitating learning. Schools are good at educating everyone at once.

So why would anyone with the means to not do so send their child to such a thing?

You might think that we’ve found the solution to tracking. We just need to get all the no- and low-structure learners together and let them move much faster.

Here’s the issue. The no-structure learners will always be bored, as long as we are committed to putting them into classrooms where everyone learns the same thing. And those classrooms where everyone learns the same thing are exactly what the low-structure learners need.

As soon as you create a higher track there will be a ton of demand for it. Parents will insist that their kid join. And as it grows, it won’t be able to accelerate very quickly. You still need the structure of a classroom where everyone is learning the same thing, and that just isn’t a very efficient way to teach.

So, now hear me out… don’t use classrooms that require this? These are no-structure learners, who by your own admission will always be bored in your classes, so don’t impose your god damned stupid class structure on them at all?

Or, if you can’t do that in context, and again hear me out… create different tracks, use tests as gates for them, and if the kid can’t hack the one moving quickly, move them out of the track into another one that they can handle?

And what about all the reports that Montessori does motivation way better than standard school systems, if you are not trying to do a full revolution?

Tracking is necessary in high school because students diverge too much (despite forcing them not to beforehand) but definitely fails earlier because of reasons, despite all the parents favoring it and everyone involved constantly saying it works (and my understanding of the research also saying that it very clearly works)?

I would also ask the author, so if Alpha School’s methods did successfully motivate students to learn, would you then have everyone switch over? If not, why not?

There were constant assertions of what we can’t do or doesn’t work, including all of ‘personalization,’ without evidence. The piece is infuriating throughout. It did not update me the way the author intended.

After all of this, am I going to consider Alpha School New York? Absolutely. I went to schedule a tour, although they don’t seem to have anything available until October. I do notice that one thing that wasn’t discussed were behavioral issues that might interfere with getting the child to use the software. But also I notice that children with behavioral issues usually are happy to get into using software, so this could easily be a much lower difficulty problem.

Discussion about this post

On Alpha School Read More »

oldest-wooden-tools-in-east-asia-may-have-come-from-any-of-three-species

Oldest wooden tools in East Asia may have come from any of three species

That leaves a few possibilities: Denisovans, Homo heidelbergensis (the common ancestor of Neanderthals, Denisovans, and our species), or Homo erectus. All three species could have lived in the area at the time. But nobody at Gantangqing left behind any convenient, readily identifiable bones along with their wooden tools, stone tools, and butchered animal bones (so inconsiderate of them), making it hard to pin down exactly which species these 300,000-year-old hunter-gatherers belonged to.

Homo erectus had been in Asia for more than a million years by the time Gantangqing’s lakeshore was occupied; the oldest Homo erectus fossils in Asia are from Indonesia and date back 1.8 million years. They also stuck around until quite recently. In caves at a site called Zhoukoudian, outside Beijing in eastern China, Homo erectus remains date to sometime between 700,000 and 200,000 years ago (there’s still a lot of debate on exactly how old the site is).

All of that means that Homo erectus’ presence in the region overlaps the age of the wood tools at Gantangqing. And the stone tools found nearby are fairly simple cores and flakes that don’t rule out Homo erectus as their makers. Archaeologists haven’t unearthed evidence of Homo erectus making or using sophisticated wooden tools like this, but for a species that managed to harness fire and cross miles of ocean, it’s not too wild a speculation.

On the other hand, we know that Denisovans were probably in the area, too, or at least not too far away. A recently identified Denisovan skull from Harbin, China, is 146,000 years old but bears a striking resemblance to other hominin skulls from sites all over China, which range from 300,000 to 200,000 years old. And making finely crafted wooden tools fits with everything we know about Denisovan capabilities.

Then there’s Homo heidelbergensis, the direct ancestor of Denisovans. In fact, it’s a little hard to tell where hominins stop being Homo heidelbergensis and start being Denisovans, or even whether the distinction matters. It’s a problem paleoanthropologists refer to as the “muddle in the Middle,” since both species date to the Middle Pleistocene. So if Homo erectus and Denisovans are in the running, so is Homo heidelbergensis, by default.

And unless someone finds a telltale skull nearby or another very similar toolkit at a site with telltale skulls to consult, we may not know for sure.

Science, 2023. DOI: 10.1126/science.adr8540  (About DOIs).

Oldest wooden tools in East Asia may have come from any of three species Read More »

the-curious-rise-of-giant-tablets-on-wheels

The curious rise of giant tablets on wheels


Not quite a TV, not your average tablet

Hands-on with KTC’s 32-inch Android tablet on a rolling pedestal, the A32Q7 Pro.

KTC MegPad 32-inch Android Tablet (A32Q7 Pro)

KTC’s MegPad 32-inch Android Tablet (A32Q7 Pro). Credit: Scharon Harding

KTC’s MegPad 32-inch Android Tablet (A32Q7 Pro). Credit: Scharon Harding

Over the past few years, LG has set off a strange tech trend that’s been rolling onto devices sold across Amazon and other online electronics retailers.

In 2022, the company launched the StanbyME, which is essentially a $1,000 27-inch tablet running LG’s smart TV operating system (OS), webOS, but lacking a tuner. LG’s press release announcing the device described it as a “wireless private TV screen with a built-in battery” that is easily portable and ideal for watching shows and movies, in addition to  “video conferencing with family and coworkers and viewing online lectures.”

Today, the StanbyME competes against a slew of similar devices, including some from Samsung, but mostly from smaller brands and running Android.

I’ve had one of these devices, the KTC MegPad 32-inch Android Tablet (A32Q7 Pro), rolling around my home for a few weeks, and I’m left curious about what’s driving the growth of StanbyME-like devices, which are noticeably niche and expensive. I’m also uncertain whether these hybrid devices have an ongoing place in a consumer tech world already inundated with big-screen TVs, small-screen tablets, and beloved laptops.

Hands-on

Unlike LG’s StanbyME, KTC’s device doesn’t run a smart TV OS. Instead, it’s a 32-inch Android 13 tablet. Still, KTC heavily markets the MegPad’s ability to serve as streaming hardware, and that’s one of the best uses I found for it.

A big ol’ tablet on wheels. Scharon Harding

Treating the MegPad like a smart TV on wheels meant I could have a living-room-like experience in more places throughout my home. I could watch TV in bed with a more visible screen set at a more comfortable distance than what I’d achieve with a laptop or tablet. It also meant flexibility. I don’t like having a permanent TV in my room (how would I ever get out of bed?), so I appreciated the ability to roll the MegPad out of my room or twist it so that the screen faced away from me.

The MegPad is also a diplomatic solution for homes with limited TVs or computers. This could be helpful for homes with kids with varied interests or in my home, where a speedy, 55-inch TV in the living room is the best screen available by far. I was able to let my partner take the big screen for gaming and still hang out nearby while streaming on the MegPad. I don’t have a central coffee table in my living room, but the mobile tablet enabled me to watch shows without a device weighing down my lap or making me connect a wireless speaker for better volume.

KTC’s device also has a helpful leg-up over LG’s StanbyME via its HDMI port, which makes the MegPad work like a regular monitor. Determining where to safely rest a device tethered to this mobile machine is something you’ll have to figure out on your own, though.

KTC MegPad 32-inch Android Tablet (A32Q7 Pro)

The port selection on the panel’s backside.

Credit: Scharon Harding

The port selection on the panel’s backside. Credit: Scharon Harding

Compared to the TV mounted on my living room wall, the MegPad is much easier to move from room to room, but it’s easy to overestimate how seamless transporting it is. Yes, it’s on a set of five 360-degree wheels, but the wheels don’t lock, and the device weighs 40.3 pounds, per its Amazon listing. That means I had to exert a decent amount of effort to move it over floor transition strips, across uneven floors, and from hardwood to carpet.

KTC MegPad 32-inch Android Tablet (A32Q7 Pro)

The charging port and power button are on the stand’s base.

Credit: Scharon Harding

The charging port and power button are on the stand’s base. Credit: Scharon Harding

A fully rotating screen, however, makes up for some of my mobility complaints and diversifies the MegPad’s potential uses. Besides streaming, for example, the MegPad was great for watching yoga videos online, (which calls for viewing the screen from different heights and positions). It also proved to be an ideal setup for creating a large, print-out collage, which included a lot of dragging, dropping, and cropping of images.

How the MegPad moves.

How the MegPad moves.

How the MegPad moves. Credit: KTC

Not a real TV

You can do a lot with a sizeable Android tablet. But with TV and movie watching being some of the most obvious uses, it’s important to note that neither the MegPad nor any of its rollable rivals are real TVs.

For one, there’s no tuner, though in the streaming world, that matters less to many of today’s TV viewers.

Further, the MegPad, like many StanbyME-like devices, uses Android 13, which doesn’t require paying vendor licensing fees like built-for smart TV OSes, such as Android TV/Google TV and webOS, would. There are some benefits to that, though.

To start, Android 13 doesn’t have the integrated ads that Android TV or the Google TV interface does. Google claims that the Google TV platform doesn’t use automatic content recognition (ACR), but as Consumer Reports has noted, Google collects “data from TVs that use its smart TV platform—and there’s no opting out of Google’s policies during setup if you want smart TV functionality.” Further, Google may combine that data with user data from third parties for advertising purposes. A spokesperson for KTC confirmed to me that the MegPad doesn’t use ACR.

As a tablet, the MegPad is compatible with more apps, many of which aren’t supported by Google TVs, like Google Sheets, Microsoft Word, Reddit, and Signal.

Android tablets are also more appropriate for storing documents, photos, and other files than smart TVs are. Although it’s likely less roomy than your PC, the MegPad has 128GB of internal storage.

But since this is an Android tablet and not a Google TV, there are no integrated channels and no live-TV-only option, which stops the device from collecting diagnostic information. Google TV would also include a more streaming-friendly user interface and the ability to watch content from different streaming providers without switching apps.

Further differing from LG’s StanbyME and real TVs, the MegPad doesn’t include a traditional remote. The tablet comes with a basic Bluetooth mouse, but due to the tablet’s portability, I frequently used the tablet without a flat surface within arm’s reach available for comfortable mouse control. The touchscreen is reliable, but gestures can be cumbersome on a tablet this large, and the display was often out of my hand’s reach.

KTC MegPad 32-inch Android Tablet (A32Q7 Pro)

The tablet comes with this mouse and removable mouse stand.

Credit: Scharon Harding

The tablet comes with this mouse and removable mouse stand. Credit: Scharon Harding

The new portable TV?

With TVs getting larger and people turning to portable gadgets like phones and laptops for TV watching, true portable TVs have become a rarity. Demand for a small device dedicated to on-the-go TV viewing has dropped significantly since the last century. Meanwhile, fabs and supply chains are built around monitor and TV-sized displays, making it difficult to incorporate some of the most desirable display technologies, like OLED, into smaller-sized panels with competitive prices.

As a result, devices like the MegPad and Amazon’s Echo Show have become the new de facto stand-ins for portable TVs, even though they’re not true TV sets. Even LG’s StanbyME Go, a 27-inch webOS-powered display packed into a briefcase, is a far cry from what most of us would traditionally consider a portable TV.

LG StanByMe Go at a picnic

LG’s StanbyMe GO.

Credit: LG

LG’s StanbyMe GO. Credit: LG

Again, these tablets have more versatility than the small, telescoping-antenna-equipped boxes you used to stick on your kitchen counter or hand to a hyper kid during road trips. But they also require a reliance on Big Tech software and all the privacy and ethical implications that come with that.

From left to right: Casio EV 570, Sony Watchman, and Casio EV 660.

You don’t see many of these anymore. From left to right: Casio EV 570, Sony Watchman, and Casio EV 660.

You don’t see many of these anymore. From left to right: Casio EV 570, Sony Watchman, and Casio EV 660. Credit: Richard Derk/Los Angeles Times via Getty Images

KTC also sees the MegPad’s appeal as a pseudo-TV. The MegPad’s product page emphasizes users’ ability to “watch favorite shows/movies directly—no PC needed” and to “stream Netflix [and] YouTube… more effortlessly on your smart TV.” Its Amazon product page also promotes the keywords “portable TV,” “rolling TV,” “mobile TV,” and “standing TV.” This is all despite the MegPad not technically being a true TV.

“KTC defines the MegPad A32Q7Pro as a portable, smart, touchscreen monitor,” KTC’s spokesperson told me. “It combines key traits of a smart display and a large-screen tablet. While it shares some features with smart TVs, tablets, and monitors, it doesn’t fully belong to any single traditional category. It’s a hybrid device designed to bridge those use cases.”

Android tablets on wheels

Many devices like the MegPad represent a push for more Android-powered, non-Google devices that has been buoyed by a program that Google launched in 2022, the Enterprise Devices Licensing Agreement (EDLA).

As explained by partners like BenQ, EDLA is a way for third parties to incorporate Google Mobile Services (GMS), which are Google’s most commonly used apps and APIs bundled for use across different types of devices. GMS apps include popular software like Google Drive, Gmail, the Google Play Store, and YouTube.

“Previously, GMS was only officially available for smartphones, tablets, TVs, and wearables. Under the new EDLA, the list of devices eligible for GMS certification has now been expanded to include enterprise solutions such as smart boards,” a blog from BenQ, which has EDLA-certified smart displays, reads.

Since 2022, (the year LG’s StanbyME launched), there has been an uptick in non-Google devices with this EDLA certification. One of the categories taking advantage of the newer program is tablets on wheels, like the MegPad and similar options from Kefeya, Apolosign, Innocn, and DuraPro.

Demonstrating the marketing value of EDLA certification, the MegPad’s product page reads: “Google EDLA certification provides secure, direct access to Google services and the Google Play Store with regular updates, offering greater stability and data protection than open app ecosystems with unverified apps.”

Most EDLA-certified devices seem to be interactive displays used for education. With EDLA certification, devices like the MegPad may also draw the attention of educators or even businesses. Meanwhile, Google is happy to hand out EDLA certifications, as they can drive Android adoption, giving Google more data and access to customers outside of the typical Android devices, such as phones. Products like the MegPad can also be easier to shop with (Google loves when people use its offerings to shop) than Android devices with smaller screens.

Who’s this for?

I’ve been fascinated by the MegPad and similar devices because they introduce a unique approach to streaming, web browsing, and productivity. But ultimately, they’re hard to recommend when there are other personal gadgets that are more affordable and often take up less space.

I had fun with the MegPad and appreciated the flexibility it offered, especially in my smaller NYC home. There are some specific use cases where products like this could excel, like if you want to bring a computer or screen into a room that doesn’t always need one. It was also helpful as an entertainment center for my father post-surgery, when he primarily had to lie on one side in bed.

Overall, the growing presence of devices like the MegPad underscores a confluence occurring between smart TVs, tablets, monitors, and smart displays. With software being forced into more types of displays, often in the interest of gathering more user data, it’s an interesting time to consider what you want from your next screen—be it computing power, a certain size, the omission or inclusion of web connectivity, and mobility.

It appears that the MegPad and similar tablets are trying to take advantage of the attention that LG garners when launching distinctive devices like its StanbyME line. Besides a StanbyME lookalike, Apolosign also makes a device similar to the StanbyME Go.

Apolosign's 27

Apolosign’s PackGo is very similar to LG’s StanbyME Go. Credit: Apolosign

Three years after LG made TV-esque devices on wheels a talking point, more brands are trying to roll into the market. That includes LG’s best TV frenemy, Samsung, which has been using the form factor in limited geographies to drive sales of “smart monitors.”

Tech brands have ulterior motives for pushing this newer form factor that go beyond filling a gap in consumer gadgets. But if a large tablet or small smart display with wheels fits your needs, the options are there, and they should meet most expectations.

Photo of Scharon Harding

Scharon is a Senior Technology Reporter at Ars Technica writing news, reviews, and analysis on consumer gadgets and services. She’s been reporting on technology for over 10 years, with bylines at Tom’s Hardware, Channelnomics, and CRN UK.

The curious rise of giant tablets on wheels Read More »

meta’s-“ai-superintelligence”-effort-sounds-just-like-its-failed-“metaverse”

Meta’s “AI superintelligence” effort sounds just like its failed “metaverse”


Zuckerberg and company talked up another supposed tech revolution four short years ago.

Artist’s conception of Mark Zuckerberg looking into our glorious AI-powered future. Credit: Facebook

In a memo to employees earlier this week, Meta CEO Mark Zuckerberg shared a vision for a near-future in which “personal [AI] superintelligence for everyone” forms “the beginning of a new era for humanity.” The newly formed Meta Superintelligence Labs—freshly staffed with multiple high-level acquisitions from OpenAI and other AI companies—will spearhead the development of “our next generation of models to get to the frontier in the next year or so,” Zuckerberg wrote.

Reading that memo, I couldn’t help but think of another “vision for the future” Zuckerberg shared not that long ago. At his 2021 Facebook Connect keynote, Zuckerberg laid out his plan for the metaverse, a virtual place where “you’re gonna be able to do almost anything you can imagine” and which would form the basis of “the next version of the Internet.”

“The future of the Internet” of the recent past.

“The future of the Internet” of the recent past. Credit: Meta

Zuckerberg believed in that vision so much at the time that he abandoned the well-known Facebook corporate brand in favor of the new name “Meta.” “I’m going to keep pushing and giving everything I’ve got to make this happen now,” Zuckerberg said at the time. Less than four years later, Zuckerberg seems to now be “giving everything [he’s] got” for a vision of AI “superintelligence,” reportedly offering pay packages of up to $300 million over four years to attract top talent from other AI companies (Meta has since denied those reports, saying, “The size and structure of these compensation packages have been misrepresented all over the place”).

Once again, Zuckerberg is promising that this new technology will revolutionize our lives and replace the ways we currently socialize and work on the Internet. But the utter failure (so far) of those over-the-top promises for the metaverse has us more than a little skeptical of how impactful Zuckerberg’s vision of “personal superintelligence for everyone” will truly be.

Meta-vision

Looking back at Zuckerberg’s 2021 Facebook Connect keynote shows just how hard the company was selling the promise of the metaverse at the time. Zuckerberg said the metaverse would represent an “even more immersive and embodied Internet” where “everything we do online today—connecting socially, entertainment, games, work—is going to be more natural and vivid.”

Mark Zuckerberg lays out his vision for the metaverse in 2021.

“Teleporting around the metaverse is going to be like clicking a link on the Internet,” Zuckerberg promised, and metaverse users would probably switch between “a photorealistic avatar for work, a stylized one for hanging out, and maybe even a fantasy one for gaming.” This kind of personalization would lead to “hundreds of thousands” of artists being able to make a living selling virtual metaverse goods that could be embedded in virtual or real-world environments.

“Lots of things that are physical today, like screens, will just be able to be holograms in the future,” Zuckerberg promised. “You won’t need a physical TV; it’ll just be a one-dollar hologram from some high school kid halfway across the world… we’ll be able to express ourselves in new joyful, completely immersive ways, and that’s going to unlock a lot of amazing new experiences.”

A pre-rendered concept video showed metaverse users playing poker in a zero-gravity space station with robot avatars, then pausing briefly to appreciate some animated 3D art a friend had encountered on the street. Another video showed a young woman teleporting via metaverse avatar to virtually join a friend attending a live concert in Tokyo, then buying virtual merch from the concert at a metaverse afterparty from the comfort of her home. Yet another showed old men playing chess on a park bench, even though one of the players was sitting across the country.

Meta-failure

Fast forward to 2025, and the current reality of Zuckerberg’s metaverse efforts bears almost no resemblance to anything shown or discussed back in 2021. Even enthusiasts describe Meta’s Horizon Worlds as a “depressing” and “lonely” experience characterized by “completely empty” venues. And Meta engineers anonymously gripe about metaverse tools that even employees actively avoid using and a messy codebase that was treated like “a 3D version of a mobile app. “

screen sharing

Even Meta employees reportedly don’t want to work in Horizon Workrooms.

Even Meta employees reportedly don’t want to work in Horizon Workrooms. Credit: Facebook

The creation of a $50 million creator fund seems to have failed to encourage peeved creators to give the metaverse another chance. Things look a bit better if you expand your view past Meta’s own metaverse sandbox; the chaotic world of VR Chat attracts tens of thousands of daily users on Steam alone, for instance. Still, we’re a far cry from the replacement for the mobile Internet that Zuckerberg once trumpeted.

Then again, it’s possible that we just haven’t given Zuckerberg’s version of the metaverse enough time to develop. Back in 2021, he said that “a lot of this is going to be mainstream” within “the next five or 10 years.” That timeframe gives Meta at least a few more years to develop and release its long-teased, lightweight augmented reality glasses that the company showed off last year in the form of a prototype that reportedly still costs $10,000 per unit.

Zuckerberg shows off prototype AR glasses that could change the way we think about “the metaverse.” Credit: Bloomberg / Contributor | Bloomberg

Maybe those glasses will ignite widespread interest in the metaverse in a way that Meta’s bulky, niche VR goggles have utterly failed to. Regardless, after nearly four years and roughly $60 billion in VR-related losses, Meta thus far has surprisingly little to show for its massive investment in Zuckerberg’s metaverse vision.

Our AI future?

When I hear Zuckerberg talk about the promise of AI these days, it’s hard not to hear echoes of his monumental vision for the metaverse from 2021. If anything, Zuckerberg’s vision of our AI-powered future is even more grandiose than his view of the metaverse.

As with the metaverse, Zuckerberg now sees AI forming a replacement for the current version of the Internet. “Do you think in five years we’re just going to be sitting in our feed and consuming media that’s just video?” Zuckerberg asked rhetorically in an April interview with Drawkesh Patel. “No, it’s going to be interactive,” he continued, envisioning something like Instagram Reels, but “you can talk to it, or interact with it, and it talks back, or it changes what it’s doing. Or you can jump into it like a game and interact with it. That’s all going to be AI.”

Mark Zuckerberg talks about all the ways superhuman AI is going to change our lives in the near future.

As with the Metaverse, Zuckerberg sees AI as revolutionizing the way we interact with each other. He envisions “always-on video chats with the AI” incorporating expressions and body language borrowed from the company’s work on the metaverse. And our relationships with AI models are “just going to get more intense as these AIs become more unique, more personable, more intelligent, more spontaneous, more funny, and so forth,” Zuckerberg said. “As the personalization loop kicks in and the AI starts to get to know you better and better, that will just be really compelling.”

Zuckerberg did allow that relationships with AI would “probably not” replace in-person connections, because there are “things that are better about physical connections when you can have them.” At the same time, he said, for the average American who has three friends, AI relationships can fill the “demand” for “something like 15 friends” without the effort of real-world socializing. “People just don’t have as much connection as they want,” Zuckerberg said. “They feel more alone a lot of the time than they would like.”

A toy robot saying

Why chat with real friends on Facebook when you can chat with AI avatars?

Credit: Benj Edwards / Getty Images

Why chat with real friends on Facebook when you can chat with AI avatars? Credit: Benj Edwards / Getty Images

Zuckerberg also sees AI leading to a flourishing of human productivity and creativity in a way even his wildest metaverse imaginings couldn’t match. Zuckerberg said that AI advancement could “lead toward a world of abundance where everyone has these superhuman tools to create whatever they want.” That means personal access to “a super powerful [virtual] software engineer” and AIs that are “solving diseases, advancing science, developing new technology that makes our lives better.”

That will also mean that some companies will be able to get by with fewer employees before too long, Zuckerberg said. In customer service, for instance, “as AI gets better, you’re going to get to a place where AI can handle a bunch of people’s issues,” he said. “Not all of them—maybe 10 years from now it can handle all of them—but thinking about a three- to five-year time horizon, it will be able to handle a bunch.“

In the longer term, Zuckerberg said, AIs will be integrated into our more casual pursuits as well. “If everyone has these superhuman tools to create a ton of different stuff, you’re going to get incredible diversity,” and “the amount of creativity that’s going to be unlocked is going to be massive,” he said. “I would guess the world is going to get a lot funnier, weirder, and quirkier, the way that memes on the Internet have gotten over the last 10 years.”

Compare and contrast

To be sure, there are some important differences between the past promise of the metaverse and the current promise of AI technology. Zuckerberg claims that a billion people use Meta’s AI products monthly, for instance, utterly dwarfing the highest estimates for regular use of “the metaverse” or augmented reality as a whole (even if many AI users seem to balk at paying for regular use of AI tools). Meta coders are also reportedly already using AI coding tools regularly in a way they never did with Meta’s metaverse tools. And people are already developing what they consider meaningful relationships with AI personas, whether that’s in the form of therapists or romantic partners.

Still, there are reasons to be skeptical about the future of AI when current models still routinely hallucinate basic facts, show fundamental issues when attempting reasoning, and struggle with basic tasks like beating a children’s video game. The path from where we are to a supposed “superhuman” AI is not simple or inevitable, despite the handwaving of industry boosters like Zuckerberg.

Artist’s conception of Carmack’s VR avatar waving goodbye to Meta.

Artist’s conception of Carmack’s VR avatar waving goodbye to Meta.

At the 2021 rollout of Meta’s push to develop a metaverse, high-ranking Meta executives like John Carmack were at least up front about the technical and product-development barriers that could get in the way of Zuckerberg’s vision. “Everybody that wants to work on the metaverse talks about the limitless possibilities of it,” Carmack said at the time (before departing the company in late 2022). “But it’s not limitless. It is a challenge to fit things in, but you can make smarter decisions about exactly what is important and then really optimize the heck out of things.”

Today, those kinds of voices of internal skepticism seem in short supply as Meta sets itself up to push AI in the same way it once backed the metaverse. Don’t be surprised, though, if today’s promise that we’re at “the beginning of a new era for humanity” ages about as well as Meta’s former promises about a metaverse where “you’re gonna be able to do almost anything you can imagine.”

Photo of Kyle Orland

Kyle Orland has been the Senior Gaming Editor at Ars Technica since 2012, writing primarily about the business, tech, and culture behind video games. He has journalism and computer science degrees from University of Maryland. He once wrote a whole book about Minesweeper.

Meta’s “AI superintelligence” effort sounds just like its failed “metaverse” Read More »

tiktok-is-being-flooded-with-racist-ai-videos-generated-by-google’s-veo-3

TikTok is being flooded with racist AI videos generated by Google’s Veo 3

The release of Google’s Veo 3 video generator in May represented a disconcerting leap in AI video quality. While many of the viral AI videos we’ve seen are harmless fun, the model’s pixel-perfect output can also be used for nefarious purposes. On TikTok, which may or may not be banned in the coming months, users have noticed a surplus of racist AI videos, courtesy of Google’s Veo 3.

According to a report from MediaMatters, numerous TikTok accounts have started posting AI-generated videos that use racist and antisemitic tropes in recent weeks. Most of the AI vitriol is aimed at Black people, depicting them as “the usual suspects” in crimes, absent parents, and monkeys with an affinity for watermelon. The content also targets immigrants and Jewish people. The videos top out at eight seconds and bear the “Veo” watermark, confirming they came from Google’s leading AI model.

The compilation video below has examples pulled from TikTok since the release of Veo 3, but be warned, it contains racist and antisemitic content. Some of the videos are shocking, which is likely the point—nothing drives engagement on social media like anger and drama. MediaMatters reports that the original posts have numerous comments echoing the stereotypes used in the video.

Hateful AI videos generated by Veo 3 spreading on TikTok.

Google has stressed security when announcing new AI models—we’ve all seen an AI refuse to complete a task that runs afoul of its guardrails. And it’s never fun when you have genuinely harmless intentions, but the system throws a false positive and blocks your output. Google has mostly struck the right balance previously, but it appears that Veo 3 is more compliant. We’ve tested a few simple prompts with Veo 3 and found it easy to reproduce elements of these videos.

Clear but unenforced policies

TikTok’s terms of service ban this kind of content. “We do not allow any hate speech, hateful behavior, or promotion of hateful ideologies. This includes explicit or implicit content that attacks a protected group,” the community guidelines read. Despite this blanket ban on racist caricatures, the hateful Veo 3 videos appear to be spreading unchecked.

TikTok is being flooded with racist AI videos generated by Google’s Veo 3 Read More »

everything-that-could-go-wrong-with-x’s-new-ai-written-community-notes

Everything that could go wrong with X’s new AI-written community notes


X says AI can supercharge community notes, but that comes with obvious risks.

Elon Musk’s X arguably revolutionized social media fact-checking by rolling out “community notes,” which created a system to crowdsource diverse views on whether certain X posts were trustworthy or not.

But now, the platform plans to allow AI to write community notes, and that could potentially ruin whatever trust X users had in the fact-checking system—which X has fully acknowledged.

In a research paper, X described the initiative as an “upgrade” while explaining everything that could possibly go wrong with AI-written community notes.

In an ideal world, X described AI agents that speed up and increase the number of community notes added to incorrect posts, ramping up fact-checking efforts platform-wide. Each AI-written note will be rated by a human reviewer, providing feedback that makes the AI agent better at writing notes the longer this feedback loop cycles. As the AI agents get better at writing notes, that leaves human reviewers to focus on more nuanced fact-checking that AI cannot quickly address, such as posts requiring niche expertise or social awareness. Together, the human and AI reviewers, if all goes well, could transform not just X’s fact-checking, X’s paper suggested, but also potentially provide “a blueprint for a new form of human-AI collaboration in the production of public knowledge.”

Among key questions that remain, however, is a big one: X isn’t sure if AI-written notes will be as accurate as notes written by humans. Complicating that further, it seems likely that AI agents could generate “persuasive but inaccurate notes,” which human raters might rate as helpful since AI is “exceptionally skilled at crafting persuasive, emotionally resonant, and seemingly neutral notes.” That could disrupt the feedback loop, watering down community notes and making the whole system less trustworthy over time, X’s research paper warned.

“If rated helpfulness isn’t perfectly correlated with accuracy, then highly polished but misleading notes could be more likely to pass the approval threshold,” the paper said. “This risk could grow as LLMs advance; they could not only write persuasively but also more easily research and construct a seemingly robust body of evidence for nearly any claim, regardless of its veracity, making it even harder for human raters to spot deception or errors.”

X is already facing criticism over its AI plans. On Tuesday, former United Kingdom technology minister, Damian Collins, accused X of building a system that could allow “the industrial manipulation of what people see and decide to trust” on a platform with more than 600 million users, The Guardian reported.

Collins claimed that AI notes risked increasing the promotion of “lies and conspiracy theories” on X, and he wasn’t the only expert sounding alarms. Samuel Stockwell, a research associate at the Centre for Emerging Technology and Security at the Alan Turing Institute, told The Guardian that X’s success largely depends on “the quality of safeguards X puts in place against the risk that these AI ‘note writers’ could hallucinate and amplify misinformation in their outputs.”

“AI chatbots often struggle with nuance and context but are good at confidently providing answers that sound persuasive even when untrue,” Stockwell said. “That could be a dangerous combination if not effectively addressed by the platform.”

Also complicating things: anyone can create an AI agent using any technology to write community notes, X’s Community Notes account explained. That means that some AI agents may be more biased or defective than others.

If this dystopian version of events occurs, X predicts that human writers may get sick of writing notes, threatening the diversity of viewpoints that made community notes so trustworthy to begin with.

And for any human writers and reviewers who stick around, it’s possible that the sheer volume of AI-written notes may overload them. Andy Dudfield, the head of AI at a UK fact-checking organization called Full Fact, told The Guardian that X risks “increasing the already significant burden on human reviewers to check even more draft notes, opening the door to a worrying and plausible situation in which notes could be drafted, reviewed, and published entirely by AI without the careful consideration that human input provides.”

X is planning more research to ensure the “human rating capacity can sufficiently scale,” but if it cannot solve this riddle, it knows “the impact of the most genuinely critical notes” risks being diluted.

One possible solution to this “bottleneck,” researchers noted, would be to remove the human review process and apply AI-written notes in “similar contexts” that human raters have previously approved. But the biggest potential downfall there is obvious.

“Automatically matching notes to posts that people do not think need them could significantly undermine trust in the system,” X’s paper acknowledged.

Ultimately, AI note writers on X may be deemed an “erroneous” tool, researchers admitted, but they’re going ahead with testing to find out.

AI-written notes will start posting this month

All AI-written community notes “will be clearly marked for users,” X’s Community Notes account said. The first AI notes will only appear on posts where people have requested a note, the account said, but eventually AI note writers could be allowed to select posts for fact-checking.

More will be revealed when AI-written notes start appearing on X later this month, but in the meantime, X users can start testing AI note writers today and soon be considered for admission in the initial cohort of AI agents. (If any Ars readers end up testing out an AI note writer, this Ars writer would be curious to learn more about your experience.)

For its research, X collaborated with post-graduate students, research affiliates, and professors investigating topics like human trust in AI, fine-tuning AI, and AI safety at Harvard University, the Massachusetts Institute of Technology, Stanford University, and the University of Washington.

Researchers agreed that “under certain circumstances,” AI agents can “produce notes that are of similar quality to human-written notes—at a fraction of the time and effort.” They suggested that more research is needed to overcome flagged risks to reap the benefits of what could be “a transformative opportunity” that “offers promise of dramatically increased scale and speed” of fact-checking on X.

If AI note writers “generate initial drafts that represent a wider range of perspectives than a single human writer typically could, the quality of community deliberation is improved from the start,” the paper said.

Future of AI notes

Researchers imagine that once X’s testing is completed, AI note writers could not just aid in researching problematic posts flagged by human users, but also one day select posts predicted to go viral and stop misinformation from spreading faster than human reviewers could.

Additional perks from this automated system, they suggested, would include X note raters quickly accessing more thorough research and evidence synthesis, as well as clearer note composition, which could speed up the rating process.

And perhaps one day, AI agents could even learn to predict rating scores to speed things up even more, researchers speculated. However, more research would be needed to ensure that wouldn’t homogenize community notes, buffing them out to the point that no one reads them.

Perhaps the most Musk-ian of ideas proposed in the paper, is a notion of training AI note writers with clashing views to “adversarially debate the merits of a note.” Supposedly, that “could help instantly surface potential flaws, hidden biases, or fabricated evidence, empowering the human rater to make a more informed judgment.”

“Instead of starting from scratch, the rater now plays the role of an adjudicator—evaluating a structured clash of arguments,” the paper said.

While X may be moving to reduce the workload for X users writing community notes, it’s clear that AI could never replace humans, researchers said. Those humans are necessary for more than just rubber-stamping AI-written notes.

Human notes that are “written from scratch” are valuable to train the AI agents and some raters’ niche expertise cannot easily be replicated, the paper said. And perhaps most obviously, humans “are uniquely positioned to identify deficits or biases” and therefore more likely to be compelled to write notes “on topics the automated writers overlook,” such as spam or scams.

Photo of Ashley Belanger

Ashley is a senior policy reporter for Ars Technica, dedicated to tracking social impacts of emerging policies and new technologies. She is a Chicago-based journalist with 20 years of experience.

Everything that could go wrong with X’s new AI-written community notes Read More »

meta,-tiktok-can’t-toss-wrongful-death-suit-from-mom-of-“subway-surfing”-teen

Meta, TikTok can’t toss wrongful death suit from mom of “subway surfing” teen

Section 230 has so far failed to shield Meta and TikTok owner ByteDance from a lawsuit raised by a mother who alleged that her son’s wrongful death followed a flood of “subway surfing” videos platforms intentionally targeted to teens in New York.

In a decision Monday, New York State Supreme Court Judge Paul Goetz largely denied social media companies’ motions to dismiss claims they argued should be barred under Section 230 and the First Amendment. Goetz said that the mother, Norma Nazario, had adequately alleged that subway surfing content “was purposefully fed” to her son Zackery “because of his age” and “not because of any user inputs that indicated he was interested in seeing such content.”

Unlike other Section 230 cases in which platforms’ algorithms were determined to be content-neutral, Goetz wrote that in this case, “it is plausible that the social media defendants’ role exceeded that of neutral assistance in promoting content and constituted active identification of users who would be most impacted by the content.”

Platforms may be forced to demystify algorithms

Moving forward, Nazario will have a chance to seek discovery that could show exactly how Zackery came to interact with the subway surfing content. In her complaint, she did not ask for the removal of all subway surfing content but rather wants to see platforms held accountable for allegedly dangerous design choices that supposedly target unwitting teens.

“Social media defendants should not be permitted to actively target young users of its applications with dangerous ‘challenges’ before the user gives any indication that they are specifically interested in such content and without warning,” Nazario has argued.

And if she’s proven right, that means platforms won’t be forced to censor any content but must instead update algorithms to stop sending “dangerous” challenges to keep teens engaged at a time when they’re more likely to make reckless decisions, Goetz suggested.

Meta, TikTok can’t toss wrongful death suit from mom of “subway surfing” teen Read More »

trump’s-tariff-threat-pushes-canada-to-scrap-digital-services-tax

Trump’s tariff threat pushes Canada to scrap digital services tax

In a sudden reversal, Canada has caved and will remove its digital services tax after trade talks with the US suddenly fell apart this weekend.

Blocked just hours before taking effect, the controversial digital services tax (DST) would have charged big US tech companies like Apple, Google, and Meta a 3 percent tax on all digital services revenue earned from Canadian users. Frustrating US tech giants, Canada also sought to collect retroactive taxes dating back to 2022.

Over the weekend, President Donald Trump claimed the tax was a “direct and blatant attack” on US tech companies and terminated the trade talks, while threatening to impose a new tariff rate on Canadian goods by July 4.

On Sunday, Canadian Prime Minister Mark Carney seemingly bowed to Trump’s pressure campaign, abruptly doing an “about turn” after previously refusing to pause the DST despite Trump’s opposition, NBC News reported.

But it wasn’t just Trump pushing Carney to reconsider the tax. A nonprofit representing CEOs and leaders of some of Canada’s biggest businesses, the Business Council of Canada, had warned that Carney defending the tax risked “undermining Canada’s economic relationship with its most important trading partner,” Al Jazeera reported.

If Trump were to impose new tariffs on Canada, it could have “large ripple effects across both economies,” the Council warned, potentially disrupting markets for automobiles, minerals, energy, and aluminum. And Trump—who has been bashing Canada with annexation threats throughout trade talks—had also threatened a Section 301 investigation into impacts of the DST on the US economy, which meant other punitive measures could be coming if the DST wasn’t removed. To Canada’s business leaders, the costs of defending the DST were seemingly becoming too high.

Trump’s tariff threat pushes Canada to scrap digital services tax Read More »

scotus-upholds-part-of-aca-that-makes-preventive-care-fully-covered

SCOTUS upholds part of ACA that makes preventive care fully covered

The USPSTF is made up of 16 medical experts who carefully review scientific data and run models to assess what preventive health interventions are best, using a framework of recommendation gradings from A to F. Any recommendations graded A or B by the task force are required by the ACA to be covered by health plans at no cost to patients.

The US health department argued that the task force members are, in fact, appointed, and under control of the health secretary, a role currently filled by anti-vaccine advocate Robert F. Kennedy Jr.

Two lower courts in Texas sided with the Christian group, saying that the government violated the appointments clause.

But today, in a 6–3 ruling, the Supreme Court disagreed. Chief Justice John Roberts and Justices Amy Coney Barrett, Brett Kavanaugh, Elena Kagan, Ketanji Brown Jackson, and Sonia Sotomayor made up the majority.

Writing on their behalf, Kavanaugh explained: “Task Force members are supervised and directed by the Secretary, who in turn answers to the President, preserving the chain of command in Article II.”

While the ruling means that coverage of preventive health care is no longer under threat, the ruling clarifies that the health secretary has direct authority over the USPSTF. The clarification raises concern that the current secretary, Kennedy, could remove task force members and/or undo recommendations to suit his personal ideology, as he is now doing with the vaccine advisory board at the Centers for Disease Control and Prevention.

SCOTUS upholds part of ACA that makes preventive care fully covered Read More »

childhood-and-education-#11:-the-art-of-learning

Childhood and Education #11: The Art of Learning

In honor of the latest (always deeply, deeply unpopular) attempts to destroy tracking and gifted and talented programs, and other attempts to get children to actually learn things, I thought it a good time to compile a number of related items.

  1. Lack Of Tracking Hurts Actual Everyone.

  2. Not Tracking Especially Hurts Those Who Are Struggling.

  3. No Child Left Behind Left Behind.

  4. Read Early, Read Often.

  5. Mirror, Mirror.

  6. Spaced Repetition.

  7. Learning Methods.

  8. Interruptions.

  9. Memorization.

  10. Math is Hard.

  11. Get to Work.

  12. The Whiz Kids.

  13. High School Does Not Seem To Teach Kids Much.

  14. Two Kinds of Essays.

Gifted programs and educational tracking are also super duper popular, it is remarkably absurd that our political process cannot prevent these programs from being destroyed.

As in things like this keep happening:

NY Post: Seattle Public Schools shuts down gifted and talented program for being oversaturated with white and asian students.

Once again, now, we face that threat in my home of New York City. The Democratic nominee for New York City Mayor is opposed to gifted and talented programs, and wants to destroy them. Yet few people seem to have much noticed, or decided to much care. Once people realize the danger it may well be too late.

To state the obvious, if you group children by ability in each subject rather than age, they learn better. Yes, there are logistical concerns, but the benefits are immense. Gifted programs are great but mostly seem like a patch to the fact that we are so obsessed with everyone in the room being in the same ‘grade’ at all times.

I agree with Tracing Woods that ‘teaches each according to their ability’ is the bare minimum before I will believe that your institution is making a real attempt to educate children.

Tracing Woods: A good example of the absurdity of “grade-level” thinking, from Texas: “If they’re reading on a second-grade level, but they’re in the third grade, they’re always going to receive that third-grade instruction.”

This makes no sense. Learning does not simply follow age.

Imagine having a “grade level” in chess. 9-year-olds in the third grade advancing to play against 1100 elo players. 100 more elo per year.

“Grade-level performance” has always been nonsensical. Learning does not work that way. Just figure out what people actually know and need

Danielle Fong: sooner or later, and probably sooner, all this will be thrown out in favor of an adaptive learning environment, the human teachers and other students can give individual attention when you’re stuck, and you can maxx out learning like a game. already happening privately. +2sd

Eowyn Jackman: I know I’m supposed to move on but I don’t think I can ever forgive the US public primary education complex for testing me and saying he “has a college reading level” at 9 years old but not administering an exam for me to be in the “gifted/advanced” classes at my PWI until two years later when I wrote my first book. So much time wasted.

Would’ve loved to have an office, secretarial job before 15 tbch

Wow I need therapy 😅

James Miller: Imagine math classes grouped by ability, not age. You’d have classes with 7- and 17-year-olds together.

Tracing Woods: And that’s a good thing (No, but seriously, at that point put them in different schools)

I bite the bullet. I do think it’s fine and actively good to have 7-year-olds and 17-year-olds in the same math classroom.

Of course, if you think that learning is bad, you won’t like this plan to have kids learn.

Owen Cyclops: this conception of early childhood education poses a generally unasked question to our present educational paradigm, which is: what could be the potential downside of learning things too quickly? from my perspective, this is basically never asked. its a total 100% blind spot.

[this is part of a long thread complaining about how awful it is if kids were to learn things too quickly, before they are supposed to, because ‘stages of development’]

Thomas: The main downside brought up re: “learning too quickly” is being ahead of peers. There’s parents’ posts online sharing experiences of being told not to read to their kid at home, or not teach them new math concepts.

Divia: Fwiw I hear people talk about the downside of learning too fast constantly! And have since I was a kid. Mostly whenever anyone wanted academics to be faster than it was convenient for someone they would talk about the downsides IME.

The ‘learning too much too soon is bad’ paradigm seems categorically insane. Here’s the concrete example Owen gives:

Owen: i’m in the forest (this actually happened). a kid asks: why is this log making a sound when i hit it with a stick? and this adult says, “well, the molecules in the log vibrate when you hit it, because you hitting it transfers energy into the log. you’re hearing the vibrations.”

the kid is NOT at that developmental stage (in my opinion). even if they can understand this (may be impossible, they might just be repeating what you’re saying), that type of understanding comes later. that is not the developmental stage a five year old is at. not their world.

I mean, maybe they have enough physics that this is the right explanation. Maybe they don’t, and it would be modestly better to simplify it a bit. But then they can ask.

This seems fine, and opening the doors to ‘what’s a molecule?’ or other similar questions seems great. They ask, you tell them. The actual objection here is ‘you need to explain [A] then [B] then [C]’ and I think that is usually vastly overstated but okay, sure, that doesn’t tell you what age you should tell people [C].

At some point, if you’re far enough ahead or behind the class, the class is worthless to you. A remarkable number of students hit this threshold, or would hit it if they weren’t being sabotaged to not hit it.

There is a point of diminishing returns for sufficiently young students, where you start to outstrip your ability to efficiently process the information and learn the material at your current age, but many students are very far away from that limit. As of course they are, if they’re forced to proceed at the speed of the typical subpar ship, which shall we say is not close to even that ship’s maximum speed.

Tracing Woodgrains: The most refreshing and novel thing to me about the Alpha/GT School model is the four hours of extracurricular workshops.

In sixth grade, I did online school. Completed the entire seventh and eighth grade curriculum in two hours a day that year. And then with all my extra time, I played video games.

Much healthier to have an institution that recognizes what can really be done with all of that time.

Patrick McKenzie: While this makes me feel *extremelyold, my recollection of 4th-8th grade was a solid 1.5-2 years of instruction and 3 years of being physically present while classroom management was conducted.

If you object to being physically present for classroom management you will become one of the focuses of classroom management, so I spent a lot of time counting ceiling tiles, drawing cubes, and writing out the solutions for all possible games of 24.

Ryan Moulton: I think you can get 80% of the benefit of ~whatever gifted/tracking/etc. program in gradeschool by having any mechanism at all to let self directed kids be self directed.

“You already know this, so just go do Khan Academy for an hour during math time” is an essentially zero cost intervention.

“If you finish your grammar/spelling/vocab sheet early, you are allowed to read your library book” is too.

Teachers don’t reliably do this, because letting kids do different things sometimes creates conflict in class. If they did reliably do this then most of the debate about how to organize gradeschool among kids of different academic ability levels goes away.

When you stop holding kids back, you instead get things like in this thread: The 11 year old boy in Organic Chemistry that goes on to be a researcher, the girl graduating college at that same age. Which is proof by counterexample that all this ‘not ready’ nonsense is indeed nonsense.

An important aspect of denying the reality of different learning abilities is that it is absurdly cruel to those you are gaslighting – they are told that they are just as smart and good at learning as everyone else, so what are they to think about their failure to get those same results?

Anonymous: We have this myth of a ‘fast learner’ but research suggests people actually learn at similar rates. A ‘fast learner’ is really just someone who’s been exposed to this problem/material before, maybe multiple times. People seeing something for the first time will struggle.

Eigenrobot: there’s something cruel about asserting that people dont differ in ability like. its cruel to directly rub it in someones face, but its also cruel to proactively lie to people when, be real, they either know youre lying or will end up damaged by believing you.

Some may say “maybe but acknowledging difference in ability is bad because the moral value is a function of their ability and we don’t want a norm of believing that some people are worthless” and to them i say find god.

What happened with No Child Left Behind? One of its architects explains that they knew you can’t actually leave no child behind. That’s impossible. You obviously have to leave some children behind. They were declaring a goal of none with the plan to modifying it to some. Then they met Congress, which has become unable to do reasonable things, so instead of the usual ‘quietly change the rules and declare victory’ the insane requirements stayed on the books and everything went to hell trying to work around them. Whoops.

The good news is that now everyone knows that Congress is unable to fix broken things, so we can correctly plan that laws will stay on the books indefinitely exactly as written no matter how broken they get, and write them accordingly. I endorse both the fully serious and sarcastic versions of that sentence.

Reading is a godsend. The earlier your kids can read, the better, on so many levels. One that is vastly underestimated is this makes parenting vastly easier, you’ve unlocked unlimited cheap, healthy and non-disruptive education and entertainment. It unlocks all the things.

Erik Hoel here says that with a year of dedicated effort, he got his 3-year-old to read at 9-year-old levels, and offers a guide: Part 1, Part 2, and Part 3. The core is simple, phonics plus spaced repetition plus books as the center of entertainment. In my experience, you need to push to the critical point where they can largely teach themselves via more reading plus a little adult help (and AI voice help?) and then you are home free.

Eric seems to have focused entirely on reading first and ignored other subjects, including math, while he was doing that. This actually seems exactly right to me, if you can (as he did) get your child to buy in, because reading unlocks all the things and builds on itself. Get to that critical point first, worry about everything else (that isn’t a short term practical need) later.

A woman virtuously notices she is confused about why mirrors work the way they do and asks, essentially, ‘mirrors how do they even work,’ as opposed to most other people who have no idea how mirrors work the way they do. That’s great, thumbs up.

Then this gets quoted with ‘this is why we need the department of education,’ but no, actually we had a department of education and not only does no one know no one asks the question. This is why you don’t need the department of education. You need an LLM, and you need to teach people to be curious and give them actual education.

We all know it works. So why don’t schools use it?

Eric Hoel: “The power of spaced repetition has been known for 150 years. It replicates and has large effects. So why is spaced repetition (or even its more implementable form of spiral learning) not used all the time in classrooms? No one knows!”

Spaced repetition works so well that it ends up causing me to memorize a lot of spoilers that I actively don’t want to remember. As in, I’ll keep trying not to think of the pink elephant, to remind myself to forget, at increasingly long intervals, which cause me to remember, and whoops. Damn it. One could use this power for good.

Paper suggests new way to teach economics nonlinearly, supposedly so it will line up with how people learn. I think this is essentially ‘include spaced repetition in your lecture plan.’ Which is one of those obviously good ideas that no one implements.

bosco: 4yo is probably ready to memorize her address, and at least one phone number, in case of emergencies, but she is completely uninterested when I try to teach her the spiel any tips?

Niels Hoven: If you’re having trouble getting your kid to remember your phone number, make it the password to your iPad’s lock screen and watch how quickly they memorize it

Kelsey Piper: We’ve been encouraging the eight year old to pick up some history by saying to one another in front of her “oh, the switch passcode is the year of the Marco Polo bridge incident” or “yeah I changed it to the year Constantinople was sacked in the Fourth Crusade.”

It seems like something you could study and measure, but it seems no one has?

Pamela Hobart: why do so few people have any real insight into just how *verydisruptive interruptions are?

today’s painful reminder via @PepsMccrea

I’m not even sure ‘learning time lost’ is the right measure, as time is not created equal, and there are different types of learning, some of which are far more disruptable than others, so this can shift learning composition, likely in ways we do not want.

The problem, identified.

Ben Hsieh: Parents will likely say, “Drills and rote memorization? That pales in comparison to my strategy, instilling a lifelong love of learning,” and then not instill a lifelong love of learning.

Drills and rote memorization are not ideal, but they work. If you can do better, great, but way too many parents and schools think they can do better and are wrong.

Autumn: if as a society we value math-based disciplines so much, can i ask why we teach calculus in a way thats optimized to weed out students who wont comply with hazing. why do we design them to teach horrible habits for later studies in math?

Sarah Constantin: this is a pet peeve of mine.

there are people who want to make classes easier & less advanced (e.g. not teaching calculus in high school) and there are people who want to scout for exceptional talent but there’s very few people pushing for actually teaching the material well!

“let’s try to make sure everyone in this calculus class learns calculus” is a very lonely mission, even though IMO it’s common sense.

In general we have the problem of teaching math in ways that make many students hate math. I don’t think this is especially a problem in calculus, at least the way I learned it (in a high school class)? Autumn suggests the college method is somehow worse.

Obviously ‘won’t comply with hazing’ is a terrible reason to drive someone away, calculus is vital to understanding the world (for intuition and general understanding, not for actually Doing Calculus, I actively do a happy dance every time I get do Do Calculus which is very rare) and at minimum we want everyone who takes such a class to learn calculus.

However, in terms of the math-based disciplines, as long as we are gating on actual ability I think weeding people out here is in principle fine? In the sense that if you design a good filter, you’re doing a favor for those you filter out, except that now they don’t get to understand calculus.

I strongly agree with this. It is very obvious interacting with kids that they yearn for meaningful work in the ordinary sense of work

Ozy Brennan: my pet parenting theories:

  1. Children yearn for meaningful work; child labor is bad but we’ve overcorrected.

  2. Whenever possible, children should be brought along to adult activities instead of adults going to child activities.

tips for bringing kids along to adult activities: bring a Kindle or (in extreme situations) a laptop. explain behavioral norms ahead of time. allow independence. have friends who like kids. prioritize activities you and kids both like (museums, movies, parks, whatever).

Putting children to pointless industrial work is not a good idea. But if they can understand why what they are doing is useful, yeah, they really dig it, and it seems obviously great for them and also great for you.

For trips, you have to calibrate to your particular kids, but yes, absolutely, especially once the kids cross the ‘can be entertained by a book’ threshold.

Patrick Collison: In which domains are elite practitioners celebrating the kids being better than ever before? Would love to read about a few instances. (Not just where there’s one particular genius, such as Ashwin Sah’s recent success, but where “the kids” as some kind of aggregate appear to be improving.)

Michael Nielsen: It was perhaps in my third week of linear algebra (MP 174!) that the professor told me that incoming students were noticeably worse at math than they used to be There are a very large number of potential confounders here. The point, of course: that was ~30 years ago. “Back when I was a boy…” nostalgia seems to be time-invariant.

I’ll state my prejudice, backed by a wide smattering of anecdotes, and not much else: the top 0.1% today are vastly more competent across a much wider range of subjects than 20 years ago. And that same statement was also true 20 years ago. And 40 years ago. And 60 years ago…

(I don’t mean that a given individual is more competent in every domain. But the ceiling per-domain will be higher, and the range of domains much broader.)

For comparison, the 1939 Putnam, where (IIRC) Feynman was Putnam Fellow. And the 2023 Putnam. I don’t know about you, but I’d rather take the first.

Charles: I think the answer to this is “all of them” or close to it. The very best teenagers at almost every endeavor are better than they’ve ever been.

It’s below the 90th percentile (maybe even higher) where it’s a different story.

Is are children learning? In high school math, the answer seems to be no. The 50th percentile student gets a 230 in 8th grade on this test, and then a 234 in 12th (a second source said 232→237, but that’s the same thing), and we know from the higher percentiles that the test is not being saturated here. Note that reading scores only increased 5 points, from 224→229.

The obvious question is, if students are learning this little, why are we wasting their childhoods in school at all? It seems like there is no point. One would think that at least this much improvement would happen via osmosis and practical learning.

Paul Graham: 13 yo asked me to teach him how to write essays. I asked if he wanted to learn how to write real essays, or the kind you have to write in school. He said the school kind, because he’s writing one for school.

I don’t mind if my kids have to learn math that’s not real math or writing that’s not real writing or science that’s just words, but at the same time as I teach them these things I always try to give them an idea of how they’re fake, and what the real version is like.

At least 13 yo won’t spend years puzzling over how the “conclusion” is supposed to be different from the “introduction” even though they’re saying the same things. I told him upfront it’s just an artifact of this fake format.

RashLabs: How does one write a good essay for school?

Paul Graham: You just write a good essay. But your teachers may freak out if you do.

Patrick McKenzie: Good essays disrupt the production function of teachers w/r/t essay grading/correction. Some teachers give some students a bit of leeway to take more of their cycles than generally required, some of the time.

I once wrote an essay about how the hamburger essay format (buns on top and bottom, three layers in middle) was artificial, limiting, and unengaging, and got a talking to which was… at least minimally helpful.

There are some skills, especially more basic things like spelling and grammar, where the two types of good line up. Being able to write a good School Essay does give you a leg up on a Real Essay, if you don’t get trapped in the arbitrary parts of the format. But past a certain point, they are very different skills.

Paul is nailing the key point. Which is, if a child must write a School Essay, it is vital not to gaslight them about what they are writing, or pretend it has much to do with a Real Essay. Never pretend the fake thing is not fake.

Discussion about this post

Childhood and Education #11: The Art of Learning Read More »

nasa-tested-a-new-sls-booster-that-may-never-fly,-and-the-end-of-it-blew-off

NASA tested a new SLS booster that may never fly, and the end of it blew off


NASA didn’t want to say much about one of the tests, and the other one lost its nozzle.

An uncontained plume of exhaust appeared near the nozzle of an SLS solid rocket booster moments before its nozzle was destroyed during a test-firing Thursday. Credit: NASA

NASA’s Space Launch System appears to have a finite shelf life. The Trump administration wants to cancel it after just three launches, while the preliminary text of a bill making its way through Congress would extend it to five flights.

But chances are low the Space Launch System will make it to nine flights, and if it does, it’s questionable that it would reach that point before 2040. The SLS rocket is a core piece of NASA’s plan to return US astronauts to the Moon under the Artemis program, but the White House seeks to cancel the program in favor of cheaper commercial alternatives.

For the second time in less than a week, NASA test-fired new propulsion hardware Thursday that the agency would need to keep SLS alive. Last Friday, a new liquid-fueled RS-25 engine ignited on a test stand at NASA’s Stennis Space Center in Mississippi. The hydrogen-fueled engine is the first of its kind to be manufactured since the end of the Space Shuttle program. This particular RS-25 engine is assigned to power the fifth flight of the SLS rocket, a mission known as Artemis V.

Then, on Thursday of this week, NASA and Northrop Grumman test-fired a new solid rocket booster in Utah. This booster features a new design that NASA would use to power SLS rockets beginning with the ninth mission, or Artemis IX. The motor tested on Thursday isn’t flight-worthy. It’s a test unit that engineers will use to gather data on the rocket’s performance.

While the engine test in Mississippi apparently went according to plan, the ground firing of the new solid rocket booster didn’t go quite as smoothly. Less than two minutes into the burn, the motor’s exhaust nozzle violently shattered into countless shards of debris. You can watch the moment in the YouTube video below.

At the start of the program nearly 15 years ago, NASA and its backers in Congress pitched the SLS rocket as the powerhouse behind a new era of deep space exploration. The Space Launch System, they said, would have the advantage of recycling old space shuttle engines and boosters, fast-tracking the new rocket’s path to the launch pad for less money than the cost of an all-new vehicle.

That didn’t pan out. Each Artemis mission costs $4.2 billion per flight, and that’s with shuttle-era engines and boosters that NASA and its contractors already have in their inventories. NASA’s 16 leftover shuttle main engines are enough for the first four SLS flights. NASA has leftover parts for eight pairs of solid rocket boosters.

It has been 10 years

Recognizing that shuttle-era parts will eventually run out, NASA signed a contract with Aerojet Rocketdyne to set the stage for the production of new RS-25 engines in 2015. NASA later ordered an initial batch of six RS-25 engines from Aerojet, then added 18 more to the order in 2020, at a price of about $100 million per engine. NASA and its contractor aim to reduce the cost to $70 million per engine, but even that figure is many times the cost of engines of comparable size and power: Blue Origin’s BE-4 and SpaceX’s Raptor.

Finally, NASA test-fired a new flight-rated RS-25 engine for the first time last week at Stennis Space Center. The agency has often provided a livestream of its engine tests at Stennis, but it didn’t offer the public any live video. And this particular test was a pretty big deal. L3Harris, which acquired Aerojet Rocketdyne in 2023, has finally reactivated the RS-25 production line after a decade and billions of dollars of funding.

In fact, NASA made no public statement about the RS-25 test until Monday, and the agency didn’t mention its assignment to fly on the Artemis V mission. If the Trump administration gets its way, the engine will never fly. Maybe that’s fine, but after so long with so much taxpayer investment, this is a milestone worth publicizing, if not celebrating.

L3Harris issued a press release Tuesday confirming the engine’s planned use on the fifth SLS mission. The engine completed a 500-second acceptance test, throttling up to 111 percent of rated thrust, demonstrating more power than engines that flew on the space shuttle or on the first SLS launch in 2022.

A new RS-25 engine, No. 20001, was installed on its test stand in Mississippi earlier this year. Credit: NASA

“This successful acceptance test shows that we’ve been able to replicate the RS-25’s performance and reliability, while incorporating modern manufacturing techniques and upgraded components such as the main combustion chamber, nozzle, and pogo accumulator assembly,” said Kristin Houston, president of space propulsion and power systems at Aerojet Rocketdyne, L3Harris. “Our propulsion technology is key to ensuring the United States leads in lunar exploration, creates a sustained presence on the Moon and does not cede this strategic frontier to other nations.”

The test-firing last Friday came a few days before the 50th anniversary of the first space shuttle main engine test at Stennis on June 24, 1975. That engine carried the serial number 0001. The new RS-25 engine is designated No. 20001.

Watch out

NASA followed last week’s low-key engine test with the test-firing of a solid-fueled booster at Northrop Grumman’s rocket test site in Promontory, Utah, on Thursday. Held in place on its side, the booster produced 3.9 million pounds of thrust, outclassing the power output of the existing boosters assigned to the first eight SLS missions.

Unlike the RS-25 firing at Stennis, NASA chose to broadcast the booster test. Everything appeared to go well until 1 minute and 40 seconds into the burn, when a fiery plume of super-hot exhaust appeared to burn through part of the booster’s structure just above the nozzle. Moments later, the nozzle disintegrated.

Solid rocket boosters can’t be turned off after ignition, and for better or worse, the motor continued firing until it ran out of propellant about 30 seconds later. The rocket sparked a fire in the hills overlooking the test stand.

This was the first test-firing of the Booster Obsolescence and Life Extension (BOLE) program, which aims to develop a higher-performance solid rocket booster for SLS missions. NASA awarded Northrop Grumman a $3.2 billion contract in 2021 to produce boosters with existing shuttle parts for five SLS missions (Artemis IV-VIII), and design, develop, and test a new booster design for Artemis IX.

The boosters produce more than 75 percent of the thrust required to propel the SLS rocket off the launch pad with NASA’s crewed Orion spacecraft on top. Four RS-25 engines power the core stage, collectively generating more than 2 million pounds of thrust.

Northrop Grumman calls the new booster “the largest and most powerful segmented solid rocket motor ever built for human spaceflight.”

One of the most significant changes with the BOLE booster design is that it replaces shuttle-era steel cases with carbon-fiber composite cases. Northrop says the new cases are lighter and stronger. It also replaces the booster’s hydraulic thrust vector control steering system with an electronic system. The propellant packed inside the booster is also different, using a mix that Northrop packs inside its commercial rocket motors instead of the recipe used for the space shuttle.

Northrop Grumman has had a tough time with rocket nozzles in recent years. In 2019, a test motor for the company’s now-canceled Omega rocket lost its nozzle during a test-firing in Utah. Then, last year, a smaller Northrop-made booster flying on United Launch Alliance’s Vulcan rocket lost its nozzle in flight. Vulcan’s guidance system and main engines corrected for the problem, and the rocket still achieved its planned orbit.

Photo of Stephen Clark

Stephen Clark is a space reporter at Ars Technica, covering private space companies and the world’s space agencies. Stephen writes about the nexus of technology, science, policy, and business on and off the planet.

NASA tested a new SLS booster that may never fly, and the end of it blew off Read More »