Google

Google makes Android development private, will continue open source releases

Google is planning a major change to the way it develops new versions of the Android operating system. Since the beginning, large swaths of the software have been developed in public-facing channels, but that will no longer be the case. This does not mean Android is shedding its open source roots, but the process won’t be as transparent.

Google has confirmed to Android Authority that all Android development work going forward will take place in Google’s internal branch. This is a shift from the way Google has worked on Android in the past, which featured frequent updates to the public AOSP branch. Anyone can access AOSP, but the internal branches are only available to Google and companies with a Google Mobile Services (GMS) license, like Samsung, Motorola, and others.

According to the company, it is making this change to simplify things, building on a recent change to trunk-based development. As Google works on both public and private branches of Android, the two fall out of sync with respect to features and API support. This forces Google to tediously merge the branches for every release. By focusing on the internal branch, Google claims it can streamline releases and make life easier for everyone.

When new versions of Android are done, Google says it will continue to publish the source code in AOSP as always. Supposedly, this will allow developers to focus on supporting their apps without keeping track of pending changes to the platform in AOSP. Licensed OEMs, meanwhile, can just focus on the lively internal branch as they work on devices that can take a year or more to launch.

Gemini 2.5 Pro is here with bigger numbers and great vibes

Just a few months after releasing its first Gemini 2.0 AI models, Google is upgrading again. The company says the new Gemini 2.5 Pro Experimental is its “most intelligent” model yet, offering a massive context window, multimodality, and reasoning capabilities. Google points to a raft of benchmarks that show the new Gemini clobbering other large language models (LLMs), and our testing seems to back that up—Gemini 2.5 Pro is one of the most impressive generative AI models we’ve seen.

Gemini 2.5, like all Google’s models going forward, has reasoning built in. The AI essentially fact-checks itself along the way to generating an output. We like to call this “simulated reasoning,” as there’s no evidence that this process is akin to human reasoning. However, it can go a long way to improving LLM outputs. Google specifically cites the model’s “agentic” coding capabilities as a beneficiary of this process. Gemini 2.5 Pro Experimental can, for example, generate a full working video game from a single prompt. We’ve tested this, and it works with the publicly available version of the model.

Gemini 2.5 Pro builds a game in one step.

Google says a lot of things about Gemini 2.5 Pro: it’s smarter, it’s context-aware, it thinks—but it’s hard to quantify what constitutes improvement in generative AI bots. There are some clear technical upsides, though. Gemini 2.5 Pro comes with a 1 million token context window, which is common for the big Gemini models but massive compared to competing models like OpenAI’s GPT or Anthropic’s Claude. You could feed multiple very long books to Gemini 2.5 Pro in a single prompt, and the output maxes out at 64,000 tokens. That’s the same as Flash 2.0, but it’s still objectively a lot of tokens compared to other LLMs.
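For a sense of what those numbers mean in practice, here is a minimal sketch of a long-context request using Google’s google-genai Python SDK. The model identifier, file path, and API key placeholder are assumptions for illustration, not values confirmed by Google; the output cap simply mirrors the figure cited above.

```python
# Minimal sketch: feeding a very large text prompt to Gemini 2.5 Pro Experimental.
# Assumes the google-genai Python SDK; the model ID and file path are illustrative.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder, not a real key

# Several long books' worth of text can still fit inside a ~1M-token context window.
with open("several_long_books.txt", "r", encoding="utf-8") as f:
    source_text = f.read()

response = client.models.generate_content(
    model="gemini-2.5-pro-exp-03-25",  # assumed experimental model ID
    contents=[source_text, "Summarize the recurring themes across these books."],
    config=types.GenerateContentConfig(
        max_output_tokens=64_000,  # output cap cited above
    ),
)
print(response.text)
```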

Naturally, Google has run Gemini 2.5 Pro Experimental through a battery of benchmarks, in which it scores a bit higher than other AI systems. For example, it squeaks past OpenAI’s o3-mini in GPQA and AIME 2025, which measure how well the AI answers complex questions about science and math, respectively. It also set a new record in the Humanity’s Last Exam benchmark, which consists of 3,000 questions curated by domain experts. Google’s new AI managed a score of 18.8 percent to OpenAI’s 14 percent.

After borking my Pixel 4a battery, Google borks me, too


The devil is in the details.

The Pixel 4a. It’s finally here! Credit: Google

It is an immutable law of nature that when you receive a corporate email with a subject line like “Changes coming to your Pixel 4a,” the changes won’t be the sort you like. Indeed, a more honest subject line would usually be: “You’re about to get hosed.”

So I wasn’t surprised, as I read further into this January missive from Google, that an “upcoming software update for your Pixel 4a” would “affect the overall performance and stability of its battery.”

How would my battery be affected? Negatively, of course. “This update will reduce your battery’s runtime and charging performance,” the email said. “To address this, we’re providing some options to consider.”

Our benevolent Google overlords were about to nerf my phone battery—presumably in the interests of “not having it erupt in flames,” though this was never actually made clear—but they recognized the problem, and they were about to provide compensation. This is exactly how these kinds of situations should be handled.

Google offered three options: $50 cash money, a $100 credit to Google’s online store, or a free battery replacement. It seemed fair enough. Yes, not having my phone for a week or two while I shipped it roundtrip to Google could be annoying, but at least the company was directly mitigating the harm it was about to inflict. Indeed, users might actually end up in better shape than before, given the brand-new battery.

So I was feeling relatively sunny toward the giant monopolist when I decided to spring for the 50 simoleons. My thinking was that 1) I didn’t want to lose my phone for a couple of weeks, 2) the update might not be that bad, in which case I’d be ahead by 50 bucks, and 3) I could always put the money towards a battery replacement if assumption No. 2 turned out to be mistaken.

The naïveté of youth!

I selected my $50 “appeasement” through an online form, and two days later, I received an email from Bharath on the Google Support Team.

Bharath wanted me to know that I was eligible for the money and it would soon be in my hands… once I performed a small, almost trivial task: giving some company I had never heard of my name, address, phone number, Social Security number, date of birth, and bank account details.

About that $50…

Google was not, in fact, just “sending” me $50. I had expected, since the problem involved their phones and their update, that the solution would require little or nothing from me. A check or prepaid credit card would arrive in the mail, perhaps, or a drone might deliver a crisp new bill from the sky. I didn’t know and didn’t care, so long as it wasn’t my problem.

But it was my problem. To get the cash, I had to create an account with something called “Payoneer.” This is apparently a reputable payments company, but I had never heard of it, and much about its operations is unclear. For instance, I was given three different ways to sign up depending on whether I 1) “already have a Payoneer account from Google,” 2) “don’t have an account,” or 3) “do have a Payoneer account that was not provided nor activated through Google.”

Say what now?

And though Google promised “no transaction fees,” Payoneer appears to charge an “annual account fee” of $29.95… but only to accounts that receive less than $2,000 through Payoneer in any consecutive 12-month period.

Does this fee apply to me if I sign up through the Google offer? I was directed to Payoneer support with any questions, but the company’s FAQ on the annual account fee doesn’t say.

If the fee does apply to me, do I need to sign up for a Payoneer account, give them all of my most personal financial information, wait the “10 to 18 business days” that Google says it will take to get my money, and then return to Payoneer so that I can cancel my account before racking up some $30 charge a year from now? And I’m supposed to do all this just to get… fifty bucks? One time?

It was far simpler for me to get a recent hundred-dollar rebate on a washing machine… and they didn’t need my SSN or bank account information.

(Reddit users also report that, if you use the wrong web browser to cancel your Payoneer account, you’re hit with an error that says: “This end point requires that the body of all requests be formatted as JSON.”)

Like Lando Calrissian, I realized that this deal was getting worse all the time.

I planned to write Bharath back to switch my “appeasement,” but then I noticed the fine print: No changes are possible after making a selection.

So—no money for me. On the scale of life’s crises, losing $50 is a minor one, and I resolved to move on, facing the world with a cheerful heart and a clear mind, undistracted by the many small annoyances our high-tech overlords continually strew upon the path.

Then the software update arrived.

A decimation situation

When Google said that the new Pixel 4a update would “reduce your battery’s runtime and charging performance,” it was not kidding. Indeed, the update basically destroyed the battery.

Though my phone was three years old, until January of this year, the battery still held up for all-day usage. The screen was nice, the (smallish) phone size was good, and the device remained plenty fast at all the basic tasks: texting, emails, web browsing, snapping photos. I’m trying to reduce both my consumerism and my e-waste, so I was planning to keep the device for at least another year. And even then, it would make a decent hand-me-down device for my younger kids.

After the update, however, the phone burned through a full battery charge in less than two hours. I could pull up a simple podcast app, start playing an episode, and watch the battery percentage decrement every 45 seconds or so. Using the phone was nearly impossible unless one was near a charging cable at all times.

To recap: My phone was shot, I had to jump through several hoops to get my money, and I couldn’t change my “appeasement” once I realized that it wouldn’t work for me.

Within the space of three days, I went from 1) being mildly annoyed at the prospect of having my phone messed with remotely to 2) accepting that Google was (probably) doing it for my own safety and was committed to making things right to 3) berating Google for ruining my device and then using a hostile, data-collecting “appeasement” program to act like it cared. This was probably not the impression Google hoped to leave in people’s minds when issuing the Pixel 4a update.

Removing the Pixel 4a’s battery can be painful, but not as painful as catching fire. Credit: iFixit

Cheap can be quite expensive

The update itself does not appear to be part of some plan to spy on us or to extract revenue but rather to keep people safe. The company tried to remedy the pain with options that, on the surface, felt reasonable, especially given the fact that batteries are well-known as consumable objects that degrade over time. And I’ve had three solid years of service with the 4a, which wasn’t especially expensive to begin with.

That said, I do blame Google in general for the situation. The inflexibility of the approach, the options that aren’t tailored for ease of use in specific countries, the outsourced tech support—these are all hallmarks of today’s global tech behemoths.

It is more efficient, from an algorithmic, employ-as-few-humans-as-possible perspective, to operate “at scale” by choosing global technical solutions over better local options, by choosing outsourced email support, by trying to avoid fraud (and employee time) through preventing program changes, by asking the users to jump through your hoops, by gobbling up ultra-sensitive information because it makes things easier on your end.

While this makes a certain kind of sense, it’s not fun to receive this kind of “efficiency.” When everything goes smoothly, it’s fine—but whenever there’s a problem, or questions arise, these kinds of “efficient, scalable” approaches usually just mean “you’re about to get screwed.”

In the end, Google is willing to pay me $50, but that money comes with its own cost. I’m not willing to pay with my time or with the risk to my financial information, and I will increasingly turn to companies that offer a better experience, that care more about data privacy, that build with higher-quality components, and that take good care of customers.

No company is perfect, of course, and this approach costs a bit more, which butts up against my powerful urge to get a great deal on everything. I have to keep relearning the old lesson—as I am once again with this Pixel 4a fiasco—that cheap gear is not always the best value in the long run.

Photo of Nate Anderson

Italy demands Google poison DNS under strict Piracy Shield law

As spotted by TorrentFreak, AGCOM Commissioner Massimiliano Capitanio took to LinkedIn to celebrate the ruling, as well as the existence of the Italian Piracy Shield. “The Judge confirmed the value of AGCOM’s investigations, once again giving legitimacy to a system for the protection of copyright that is unique in the world,” said Capitanio.

Capitanio went on to complain that Google has routinely ignored AGCOM’s listing of pirate sites, which are supposed to be blocked in 30 minutes or less under the law. He noted the violation was so clear-cut that the order was issued without giving Google a chance to respond, known as inaudita altera parte in Italian courts.

This decision follows a similar case against Internet backbone firm Cloudflare. In January, the Court of Milan found that Cloudflare’s CDN, DNS server, and WARP VPN were facilitating piracy. The court threatened Cloudflare with fines of up to 10,000 euros per day if it did not begin blocking the sites.

Google could face similar sanctions, but AGCOM has had difficulty getting international tech behemoths to acknowledge their legal obligations in the country. We’ve reached out to Google for comment and will update this report if we hear back.

Apple and Google in the hot seat as European regulators ignore Trump warnings

The European Commission is not backing down from efforts to rein in Big Tech. In a series of press releases today, the European Union’s executive arm has announced actions against both Apple and Google. Regulators have announced that Apple will be required to open up support for non-Apple accessories on the iPhone, but it may be too late for Google to make changes. The commission says the search giant has violated the Digital Markets Act, which could lead to a hefty fine.

Since returning to power, Donald Trump has railed against European regulations that target US tech firms. In spite of rising tensions and tough talk, the European Commission seems unfazed and is continuing to follow its more stringent laws, like the Digital Markets Act (DMA). This landmark piece of EU legislation aims to make the digital economy more fair. Upon coming into force last year, the act labeled certain large tech companies, including Apple and Google, as “gatekeepers” that are subject to additional scrutiny.

Europe’s more aggressive regulation of Big Tech is why iPhone users on the continent can install apps from third-party app markets while the rest of us are stuck with the Apple App Store. As for Google, the European Commission has paid special attention to search, Android, and Chrome, all of which dominate their respective markets.

Apple’s mobile platform plays second fiddle to Android in Europe, but it’s large enough to make the company subject to the DMA. The EU has now decreed that Apple is not doing enough to support interoperability on its platform. As a result, it will be required to make several notable changes. Apple will have to provide other companies and developers with improved access to iOS for devices like smartwatches, headphones, and TVs. This could include integration with notifications, faster data transfers, and streamlined setup.

The commission is also forcing Apple to give third parties additional technical documentation, communication, and notification about upcoming features. The EU believes this change will encourage more companies to build products that integrate with the iPhone, giving everyone more options aside from Apple’s.

Regulators say both sets of measures are the result of a public comment period that began late last year. We’ve asked Apple for comment on this development but have not heard back as of publication time. Apple is required to make these changes, and failing to do so could lead to fines. Google, however, has already reached that point.

Gemini gets new coding and writing tools, plus AI-generated “podcasts”

On the heels of its release of new Gemini models last week, Google has announced a pair of new features for its flagship AI product. Starting today, Gemini has a new Canvas feature that lets you draft, edit, and refine documents or code. Gemini is also getting Audio Overviews, a neat capability that first appeared in the company’s NotebookLM product, but it’s getting even more useful as part of Gemini.

Canvas is similar (confusingly) to the OpenAI product of the same name. Canvas is available in the Gemini prompt bar on the web and mobile app. Simply upload a document and tell Gemini what you need to do with it. In Google’s example, the user asks for a speech based on a PDF containing class notes. And just like that, Gemini spits out a document.

Canvas lets you refine the AI-generated documents right inside Gemini. The writing tools found across the Google ecosystem, with options like suggested edits and different tones, are built into the Gemini-based editor. If you want to do more edits or collaborate with others, you can export the document to Google Docs with a single click.

Gemini Canvas with tic-tac-toe game

Credit: Google

Canvas is also adept at coding. Just ask, and Canvas can generate prototype web apps, Python scripts, HTML, and more. You can ask Gemini about the code, make alterations, and even preview your results in real time inside Gemini as you (or the AI) make changes.

Google inks $32 billion deal to buy security firm Wiz even as DOJ seeks breakup

“While a tough regulatory climate in 2024 had hampered such large-scale deals, Wall Street is optimistic that a shift in antitrust policies under US President Donald Trump could reignite dealmaking momentum,” Reuters wrote today.

Google reportedly agreed to a $3.2 billion breakup fee that would be paid to Wiz if the deal collapses. A Financial Times report said the breakup fee is unusually large as it represents 10 percent of the total deal value, instead of the typical 2 or 3 percent. The large breakup fee “shows how technology companies are still bracing themselves for pushback from antitrust regulators, even under President Donald Trump and his new Federal Trade Commission chair Andrew Ferguson,” the article said.

Wiz co-founder and CEO Assaf Rappaport wrote today that although the plan is for Wiz to become part of Google Cloud, the companies both believe that “Wiz needs to remain a multicloud platform… We will still work closely with our great partners at AWS, Azure, Oracle, and across the entire industry.”

Google Cloud CEO Thomas Kurian wrote that Wiz’s platform would fill a gap in Google’s security offerings. Google products already “help customers detect and respond to attackers through both SaaS-based services and cybersecurity consulting,” but Wiz is different because it “connects to all major clouds and code environments to help prevent incidents from happening in the first place,” he wrote.

“Wiz’s solution rapidly scans the customer’s environment, constructing a comprehensive graph of code, cloud resources, services, and applications—along with the connections between them,” Kurian wrote. “It identifies potential attack paths, prioritizes the most critical risks based on their impact, and empowers enterprise developers to secure applications before deployment. It also helps security teams collaborate with developers to remediate risks in code or detect and block ongoing attacks.”

Farewell Photoshop? Google’s new AI lets you edit images by asking.


New AI allows no-skill photo editing, including adding objects and removing watermarks.

A collection of images either generated or modified by Gemini 2.0 Flash (Image Generation) Experimental. Credit: Google / Ars Technica

There’s a new Google AI model in town, and it can generate or edit images as easily as it can create text—as part of its chatbot conversation. The results aren’t perfect, but it’s quite possible everyone in the near future will be able to manipulate images this way.

Last Wednesday, Google expanded access to Gemini 2.0 Flash’s native image-generation capabilities, making the experimental feature available to anyone using Google AI Studio. Previously limited to testers since December, the multimodal technology integrates both native text and image processing capabilities into one AI model.

The new model, titled “Gemini 2.0 Flash (Image Generation) Experimental,” flew somewhat under the radar last week, but it has been garnering more attention over the past few days due to its ability to remove watermarks from images, albeit with artifacts and a reduction in image quality.

That’s not the only trick. Gemini 2.0 Flash can add objects, remove objects, modify scenery, change lighting, attempt to change image angles, zoom in or out, and perform other transformations—all to varying levels of success depending on the subject matter, style, and image in question.

To pull it off, Google trained Gemini 2.0 on a large dataset of images (converted into tokens) and text. The model’s “knowledge” about images occupies the same neural network space as its knowledge about world concepts from text sources, so it can directly output image tokens that get converted back into images and fed to the user.
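As a rough illustration of what directly outputting image tokens looks like from the developer side, here is a minimal sketch of requesting mixed text-and-image output through the google-genai Python SDK used with Google AI Studio. The model name, prompt, and response-handling details are assumptions for illustration, not confirmed specifics.

```python
# Minimal sketch: asking Gemini 2.0 Flash (experimental) for native image output.
# Assumes the google-genai Python SDK; model ID and response layout are illustrative.
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder, not a real key

response = client.models.generate_content(
    model="gemini-2.0-flash-exp",  # assumed experimental model ID
    contents="Generate an image of a lighthouse at sunset.",
    config=types.GenerateContentConfig(
        response_modalities=["TEXT", "IMAGE"],  # request both text and image parts
    ),
)

# The reply interleaves text parts and inline image data produced by the same model.
for part in response.candidates[0].content.parts:
    if part.text:
        print(part.text)
    elif part.inline_data:
        with open("output.png", "wb") as f:
            f.write(part.inline_data.data)
```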

Adding a water-skiing barbarian to a photograph with Gemini 2.0 Flash. Credit: Google / Benj Edwards

Incorporating image generation into an AI chat isn’t itself new—OpenAI integrated its image-generator DALL-E 3 into ChatGPT last September, and other tech companies like xAI followed suit. But until now, every one of those AI chat assistants called on a separate diffusion-based AI model (which uses a different synthesis principle than LLMs) to generate images, which were then returned to the user within the chat interface. In this case, Gemini 2.0 Flash is both the large language model (LLM) and AI image generator rolled into one system.

Interestingly, OpenAI’s GPT-4o is capable of native image output as well (and OpenAI President Greg Brockman teased the feature at one point on X last year), but that company has yet to release true multimodal image output capability. One possible reason is that true multimodal image output is very computationally expensive, since each image, whether inputted or generated, is composed of tokens that become part of the context run through the model again and again with each successive prompt. And given the compute needs and size of the training data required to create a truly visually comprehensive multimodal model, the output quality of the images isn’t necessarily as good as diffusion models just yet.

Creating another angle of a person with Gemini 2.0 Flash. Credit: Google / Benj Edwards

Another reason OpenAI has held back may be “safety”-related: In a similar way to how multimodal models trained on audio can absorb a short clip of a sample person’s voice and then imitate it flawlessly (this is how ChatGPT’s Advanced Voice Mode works, with a clip of a voice actor it is authorized to imitate), multimodal image output models are capable of faking media reality in a relatively effortless and convincing way, given proper training data and compute behind it. With a good enough multimodal model, potentially life-wrecking deepfakes and photo manipulations could become even more trivial to produce than they are now.

Putting it to the test

So, what exactly can Gemini 2.0 Flash do? Notably, its support for conversational image editing allows users to iteratively refine images through natural language dialogue across multiple successive prompts. You can talk to it and tell it what you want to add, remove, or change. It’s imperfect, but it’s the beginning of a new type of native image editing capability in the tech world.
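A minimal sketch of what that iterative, multi-turn editing flow could look like with the google-genai Python SDK’s chat interface follows. The model ID, the use of a Pillow image as input, and the small image-saving helper are all assumptions for illustration rather than a confirmed recipe.

```python
# Minimal sketch: iteratively editing one photo across several chat turns.
# Assumes the google-genai Python SDK and Pillow; details are illustrative.
from google import genai
from google.genai import types
from PIL import Image

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder, not a real key
chat = client.chats.create(
    model="gemini-2.0-flash-exp",  # assumed experimental model ID
    config=types.GenerateContentConfig(response_modalities=["TEXT", "IMAGE"]),
)

def save_image_parts(response, filename):
    # Hypothetical helper: write any inline image data from the reply to disk.
    for part in response.candidates[0].content.parts:
        if part.inline_data:
            with open(filename, "wb") as f:
                f.write(part.inline_data.data)

photo = Image.open("airplane_window.jpg")  # illustrative input image
save_image_parts(chat.send_message([photo, "Add a UFO to this photo."]), "step1.png")
save_image_parts(chat.send_message("Now add a Sasquatch on the wing."), "step2.png")
save_image_parts(chat.send_message("Remove the UFO but keep the Sasquatch."), "step3.png")
```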

We gave Gemini 2.0 Flash a battery of informal AI image-editing tests, and you’ll see the results below. For example, we removed a rabbit from an image of a grassy yard. We also removed a chicken from a messy garage. Gemini fills in the background with its best guess. No need for a clone brush—watch out, Photoshop!

We also tried adding synthesized objects to images. Always wary of the collapse of media reality, a scenario called the “cultural singularity,” we added a UFO to a photo the author took from an airplane window. Then we tried adding a Sasquatch and a ghost. The results were unrealistic, but this model was also trained on a limited image dataset (more on that below).

Adding a UFO to a photograph with Gemini 2.0 Flash. Google / Benj Edwards

We then added a video game character to a photo of an Atari 800 screen (Wizard of Wor), resulting in perhaps the most realistic image synthesis result in the set. You might not see it here, but Gemini added realistic CRT scanlines that matched the monitor’s characteristics pretty well.

Adding a monster to an Atari video game with Gemini 2.0 Flash. Credit: Google / Benj Edwards

Gemini can also warp an image in novel ways, like “zooming out” of an image into a fictional setting or giving an EGA-palette character a body, then sticking him into an adventure game.

“Zooming out” on an image with Gemini 2.0 Flash. Google / Benj Edwards

And yes, you can remove watermarks. We tried removing a watermark from a Getty Images image, and it worked, although the resulting image is nowhere near the resolution or detail quality of the original. Ultimately, if your brain can picture what an image is like without a watermark, so can an AI model. It fills in the watermark space with the most plausible result based on its training data.

Removing a watermark with Gemini 2.0 Flash. Credit: Nomadsoul1 via Getty Images

And finally, we know you’ve likely missed seeing barbarians beside TV sets (as per tradition), so we gave that a shot. Originally, Gemini didn’t add a CRT TV set to the barbarian image, so we asked for one.

Adding a TV set to a barbarian image with Gemini 2.0 Flash. Credit: Google / Benj Edwards

Then we set the TV on fire.

Setting the TV set on fire with Gemini 2.0 Flash. Credit: Google / Benj Edwards

All in all, it doesn’t produce images of pristine quality or detail, but we literally did no editing work on these images other than typing requests. Adobe Photoshop currently lets users manipulate images using AI synthesis based on written prompts with “Generative Fill,” but it’s not quite as natural as this. We could see Adobe adding a more conversational AI image-editing flow like this one in the future.

Multimodal output opens up new possibilities

Having true multimodal output opens up interesting new possibilities in chatbots. For example, Gemini 2.0 Flash can play interactive graphical games or generate stories with consistent illustrations, maintaining character and setting continuity throughout multiple images. It’s far from perfect, but character consistency is a new capability in AI assistants. We tried it out and it was pretty wild—especially when it generated a view of a photo we provided from another angle.

Creating a multi-image story with Gemini 2.0 Flash, part 1. Google / Benj Edwards

Text rendering represents another potential strength of the model. Google claims that internal benchmarks show Gemini 2.0 Flash performs better than “leading competitive models” when generating images containing text, making it potentially suitable for creating content with integrated text. From our experience, the results weren’t that exciting, but they were legible.

An example of in-image text rendering generated with Gemini 2.0 Flash. Credit: Google / Ars Technica

Despite Gemini 2.0 Flash’s shortcomings so far, the emergence of true multimodal image output feels like a notable moment in AI history because of what it suggests if the technology continues to improve. If you imagine a future, say 10 years from now, where a sufficiently complex AI model could generate any type of media in real time—text, images, audio, video, 3D graphics, 3D-printed physical objects, and interactive experiences—you basically have a holodeck, but without the matter replication.

Coming back to reality, it’s still “early days” for multimodal image output, and Google recognizes that. Recall that Flash 2.0 is intended to be a smaller AI model that is faster and cheaper to run, so it hasn’t absorbed the entire breadth of the Internet. All that information takes a lot of space in terms of parameter count, and more parameters means more compute. Instead, Google trained Gemini 2.0 Flash by feeding it a curated dataset that also likely included targeted synthetic data. As a result, the model does not “know” everything visual about the world, and Google itself says the training data is “broad and general, not absolute or complete.”

That’s just a fancy way of saying that the image output quality isn’t perfect—yet. But there is plenty of room for improvement in the future to incorporate more visual “knowledge” as training techniques advance and compute drops in cost. If the process becomes anything like we’ve seen with diffusion-based AI image generators like Stable Diffusion, Midjourney, and Flux, multimodal image output quality may improve rapidly over a short period of time. Get ready for a completely fluid media reality.

Photo of Benj Edwards

Benj Edwards is Ars Technica’s Senior AI Reporter and founder of the site’s dedicated AI beat in 2022. He’s also a tech historian with almost two decades of experience. In his free time, he writes and records music, collects vintage computers, and enjoys nature. He lives in Raleigh, NC.

RCS texting updates will bring end-to-end encryption to green bubble chats

One of the best mostly invisible updates in iOS 18 was Apple’s decision to finally implement the Rich Communications Services (RCS) communication protocol, something that is slowly helping to fix the generally miserable experience of texting non-iPhone users with an iPhone. The initial iOS 18 update brought RCS support to most major carriers in the US, and the upcoming iOS 18.4 update is turning it on for a bunch of smaller prepaid carriers like Google Fi and Mint Mobile.

Now that Apple is on board, iPhones and their users can also benefit from continued improvements to the RCS standard. And one major update was announced today: RCS will now support end-to-end encryption using the Messaging Layer Security (MLS) protocol, a standard finalized by the Internet Engineering Task Force in 2023.

“RCS will be the first large-scale messaging service to support interoperable E2EE between client implementations from different providers,” writes GSMA Technical Director Tom Van Pelt in the post announcing the updates. “Together with other unique security features such as SIM-based authentication, E2EE will provide RCS users with the highest level of privacy and security for stronger protection from scams, fraud and other security and privacy threats.”

Google joins OpenAI in pushing feds to codify AI training as fair use

Google’s position on AI regulation: Trust us, bro

If there was any doubt about Google’s commitment to move fast and break things, its new policy position should put that to rest. “For too long, AI policymaking has paid disproportionate attention to the risks,” the document says.

Google urges the US to invest in AI not only with money but with business-friendly legislation. The company joins the growing chorus of AI firms calling for federal legislation that clarifies how they can operate. It points to the difficulty of complying with a “patchwork” of state-level laws that impose restrictions on AI development and use. If you want to know what keeps Google’s policy wonks up at night, look no further than the vetoed SB-1047 bill in California, which would have enforced AI safety measures.


According to Google, a national AI framework that supports innovation is necessary to push the boundaries of what artificial intelligence can do. Taking a page from the gun lobby, Google opposes attempts to hold the creators of AI liable for the way those models are used. Generative AI systems are non-deterministic, making it impossible to fully predict their output. Google wants clearly defined responsibilities for AI developers, deployers, and end users—it would, however, clearly prefer most of those responsibilities fall on others. “In many instances, the original developer of an AI model has little to no visibility or control over how it is being used by a deployer and may not interact with end users,” the company says.

There are efforts underway in some countries that would implement stringent regulations that force companies like Google to make their tools more transparent. For example, the EU’s AI Act would require AI firms to publish an overview of training data and possible risks associated with their products. Google believes this would force the disclosure of trade secrets that would allow foreign adversaries to more easily duplicate its work, mirroring concerns that OpenAI expressed in its policy proposal.

Google wants the government to push back on these efforts at the diplomatic level. The company would like to be able to release AI products around the world, and the best way to ensure it has that option is to promote light-touch regulation that “reflects US values and approaches.” That is, Google’s values and approaches.

End of Life: Gemini will completely replace Google Assistant later this year

Not all devices can simply download an updated app—after almost a decade, Assistant is baked into many Google products. The company says Google-powered cars, watches, headphones, and other devices that use Assistant will receive updates that transition them to Gemini. It’s unclear if all Assistant-powered gadgets will be part of the migration. Most of these devices connect to your phone, so the update should be relatively straightforward, even for accessories that launched early in the Assistant era.

There are also plenty of standalone devices that run Assistant, like TVs and smart speakers. Google says it’s working on updated Gemini experiences for those devices. For example, there’s a Gemini preview program for select Google Nest speakers. It’s unclear if all these devices will get updates. Google says there will be more details on this in the coming months.

Meanwhile, Gemini still has some ground to make up. There are basic features that work fine in Assistant, like setting timers and alarms, that can go sideways with Gemini. On the other hand, Assistant had its fair share of problems and didn’t exactly win a lot of fans. Regardless, this transition could be fraught with danger for Google as it upends how people interact with their devices.

AI search engines cite incorrect sources at an alarming 60% rate, study says

A new study from Columbia Journalism Review’s Tow Center for Digital Journalism finds serious accuracy issues with generative AI models used for news searches. The research tested eight AI-driven search tools equipped with live search functionality and discovered that the AI models incorrectly answered more than 60 percent of queries about news sources.

Researchers Klaudia Jaźwińska and Aisvarya Chandrasekar noted in their report that roughly 1 in 4 Americans now use AI models as alternatives to traditional search engines. This raises serious concerns about reliability, given the substantial error rate uncovered in the study.

Error rates varied notably among the tested platforms. Perplexity provided incorrect information in 37 percent of the queries tested, whereas ChatGPT Search incorrectly identified 67 percent (134 out of 200) of articles queried. Grok 3 demonstrated the highest error rate, at 94 percent.

A graph from CJR shows “confidently wrong” search results. Credit: CJR

For the tests, researchers fed direct excerpts from actual news articles to the AI models, then asked each model to identify the article’s headline, original publisher, publication date, and URL. They ran 1,600 queries across the eight different generative search tools.
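To make that setup concrete, here is a rough, hypothetical sketch of what such a citation test loop could look like. The `ask_search_tool` function is a stand-in for whatever interface each of the eight tools exposes, and none of this reflects the researchers’ actual code or scoring rules.

```python
# Hypothetical sketch of the citation test described above: feed an article excerpt
# to an AI search tool and check whether it names the right headline, publisher,
# date, and URL. ask_search_tool() is a stand-in, not a real API.
from dataclasses import dataclass

@dataclass
class Article:
    headline: str
    publisher: str
    pub_date: str
    url: str
    excerpt: str

PROMPT = (
    "Identify the article this excerpt comes from. "
    "Give its headline, original publisher, publication date, and URL.\n\n{excerpt}"
)

def ask_search_tool(tool_name: str, prompt: str) -> str:
    raise NotImplementedError("stand-in for each tool's own query interface")

def score(tool_name: str, articles: list[Article]) -> float:
    correct = 0
    for art in articles:
        answer = ask_search_tool(tool_name, PROMPT.format(excerpt=art.excerpt))
        # Crude check: count the answer correct only if every attribute appears in it.
        fields = (art.headline, art.publisher, art.pub_date, art.url)
        correct += all(field.lower() in answer.lower() for field in fields)
    return correct / len(articles)
```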

The study highlighted a common trend among these AI models: rather than declining to respond when they lacked reliable information, the models frequently provided confabulations—plausible-sounding incorrect or speculative answers. The researchers emphasized that this behavior was consistent across all tested models, not limited to just one tool.

Surprisingly, premium paid versions of these AI search tools fared even worse in certain respects. Perplexity Pro ($20/month) and Grok 3’s premium service ($40/month) confidently delivered incorrect responses more often than their free counterparts. Though these premium models correctly answered a higher number of prompts, their reluctance to decline uncertain responses drove higher overall error rates.

Issues with citations and publisher control

The CJR researchers also uncovered evidence suggesting some AI tools ignored Robot Exclusion Protocol settings, which publishers use to prevent unauthorized access. For example, Perplexity’s free version correctly identified all 10 excerpts from paywalled National Geographic content, despite National Geographic explicitly disallowing Perplexity’s web crawlers.
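For reference, a publisher’s opt-out under the Robots Exclusion Protocol is just an entry in robots.txt like the sketch below. The “PerplexityBot” user-agent string is an assumption about how Perplexity’s crawler identifies itself, and compliance is entirely voluntary, which is the crux of the finding above.

```
# Hypothetical robots.txt entry disallowing a specific AI crawler site-wide.
User-agent: PerplexityBot
Disallow: /

# Other crawlers remain subject to the site's usual rules.
User-agent: *
Disallow: /private/
```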
