Tech

Google DeepMind unveils a new video model to rival Sora

Google DeepMind, Google’s flagship AI research lab, wants to beat OpenAI at the video generation game — and it might just, at least for a little while.

On Monday, DeepMind announced Veo 2, a next-gen video-generating AI and the successor to Veo, which powers a growing number of products across Google’s portfolio. Veo 2 can create two-minute-plus clips in resolutions up to 4k (4096 x 2160 pixels).

Notably, that’s 4x the resolution — and over 6x the duration — OpenAI’s Sora can achieve.

It’s a theoretical advantage for now, granted. In Google’s experimental video creation tool, VideoFX, where Veo 2 is now exclusively available, videos are capped at 720p and eight seconds in length. (Sora can produce up to 1080p, 20-second-long clips.)

Google VideoFX — Veo 2 in VideoFX.Image Credits:Google

VideoFX is behind a waitlist, but Google says it’s expanding the number of users who can access it this week.

Eli Collins, VP of product at DeepMind, also told TechCrunch that Google will make Veo 2 available via its Vertex AI developer platform “as the model becomes ready for use at scale.”

“Over the coming months, we’ll continue to iterate based on feedback from users,” Collins said, “and [we’ll] look to integrate Veo 2’s updated capabilities into compelling use cases across the Google ecosystem … [W]e expect to share more updates next year.”

More controllable

Like Veo, Veo 2 can generate videos given a text prompt (e.g. “A car racing down a freeway”) or text and a reference image.

So what’s new in Veo 2? Well, DeepMind says the model, which can generate clips in a range of styles, has an improved “understanding” of physics and camera controls, and produces “clearer” footage.

By clearer, DeepMind means textures and images in clips are sharper — especially in scenes with a lot of movement. As for the improved camera controls, they enable Veo 2 to position the virtual “camera” in the videos it generates more precisely, and to move that camera to capture objects and people from different angles.

DeepMind also claims that Veo 2 can more realistically model motion, fluid dynamics (like coffee being poured into a mug), and properties of light (such as shadows and reflections). That includes different lenses and cinematic effects, DeepMind says, as well as “nuanced” human expression.

Google Veo 2 sample. Note that the compression artifacts were introduced in the clip’s conversion to a GIF. **Image Credits:**Google

DeepMind shared a few cherry-picked samples from Veo 2 with TechCrunch last week. For AI-generated videos, they looked pretty good — exceptionally good, even. Veo 2 seems to have a strong grasp of refraction and tricky liquids, like maple syrup, and a knack for emulating Pixar-style animation.

But despite DeepMind’s insistence that the model is less likely to hallucinate elements like extra fingers or “unexpected objects,” Veo 2 can’t quite clear the uncanny valley.

Note the lifeless eyes in this cartoon dog-like creature:

And the weirdly slippery road in this footage — plus the pedestrians in the background blending into each other and the buildings with physically impossible facades:

Collins admitted that there’s work to be done.

“Coherence and consistency are areas for growth,” he said. “Veo can consistently adhere to a prompt for a couple minutes, but [it can’t] adhere to complex prompts over long horizons. Similarly, character consistency can be a challenge. There’s also room to improve in generating intricate details, fast and complex motions, and continuing to push the boundaries of realism.”

DeepMind’s continuing to work with artists and producers to refine its video generation models and tooling, added Collins.

“We started working with creatives like Donald Glover, the Weeknd, d4vd, and others since the beginning of our Veo development to really understand their creative process and how technology could help bring their vision to life,” Collins said. “Our work with creators on Veo 1 informed the development of Veo 2, and we look forward to working with trusted testers and creators to get feedback on this new model.”

Safety and training

Veo 2 was trained on lots of videos. That’s generally how AI models work: Provided with example after example of some form of data, the models pick up on patterns in the data that allow them to generate new data.

DeepMind won’t say exactly where it scraped the videos to train Veo 2, but YouTube is one possible source; Google owns YouTube, and DeepMind previously told TechCrunch that Google models like Veo “may” be trained on some YouTube content.

“Veo has been trained on high-quality video-description pairings,” Collins said. “Video-description pairs are a video and associated description of what happens in that video.”

While DeepMind, through Google, hosts tools to let webmasters block the lab’s bots from extracting training data from their websites, DeepMind doesn’t offer a mechanism to let creators remove works from its existing training sets. The lab and its parent company maintain that training models using public data is fair use, meaning that DeepMind believes it isn’t obligated to ask permission from data owners.

Not all creatives agree — particularly in light of studies estimating that tens of thousands of film and TV jobs could be disrupted by AI in the coming years. Several AI companies, including the eponymous startup behind the popular AI art app Midjourney, are in the crosshairs of lawsuits accusing them of infringing on artists’ rights by training on content without consent.

“We’re committed to working collaboratively with creators and our partners to achieve common goals,” Collins said. “We continue to work with the creative community and people across the wider industry, gathering insights and listening to feedback, including those who use VideoFX.”

Thanks to the way today’s generative models behave when trained, they carry certain risks, like regurgitation, which refers to when a model generates a mirror copy of training data. DeepMind’s solution is prompt-level filters, including for violent, graphic, and explicit content.

Google’s indemnity policy, which provides a defense for certain customers against allegations of copyright infringement stemming from the use of its products, won’t apply to Veo 2 until it’s generally available, Collins said.

To mitigate the risk of deepfakes, DeepMind says it’s using its proprietary watermarking technology, SynthID, to embed invisible markers into frames Veo 2 generates. However, like all watermarking tech, SynthID isn’t foolproof.

Imagen upgrades

In addition to Veo 2, Google DeepMind this morning announced upgrades to Imagen 3, its commercial image generation model.

A new version of Imagen 3 is rolling out to users of ImageFX, Google’s image-generating tool, beginning today. It can create “brighter, better-composed” images and photos in styles like photorealism, impressionism, and anime, per DeepMind.

“This upgrade [to Imagen 3] also follows prompts more faithfully, and renders richer details and textures,” DeepMind wrote in a blog post provided to TechCrunch.

Google ImageFX — **Image Credits:**Google

Rolling out alongside the model are UI updates to ImageFX. Now, when users type prompts, key terms in those prompts will become “chiplets” with a drop-down menu of suggested, related words. Users can use the chips to iterate what they’ve written, or select from a row of auto-generated descriptors beneath the prompt.

source

Tech

Volkswagen’s cheapest EV ever is the first to use Rivian software

Volkswagen’s ultra-cheap EV called the ID EVERY1 — a small four-door hatchback revealed Wednesday — will be the first to roll out with software and architecture from Rivian, according to a source familiar with the new model.

The EV is expected to go into production in 2027 with a starting price of 20,000 euros ($21,500). A second EV called the ID.2all, which will be priced in the 25,000 euro price category, will be available in 2026. Both vehicles are part of the automaker’s new of category electric urban front-wheel drive cars that are being developing under the so-called “Brand Group Core” that makes up the volume brands in the VW Group. And both vehicles are for the European market.

The EVERY1 will be the first to ship with Rivian’s vehicle architecture and software as part of a $5.8 billion joint venture struck last year between the German automaker and U.S. EV maker. The ID.2all is based on the E3 1.1 architecture and software developed by VW’s software unit Cariad.

VW didn’t name Rivian in its reveal Wednesday, although there were numerous nods to next-generation software. Kai Grünitz, member of the Volkswagen Brand Board of Management responsible for Technical Development, noted it would be the first model in the entire VW Group to use a “fundamentally new, particularly powerful software architecture.”

“This means the future entry-level Volkswagen can be equipped with new functions throughout its entire life cycle,” he said. “Even after purchase of a new car, the small Volkswagen can still be individually adapted to customer needs.”

Sources who didn’t want to be named because they were not authorized to speak publicly, confirmed to TechCrunch that Rivian’s software will be in the ID EVERY1 EV. TechCrunch has reached out to Rivian and VW and will update the article if the companies respond.

The new joint venture provides Rivian with a needed influx of cash and the opportunity to diversify its business. Meanwhile, VW Group gains a next-generation electrical architecture and software for EVs that will help it better compete. Both companies have said that the joint venture, called Rivian and Volkswagen Group Technologies, will reduce development costs and help scale new technologies more quickly.

The joint venture is a 50-50 partnership with co-CEOs. Rivian’s head of software, Wassym Bensaid, and Volkswagen Group’s chief technical engineer, Carsten Helbing, will lead the joint venture. The team will be based initially in Palo Alto, California. Three other sites are in development in North America and Europe, the companies have previously said.

“The ID. EVERY1 represents the last piece of the puzzle on our way to the widest model selection in the volume segment,” Thomas Schäfer, CEO of the Volkswagen Passenger Cars brand and Head of the Brand Group Core, said in a statement. “We will then offer every customer the right car with the right drive system–including affordable all-electric entry-level mobility. Our goal is to be the world’s technologically leading high-volume manufacturer by 2030. And as a brand for everyone–just as you would expect from Volkswagen.”

The Volkswagen ID EVERY1 is just a concept for now — and with only a few details attached to the unveiling. The concept vehicle reaches a top speed of 130 km/h (80 miles per hour) and is powered by a newly developed electric drive motor with 70 kW, according to Volkswagen. The German automaker said the range on the EVERY1 will be at least 250 kilometers (150 miles). The vehicle is small but larger than VW’s former UP! vehicle. The company said it will have enough space for four people and a luggage compartment volume of 305 liters.

source

Tech

The hottest AI models, what they do, and how to use them

AI models are being cranked out at a dizzying pace, by everyone from Big Tech companies like Google to startups like OpenAI and Anthropic. Keeping track of the latest ones can be overwhelming.

Adding to the confusion is that AI models are often promoted based on industry benchmarks. But these technical metrics often reveal little about how real people and companies actually use them.

To cut through the noise, TechCrunch has compiled an overview of the most advanced AI models released since 2024, with details on how to use them and what they’re best for. We’ll keep this list updated with the latest launches, too.

There are literally over a million AI models out there: Hugging Face, for example, hosts over 1.4 million. So this list might miss some models that perform better, in one way or another.

AI models released in 2025

Cohere’s Aya Vision

Cohere released a multimodal model called Aya Vision that it claims is best in class at doing things like captioning images and answering questions about photos. It also excels in languages other than English, unlike other models, Cohere claims. It is available for free on WhatsApp.

OpenAI’s GPT 4.5 ‘Orion’

OpenAI calls Orion their largest model to date, touting its strong “world knowledge” and “emotional intelligence.” However, it underperforms on certain benchmarks compared to newer reasoning models. Orion is available to subscribers of OpenAI’s $200 a month plan.

Claude Sonnet 3.7

Anthropic says this is the industry’s first ‘hybrid’ reasoning model, because it can both fire off quick answers and really think things through when needed. It also gives users control over how long the model can think for, per Anthropic. Sonnet 3.7 is available to all Claude users, but heavier users will need a $20 a month Pro plan.

xAI’s Grok 3

Grok 3 is the latest flagship model from Elon Musk-founded startup xAI. It’s claimed to outperform other leading models on math, science, and coding. The model requires X Premium (which is $50 a month.) After one study found Grok 2 leaned left, Musk pledged to shift Grok more “politically neutral” but it’s not yet clear if that’s been achieved.

OpenAI o3-mini

This is OpenAI’s latest reasoning model and is optimized for STEM-related tasks like coding, math, and science. It’s not OpenAI’s most powerful model but because it’s smaller, the company says it’s significantly lower cost. It is available for free but requires a subscription for heavy users.

OpenAI Deep Research

OpenAI’s Deep Research is designed for doing in-depth research on a topic with clear citations. This service is only available with ChatGPT’s $200 per month Pro subscription. OpenAI recommends it for everything from science to shopping research, but beware that hallucinations remain a problem for AI.

Mistral Le Chat

Mistral has launched app versions of Le Chat, a multimodal AI personal assistant. Mistral claims Le Chat responds faster than any other chatbot. It also has a paid version with up-to-date journalism from the AFP. Tests from Le Monde found Le Chat’s performance impressive, although it made more errors than ChatGPT.

OpenAI Operator

OpenAI’s Operator is meant to be a personal intern that can do things independently, like help you buy groceries. It requires a $200 a month ChatGPT Pro subscription. AI agents hold a lot of promise, but they’re still experimental: a Washington Post reviewer says Operator decided on its own to order a dozen eggs for $31, paid with the reviewer’s credit card.

Google Gemini 2.0 Pro Experimental

Google Gemini’s much-awaited flagship model says it excels at coding and understanding general knowledge. It also has a super-long context window of 2 million tokens, helping users who need to quickly process massive chunks of text. The service requires (at minimum) a Google One AI Premium subscription of $19.99 a month.

AI models released in 2024

DeepSeek R1

This Chinese AI model took Silicon Valley by storm. DeepSeek’s R1 performs well on coding and math, while its open source nature means anyone can run it locally. Plus, it’s free. However, R1 integrates Chinese government censorship and faces rising bans for potentially sending user data back to China.

Gemini Deep Research

Deep Research summarizes Google’s search results in a simple and well-cited document. The service is helpful for students and anyone else who needs a quick research summary. However, its quality isn’t nearly as good as an actual peer-reviewed paper. Deep Research requires a $19.99 Google One AI Premium subscription.

Meta Llama 3.3 70B

This is the newest and most advanced version of Meta’s open source Llama AI models. Meta has touted this version as its cheapest and most efficient yet, especially for math, general knowledge, and instruction following. It is free and open source.

OpenAI Sora

Sora is a model that creates realistic videos based on text. While it can generate entire scenes rather than just clips, OpenAI admits that it often generates “unrealistic physics.” It’s currently only available on paid versions of ChatGPT, starting with Plus, which is $20 a month.

Alibaba Qwen QwQ-32B-Preview

This model is one of the few to rival OpenAI’s o1 on certain industry benchmarks, excelling in math and coding. Ironically for a “reasoning model,” it has “room for improvement in common sense reasoning,” Alibaba says. It also incorporates Chinese government censorship, TechCrunch testing shows. It’s free and open source.

Anthropic’s Computer Use

Claude’s Computer Use is meant to take control of your computer to complete tasks like coding or booking a plane ticket, making it a predecessor of OpenAI’s Operator. Computer use, however, remains in beta. Pricing is via API: $0.80 per million tokens of input and $4 per million tokens of output.

x.AI’s Grok 2

Elon Musk’s AI company, x.AI, has launched an enhanced version of its flagship Grok 2 chatbot it claims is “three times faster.” Free users are limited to 10 questions every two hours on Grok, while subscribers to X’s Premium and Premium+ plans enjoy higher usage limits. x.AI also launched an image generator, Aurora, that produces highly photorealistic images, including some graphic or violent content.

OpenAI o1

OpenAI’s o1 family is meant to produce better answers by “thinking” through responses through a hidden reasoning feature. The model excels at coding, math, and safety, OpenAI claims, but has issues deceiving humans, too. Using o1 requires subscribing to ChatGPT Plus, which is $20 a month.

Anthropic’s Claude Sonnet 3.5

Claude Sonnet 3.5 is a model Anthropic claims as being best in class. It’s become known for its coding capabilities and is considered a tech insider’s chatbot of choice. The model can be accessed for free on Claude although heavy users will need a $20 monthly Pro subscription. While it can understand images, it can’t generate them.

OpenAI GPT 4o-mini

OpenAI has touted GPT 4o-mini as its most affordable and fastest model yet thanks to its small size. It’s meant to enable a broad range of tasks like powering customer service chatbots. The model is available on ChatGPT’s free tier. It’s better suited for high-volume simple tasks compared to more complex ones.

Cohere Command R+

Cohere’s Command R+ model excels at complex Retrieval-Augmented Generation (or RAG) applications for enterprises. That means it can find and cite specific pieces of information really well. (The inventor of RAG actually works at Cohere.) Still, RAG doesn’t fully solve AI’s hallucination problem.

source

Tech

Not all cancer patients need chemo. Ataraxis AI raised $20M to fix that.

Artificial intelligence is a big trend in cancer care, and it’s mostly focused detecting cancer at the earliest possible stage. That makes a lot of sense, given that cancer is less deadly the earlier it’s detected.

But fewer are asking another fundamental question: if someone does have cancer, is an aggressive treatment like chemotherapy necessary? That’s the problem Ataraxis AI is trying to solve.

The New York-based startup is focused on using AI to accurately predict not only if a patient has cancer, but also what their cancer outcome looks like in 5 to 10 years. If there’s only a small chance of the cancer coming back, chemo can be avoided altogether – saving a lot of money, while avoiding the treatment’s notorious side effects.

Ataraxis AI now plans to launch their first commercial test, for breast cancer, to U.S. oncologists in the coming months, its co-founder Jan Witowski tells TechCrunch. To bolster the launch and expand into other types of cancer, the startup has raised a $20.4 million Series A, it told TechCrunch exclusively.

The round was led by AIX Ventures with participation from Thiel Bio, Founders Fund, Floating Point, Bertelsmann, and existing investors Giant Ventures and Obvious Ventures. Ataraxis emerged from stealth last year with a $4 million seed round.

Ataraxis was co-founded by Witowski and Krzysztof Geras, an assistant professor at NYU’s medical school who focuses on AI.

Ataraxis’ tech is powered by an AI model that extracts information from high-resolution images of cancer cells. The model is trained on hundreds of millions of real images from thousands of patients, Witowski said. A recent study showed Ataraxis’ tech was 30% more accurate than the current standard of care for breast cancer, per Ataraxis.

Long term, Ataraxis has big ambitions. It wants its tests to impact at least half of new cancer cases by 2030. It also views itself as a frontier AI company that builds its own models, touting Meta’s chief AI scientist Yann LeCun as an AI advisor.

“I think at Ataraxis we are trying to build what is essentially an AI frontier lab, but for healthcare applications,” Witowski said. “Because so many of those problems require a very novel technology.”

The AI boom has led to a rush of fundraises for cancer care startups. Valar Labs raised $22 million to help patients figure out their treatment plan in May 2024, for example. There’s also a bevvy of AI-powered drug discovery firms in the cancer space, like Manas AI which raised $24.6 million in January 2025 and was co-founded by Reid Hoffman, the LinkedIn co-founder.

source

Daily Fact Hub

Google DeepMind unveils a new video model to rival Sora

More controllable

Safety and training

Imagen upgrades

You may like

Leave a Reply Cancel reply

Leave a Reply

Tech

Volkswagen’s cheapest EV ever is the first to use Rivian software

Tech

The hottest AI models, what they do, and how to use them

AI models released in 2025

Cohere’s Aya Vision

OpenAI’s GPT 4.5 ‘Orion’

Claude Sonnet 3.7

xAI’s Grok 3

OpenAI o3-mini

OpenAI Deep Research

Mistral Le Chat

OpenAI Operator

Google Gemini 2.0 Pro Experimental

AI models released in 2024

DeepSeek R1

Gemini Deep Research

Meta Llama 3.3 70B

OpenAI Sora

Alibaba Qwen QwQ-32B-Preview

Anthropic’s Computer Use

x.AI’s Grok 2

OpenAI o1

Anthropic’s Claude Sonnet 3.5

OpenAI GPT 4o-mini

Cohere Command R+

Tech

Not all cancer patients need chemo. Ataraxis AI raised $20M to fix that.

A'ja Wilson has no shortage of motivation after Aces' early exit in '24

Best Mothers Day gifts: Show mom some love

Jacob Wilson joins Aaron Judge in spotlight for Yankees-A's series

Florida's Auston Kim shares first-round lead at Blue Bay LPGA

Wordle today: Answer, hints for May 9, 2025

Best Mothers Day gifts: Show mom some love

Disney’s live-action Aladdin finally finds its stars

Meet Superman’s grandfather in new trailer for Krypton

New Season 8 Walking Dead trailer flashes forward in time

Leave a Reply
Cancel reply