Tech

The hottest AI models, what they do, and how to use them

AI models are being cranked out at a dizzying pace, by everyone from Big Tech companies like Google to startups like OpenAI and Anthropic. Keeping track of the latest ones can be overwhelming. 

Adding to the confusion is that AI models are often promoted based on industry benchmarks. But these technical metrics often reveal little about how real people and companies actually use them. 

To cut through the noise, TechCrunch has compiled an overview of the most advanced AI models released since 2024, with details on how to use them and what they’re best for. We’ll keep this list updated with the latest launches, too.

There are literally over a million AI models out there: Hugging Face, for example, hosts over 1.4 million. So this list might miss some models that perform better, in one way or another. 

AI models released in 2025

Cohere’s Aya Vision

Cohere released a multimodal model called Aya Vision that it claims is best in class at doing things like captioning images and answering questions about photos. It also excels in languages other than English, unlike other models, Cohere claims. It is available for free on WhatsApp.

OpenAI’s GPT 4.5 ‘Orion’

OpenAI calls Orion its largest model to date, touting its strong “world knowledge” and “emotional intelligence.” However, it underperforms on certain benchmarks compared to newer reasoning models. Orion is available to subscribers of OpenAI’s $200 a month plan.

Claude Sonnet 3.7

Anthropic says this is the industry’s first ‘hybrid’ reasoning model, because it can both fire off quick answers and really think things through when needed. It also gives users control over how long the model can think for, per Anthropic. Sonnet 3.7 is available to all Claude users, but heavier users will need a $20 a month Pro plan.

xAI’s Grok 3

Grok 3 is the latest flagship model from Elon Musk-founded startup xAI. It’s claimed to outperform other leading models on math, science, and coding. The model requires X Premium (which is $50 a month). After one study found Grok 2 leaned left, Musk pledged to shift Grok closer to “politically neutral,” but it’s not yet clear if that’s been achieved.

OpenAI o3-mini

This is OpenAI’s latest reasoning model and is optimized for STEM-related tasks like coding, math, and science. It’s not OpenAI’s most powerful model but because it’s smaller, the company says it’s significantly lower cost. It is available for free but requires a subscription for heavy users.

OpenAI Deep Research

OpenAI’s Deep Research is designed for doing in-depth research on a topic with clear citations. This service is only available with ChatGPT’s $200 per month Pro subscription. OpenAI recommends it for everything from science to shopping research, but beware that hallucinations remain a problem for AI.

Mistral Le Chat

Mistral has launched app versions of Le Chat, a multimodal AI personal assistant. Mistral claims Le Chat responds faster than any other chatbot. It also has a paid version with up-to-date journalism from the AFP. Tests from Le Monde found Le Chat’s performance impressive, although it made more errors than ChatGPT.

OpenAI Operator

OpenAI’s Operator is meant to be a personal intern that can do things independently, like help you buy groceries. It requires a $200 a month ChatGPT Pro subscription. AI agents hold a lot of promise, but they’re still experimental: a Washington Post reviewer says Operator decided on its own to order a dozen eggs for $31, paid with the reviewer’s credit card.

Google Gemini 2.0 Pro Experimental

Google says its much-awaited flagship Gemini model excels at coding and understanding general knowledge. It also has a super-long context window of 2 million tokens, helping users who need to quickly process massive chunks of text. The service requires (at minimum) a Google One AI Premium subscription of $19.99 a month.

AI models released in 2024

DeepSeek R1

This Chinese AI model took Silicon Valley by storm. DeepSeek’s R1 performs well on coding and math, while its open source nature means anyone can run it locally. Plus, it’s free. However, R1 integrates Chinese government censorship and faces rising bans for potentially sending user data back to China.

Gemini Deep Research

Deep Research summarizes Google’s search results in a simple and well-cited document. The service is helpful for students and anyone else who needs a quick research summary. However, its quality isn’t nearly as good as an actual peer-reviewed paper. Deep Research requires a $19.99 Google One AI Premium subscription.

Meta Llama 3.3 70B

This is the newest and most advanced version of Meta’s open source Llama AI models. Meta has touted this version as its cheapest and most efficient yet, especially for math, general knowledge, and instruction following. It is free and open source.

OpenAI Sora

Sora is a model that creates realistic videos based on text. While it can generate entire scenes rather than just clips, OpenAI admits that it often generates “unrealistic physics.” It’s currently only available on paid versions of ChatGPT, starting with Plus, which is $20 a month. 

Alibaba Qwen QwQ-32B-Preview

This model is one of the few to rival OpenAI’s o1 on certain industry benchmarks, excelling in math and coding. Ironically for a “reasoning model,” it has “room for improvement in common sense reasoning,” Alibaba says. It also incorporates Chinese government censorship, TechCrunch testing shows. It’s free and open source.

Anthropic’s Computer Use

Claude’s Computer Use is meant to take control of your computer to complete tasks like coding or booking a plane ticket, making it a predecessor of OpenAI’s Operator. Computer Use, however, remains in beta. Pricing is via API: $0.80 per million tokens of input and $4 per million tokens of output.
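To get a feel for what per-token pricing means in practice, here is a minimal sketch that turns the two rates quoted above into a cost estimate. The function name and the sample token counts are hypothetical, chosen only for illustration; actual usage and billing will vary.

```python
# Rates quoted in the article, in USD per million tokens.
INPUT_USD_PER_MILLION = 0.80
OUTPUT_USD_PER_MILLION = 4.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of a workload at the quoted per-million-token rates.

    Hypothetical helper for illustration; not part of any official SDK.
    """
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


# Assumed workload: 250,000 input tokens and 50,000 output tokens.
print(f"${estimate_cost_usd(250_000, 50_000):.2f}")  # prints $0.40
```

Because output tokens cost five times as much as input tokens at these rates, a task that generates a lot of text will cost noticeably more than one that mostly reads it.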

xAI’s Grok 2

Elon Musk’s AI company, xAI, has launched an enhanced version of its flagship Grok 2 chatbot that it claims is “three times faster.” Free users are limited to 10 questions every two hours on Grok, while subscribers to X’s Premium and Premium+ plans enjoy higher usage limits. xAI also launched an image generator, Aurora, that produces highly photorealistic images, including some graphic or violent content.

OpenAI o1

OpenAI’s o1 family is meant to produce better answers by “thinking” through responses using a hidden reasoning feature. The model excels at coding, math, and safety, OpenAI claims, but it has issues with deceiving humans, too. Using o1 requires subscribing to ChatGPT Plus, which is $20 a month.

Anthropic’s Claude Sonnet 3.5 

Claude Sonnet 3.5 is a model Anthropic claims is best in class. It’s become known for its coding capabilities and is considered a tech insider’s chatbot of choice. The model can be accessed for free on Claude, although heavy users will need a $20 monthly Pro subscription. While it can understand images, it can’t generate them.

OpenAI GPT 4o-mini

OpenAI has touted GPT 4o-mini as its most affordable and fastest model yet thanks to its small size. It’s meant to enable a broad range of tasks like powering customer service chatbots. The model is available on ChatGPT’s free tier. It’s better suited for high-volume simple tasks than for more complex ones.

Cohere Command R+

Cohere’s Command R+ model excels at complex Retrieval-Augmented Generation (or RAG) applications for enterprises. That means it can find and cite specific pieces of information really well. (The inventor of RAG actually works at Cohere.) Still, RAG doesn’t fully solve AI’s hallucination problem.

Tesla brings its robotaxi service to Dallas and Houston

Tesla is expanding its robotaxi service to Dallas and Houston, according to a social media post from the company.

The post says simply that “Robotaxi is now rolling out in Dallas & Houston 🤠” and includes a 14-second video showing Tesla vehicles driving without human monitors or drivers in the front seat.

The company now offers robotaxi service in three cities, all of them in Texas, after launching in Austin last year and starting to offer rides without safety drivers in January 2026. In a February filing, Tesla said that its Austin robotaxis have been involved in 14 crashes since launch.

It also offers a more limited ride service with human drivers in the San Francisco Bay Area.

Tesla may not be running many vehicles in either of these new markets yet, with crowdsourced data on the Robotaxi Tracker website only registering a single vehicle in each city (compared to 46 active vehicles logged in Austin).

Netflix plans to add a vertical video feed, use AI for recommendations

Netflix is going to launch a TikTok-like vertical video feed within its apps this month, and plans to use AI broadly for content creation and recommendations, the company said on Thursday.

Netflix has been testing a vertical video feed since last year. The short video feature could aid users with discovering video podcasts, along with the current slate of shows and movies. The company is also leaning more into using AI for recommendations after launching a ChatGPT-powered search feature last year.

“We have been in personalization and recommendation for two decades, but we still see tremendous room to make it better by leveraging newer technologies,” Netflix co-CEO Gregory Peters said during the company’s first-quarter conference call. “Recommendation systems based on new model architectures not only improve current personalization but also let us iterate and improve more quickly — adding support for different content types much more efficiently.”

Co-CEO Ted Sarandos said he sees AI tools improving the entire content creation process. “In general, we expect GenAI to make content better; better tools, better processes […] It takes a great artist to make great art, and AI won’t change that. But AI will give those artists better tools to bring those visions to life,” he said.

Last month, Netflix bought Ben Affleck’s AI creation company InterPositive, which, Sarandos said, has garnered interest from creators.

“With our acquisition of InterPositive, we think it accelerates our GenAI capability because it is proprietary technology created specifically for filmmakers and filmmaking, different from other GenAI video applications. While our ownership of InterPositive is very new, we have generated interest with creators who have spent time with the tools, and we are seeing momentum build around adoption,” he noted.

Netflix also mentioned that it wants to use AI to improve its ad suite, and allow for new formats and customization to get better returns. The company expects to generate ad revenue of $3 billion this year.

Netflix reported revenue of $12.25 billion in Q1 2026, up 16.2% year-over-year, and said profit jumped 83% to $5.28 billion. Alongside the first-quarter results, Netflix said its co-founder and chair, Reed Hastings, is leaving the company’s board this summer.

Notably, the company hiked subscription prices in the U.S. late last month, which could have a positive impact next quarter. The company said it ended 2025 with 325 million paying subscribers.

Bluesky confirms DDoS attack is cause of continued app outages

Bluesky’s website and app are still struggling on Friday after experiencing service interruptions that chief operating officer Rose Wang attributed to an ongoing cyberattack.

On Thursday evening, the social media company confirmed that a “sophisticated Distributed Denial-of-Service (DDoS) attack” was to blame for the issues, which had originally started on April 15 at around 8:40 p.m. ET.

Distributed denial-of-service attacks often involve pummeling apps or websites with large amounts of junk web traffic aimed at overloading their servers and knocking them offline. While these kinds of cyberattacks do not involve intrusions into a company’s systems, they can still be disruptive to both the company and its users.

Our team received a report of intermittent app outages at about 11:40pm PDT on April 15, 2026. They worked through the night to mitigate a sophisticated Distributed Denial-of-Service (DDoS) attack, which intensified throughout the day.

Bluesky (@bsky.app) 2026-04-16T23:47:25.963Z

In a post on the Bluesky account, the company shared the cause of the problem and noted that the attack was “impacting our operations, with users experiencing intermittent interruptions in service for their feeds, notifications, threads, and search.”

Bluesky said that it has not seen any evidence of unauthorized access to private data, however.

When originally reached for comment on Thursday, Bluesky only pointed us to the status.bsky.app page and account (@status.bsky.app) for updates. The company did not provide an estimated time for a fix.

The network’s status page is currently not working, however.

Bluesky said it will provide another update on the status of the attack and its mitigation by 1 p.m. ET on Friday.

Because the outages are intermittent, the Bluesky site and app will load at times, slowly, and other times will display error messages.

For instance, switching to a particular feed within the app could display a message that says, “This feed is currently receiving high traffic and is temporarily unavailable. Please try again later. Message from server: Rate Limit Exceeded.”

Popular feeds like Discover or the official Bluesky Team’s feed often see this problem, even as users’ own personal feeds are functional.

Other times, like when trying to visit a user’s profile, the site will display an error message, forcing you to refresh and try again.

Bluesky protocol engineer Bryan Newbold remarked around 3:46 a.m. ET on Wednesday, “oof, our services are getting hit pretty hard tonight.”

Notably, the service disruptions are impacting Bluesky, but other communities, like Blacksky, that run their own infrastructure on the underlying protocol that powers the decentralized social network, are still functioning.

Blacksky’s team told TechCrunch that the Bluesky outage has led to a “significant spike” in migration requests from Bluesky users over the past 12 hours, as users, devs, and other ATmosphere founders like Sebastian at Eurosky have been promoting its services.

It was clear that Bluesky’s team was in a hectic state this week while facing these issues, as one message on its status page had a typo: “investigating an incident with service in one of our reginos [sic].”
