Elon Musk’s AI firm, xAI, launched its newest flagship AI mannequin, Grok 3, late Monday evening, together with new capabilities within the Grok apps for iOS and the net.
Grok, xAI’s reply to fashions like OpenAI’s GPT-4o and Google’s Gemini, can analyze photographs and reply to questions, and powers quite a lot of options on Musk’s social community, X. Grok 3, which has been in growth for a number of months, was optimistically slated for launch in 2024, however missed that deadline.
Monday’s is an formidable launch.
xAI has been utilizing an unlimited information middle in Memphis — an information middle containing round 200,000 GPUs — to coach Grok 3. In a submit on X, Musk claimed that Grok 3 was developed with “10x” extra computing than Grok 2, its predecessor, and with an expanded coaching information set that ostensibly consists of filings from court docket instances.

“Grok 3 is an order of magnitude extra succesful than Grok 2,” Musk stated throughout a live-streamed presentation Monday. “[It’s a] maximally truth-seeking AI, even when that fact is typically at odds with what’s politically right.”
Grok 3 is a household of fashions, to be exact — not only one. A smaller model of Grok 3, Grok 3 mini, responds to questions extra rapidly at the price of some accuracy. Not all fashions and associated options can be found as of but (and a few are in beta), however the rollout begins on Monday.
xAI claims that Grok 3 beats GPT-4o on benchmarks together with AIME, which evaluates a mannequin’s efficiency on a sampling of math questions, and GPQA, which assesses fashions utilizing PhD-level physics, biology, and chemistry issues. An early model of Grok 3 additionally scored competitively in Chatbot Enviornment, a crowdsourced take a look at that pits completely different AI fashions in opposition to one another and has customers vote on their most popular responses, in keeping with xAI.

Two variations of Grok 3, Grok 3 Reasoning and Grok 3 mini Reasoning, can fastidiously “assume by way of” issues, just like “reasoning” fashions like OpenAI’s o3-mini and Chinese language AI firm DeepSeek’s R1. Reasoning fashions completely fact-check themselves earlier than giving out outcomes, which helps them keep away from among the pitfalls that usually journey up fashions.
xAI claims that Grok 3 Reasoning surpasses the perfect model of o3-mini — o3-mini-high — on a number of well-liked benchmarks, together with a more moderen arithmetic benchmark known as AIME 2025.

The reasoning fashions might be accessed through the Grok app. Customers can ask Grok 3 to “Suppose,” or — for tougher queries — leverage “Large Mind” mode for reasoning that employs further computing. xAI describes the reasoning fashions as finest fitted to mathematics-, science-, and programming-related questions.
Musk stated that, within the Grok app, among the reasoning fashions’ “ideas” are obscured to forestall distillation, a way utilized by AI mannequin builders to extract information from one other mannequin. Just lately, DeepSeek was accused of distilling OpenAI’s fashions to create its personal.
Grok’s reasoning fashions underpin a brand new function within the Grok app known as DeepSearch, xAI’s reply to AI-powered “deep analysis” instruments like OpenAI’s deep analysis. DeepSearch scans the web and X to research data and ship an summary in response to a query.
Subscribers to X’s Premium+ tier ($22 per 30 days) will get Grok 3 first, and different options are gated behind a brand new plan that xAI’s calling SuperGrok. Priced at $30 per 30 days or $300 per 12 months (if leaks are to be believed), SuperGrok unlocks further reasoning and DeepSearch queries, and throws in limitless picture technology.

Sooner or later — as quickly as a few week from now — the Grok app will achieve a “voice mode,” Musk stated, which is able to give Grok fashions a synthesized voice. A number of weeks after that, Grok 3 fashions will arrive in xAI’s enterprise API, together with the DeepSearch functionality.
xAI plans to open-source Grok 2 within the coming months, stated Musk.
“Our basic method is that we are going to open-source the final model [of Grok] when the following model is totally out,” he continued. “When Grok 3 is mature and secure, which might be inside just a few months, then we’ll open-source Grok 2.”
When Musk introduced Grok roughly two years in the past, he pitched the AI as edgy, unfiltered, and anti-“woke” — typically, keen to reply controversial questions different AI techniques received’t. He delivered on a few of that promise. Instructed to be vulgar, for instance, Grok and Grok 2 would fortunately oblige, spewing colourful language you probably wouldn’t hear from ChatGPT.
However Grok fashions previous to Grok 3 hedged on political topics and wouldn’t cross sure boundaries. In reality, one research discovered that Grok leaned to the political left on subjects like transgender rights, range applications, and inequality.
Musk has blamed the conduct on Grok’s coaching information — public net pages — and pledged to “shift Grok nearer to politically impartial.” It’s not but clear whether or not xAI achieved that objective — and what the implications may be.