Breaking news! OpenAI released 2 new inference models today: o3-mini and o3-mini-high.

o3-mini and o3-mini (high) will be released today.

Regular users will also get o3-mini, and plus users will be able to use o3-mini (high).

o3-mini (high) is about 200 points higher than o1 on Codeforce, faster than o1, and performs better in coding and mathematics, but the cost is still at the level of o1-mini.

Plus users can use o3-mini 100 times a day. However, the usage limit of o3-mini (high) needs to be further confirmed.

Some netizens said, yes, R1 is so popular that Openai can’t hold back:

And earlier, the Alibaba Qwen team released qwen2.5-max on New Year’s Eve. During the Spring Festival, everyone still needs to roll, hahaha…

In fact, as early as the Christmas live broadcast, Openai announced that o3 mini will be available in early 2025:

Table of Contents

We still need to talk about what o3 and o3-mini are?

o3: A cutting-edge inference model that excels in coding, mathematics, and even AGI-oriented benchmark tests. It sets a new benchmark for intelligence and problem solving.

o3-mini: A cost-effective version of o3 that provides superior performance at a very low cost and speed.

These models have taken inference to a whole new level, making breakthroughs in complex tasks possible that require in-depth understanding and logic.

o3 brings three major breakthroughs.

Programming ability: 71.7% accuracy in practical programming, 20% higher than o1. 2727 points on Codeforces, already surpassing human level.

Math level: nearly 97% accuracy in the US Mathematics Olympiad qualifying round. Even the most difficult Epic AI frontier math problems can get 25% results.

The most amazing thing is the Arc AGI test: 87.5%, surpassing humans for the first time on this extremely difficult benchmark test.

Why is the o3-mini a disruptive innovation? The o3-mini brings two changes.

Adaptive thinking: the depth of reasoning can be adjusted according to the difficulty of the task, with three modes to choose from: low, medium, and high.

This makes the AI more closely aligned with real-world usage scenarios.

Cost-effectiveness breakthrough: lower cost than the o1-mini, faster response, and better results.

However, netizens lament that o3 high consumes $1,000 per task:

In addition, there are indeed too many models available, and we have yet to confirm how to switch between them.

Uncategorized

Large Language Model management artifacts such as DeepSeek: Cherry Studio, Chatbox, AnythingLLM, who is your efficiency accelerator?

Byzddeepseeker February 11, 2025February 11, 2025

Many people have already started to deploy and use Deepseek Large Language Models locally, using Chatbox as a visualization tool This article will continue to introduce two other AI Large Language Model management and visualization artifacts, and will compare the three in detail to help you use AI Large Language Models more efficiently. In 2025,…

Uncategorized

The world’s mainstream AI products focus on analysis and comprehensive user experience guidelines (including DeepSeek and GPT)

Byzddeepseeker February 10, 2025February 10, 2025

Function positioning and core advantage analysis ChatGPT (OpenAI) – the global benchmark for all-rounders ChatGPT Technical genes: generative AI based on the GPT series of large models, with general conversational skills and logical reasoning as its core advantages. Multilingual processing: performs best in English, with continuous improvement in Chinese;but we recommen to use English to…

Uncategorized

How was DeepSeek created? A analysis of DeepSeek’s growth history

Byzddeepseeker February 3, 2025February 3, 2025

In the future, there will be more and more hardcore innovation. It may not be easy to understand now, because the entire social group needs to be educated by facts. When this society allows people who innovate hardcore to succeed, the collective mindset will change. We just need a bunch of facts and a process….

Uncategorized

DeepSeek-R1 technology revealed: core principles of the paper are broken down and the key to breakthrough model performance is revealed

Byzddeepseeker February 9, 2025February 9, 2025

Today we will share DeepSeek R1, Title: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning: Incentivizing the reasoning capability of LLM via reinforcement learning. This paper introduces DeepSeek’s first generation of reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. The DeepSeek-R1-Zero model was trained through large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as an initial step,…

Uncategorized

Qwen2.5-max vs DeepSeek R1: A deep comparison of models: a full analysis of application scenarios

Byzddeepseeker February 14, 2025February 14, 2025

Introduction Today, large language models (LLMs) play a crucial role. In early 2025, as the competition for AI intensified, Alibaba launched the new Qwen2.5-max AI model, and DeepSeek, a company from Hangzhou, China, launched the R1 model, which represents the pinnacle of LLM technology. Deepseek R1 is an open source AI model that has attracted…

Uncategorized

DeepSeek-R1-0528 Update: Deeper Thinking, Stronger Reasoning

Byzddeepseeker May 29, 2025May 29, 2025

The DeepSeek R1 model has undergone a minor version upgrade, with the current version being DeepSeek-R1-0528. When you enter the DeepSeek webpage or app, enable the “Deep Thinking” feature in the dialogue interface to experience the latest version. The DeepSeek-R1-0528 model weights have been uploaded to HuggingFace Over the past four months, DeepSeek-R1 has undergone…

We still need to talk about what o3 and o3-mini are?

o3 brings three major breakthroughs.

Why is the o3-mini a disruptive innovation? The o3-mini brings two changes.

Similar Posts

Leave a Reply Cancel reply