OpenAI just got a major upgrade with world-changing potential — here's how it works

OpenAI logo on a phone screen in front of a blurred image of Sam Altman
(Image credit: Shutterstock)

On Day 2 of “12 Days of OpenAI,” we were gifted the launch of reinforcement fine-tuning and the chance to see a live demo of ChatGPT Pro. Although Sam Altman was not present, his team walked us through a fascinating preview of what could be a significant advancement in model customization.

For those unable to join the live briefing or who want to take a deeper dive into what reinforcement fine-tuning means, here’s a quick rundown. Reinforcement Fine-Tuning (RFT) is a groundbreaking approach that could empower developers and machine learning engineers to create AI models tailored for complex, domain-specific tasks. In other words, there is unlimited potential for breakthroughs in science, medical, financial, and legal discoveries.

Unlike traditional supervised fine-tuning, which focuses on training models to replicate desired outputs, RFT optimizes a model’s reasoning capabilities through lessons and rewards. This advancement represents a significant leap in AI customization, enabling models to excel in specialized fields.

For the rest of us non-scientists, this news means scientific advancements in medicine and other industries may be closer than we think, with AI assisting in ways beyond human comprehension. At least, that's OpenAI's goal.

How RFT works

For the first time, reinforcement learning techniques previously reserved for OpenAI’s cutting-edge models like GPT-4o and the o1-series are available to external developers. This democratization of advanced AI training methods paves the way for highly specialized AI solutions.

Developers and organizations can now create expert-level models without requiring extensive reinforcement learning expertise. RFT’s focus on reasoning and problem-solving could prove particularly relevant in fields demanding precision and expertise.

Applications range from advancing scientific discoveries to streamlining complex legal workflows that could mark a paradigm shift in applying AI to real-world challenges.

12 Days of OpenAI is far from over

One of RFT’s standout features is its developer-friendly interface. Users only need to supply a dataset and grader, while OpenAI handles the reinforcement learning and training processes. This simplicity lowers the barrier to entry, allowing a broader range of developers and organizations to harness RFT’s power.

Yesterday’s o1 preview and today’s look at reinforcement fine-tuning have been fascinating. We’ve only just begun the countdown, and there’s still so much more to come from Altman and his team.

The event pauses over the weekend, but join us next week for even more exciting news. Will we get more from OpenAI’s Canvas? Will there be a projects-type upgrade that allows groups to use ChatGPT together? Stay tuned!

More from Tom's Guide

Category
Arrow
Arrow
Back to MacBook Air
Brand
Arrow
Processor
Arrow
RAM
Arrow
Storage Size
Arrow
Screen Size
Arrow
Colour
Arrow
Storage Type
Arrow
Condition
Arrow
Price
Arrow
Any Price
Showing 10 of 99 deals
Filters
Arrow
Show more
TOPICS
Amanda Caswell
AI Writer
Read more
ChatGPT logo on MacBook Pro M4
OpenAI now integrates with Notes, Quip, and Notion — here’s what’s new
OpenAI logo
OpenAI ChatGPT-4.5 is here and it's the most human-like chatbot yet — here's how to try it
ChatGPT logo on a smartphone screen being held outside
ChatGPT just got OpenAI's most powerful upgrade yet — meet 'Deep Research'
OpenAI logo on a phone screen in front of a blurred image of Sam Altman
OpenAI CEO Sam Altman just shared a massive update on what's next for ChatGPT
ChatGPT Vision feature
OpenAI just released Advanced Voice with Vision — now ChatGPT can see and hear you
12 Days of OpenAI
ChatGPT Projects announced — and it's one of the most important AI releases this year
Latest in ChatGPT
ChatGPT app on iPhone
I just tested ChatGPT-4.5 with 5 prompts — the good, the bad and the weird
ChatGPT app icon on mobile device
ChatGPT 4.5 — 5 big upgrades you need to know
OpenAI logo
OpenAI ChatGPT-4.5 is here and it's the most human-like chatbot yet — here's how to try it
ChatGPT app icon on mobile device
ChatGPT Plus just got a huge deep research upgrade — here's how to try it now
A person logging into LinkedIn on their phone and laptop
Looking for a job? — 7 prompts to use ChatGPT o3-mini as a job search assistant
OpenAI logo on phone sitting on top of laptop keyboard
OpenAI’s ‘o3-mini’ is free for all users — what you need to know
Latest in News
iOS 19 logo on an iPhone
iOS 19 — all the rumors so far
NYTimes Connections
NYT Connections today hints and answers — Tuesday, March 11 (#639)
An image of a CAPTCHA
Hackers are using reCAPTCHA to trick users into infecting their own PCs with malware — how to stay safe
Gmail logo on iPhone
Gmail just got a huge AI upgrade that will save you a ton of time
Xbox handheld
Xbox handheld reportedly arriving this year, new PC-like console in 2027
Concept image of foldable iPad
Apple reportedly has an 18.8-inch foldable iPad prototype with under-display Face ID