Alan C. Moore

    OpenAI’s New AI Models o3 and o4-mini Can Now ‘Think With Images’

    April 18, 2025 (Updated: April 18, 2025) | Tech
    OpenAI’s CEO Sam Altman. Image: Creative Commons

    OpenAI has rolled out two new AI models, o3 and o4‑mini, that can “think with images,” marking a big step forward in how machines understand pictures. These models, announced in an OpenAI press release, can reason about images the same way they do about text: cropping, zooming, and rotating photos as part of their internal thought process.

    At the heart of this update is the ability to blend visual and verbal reasoning.

    “OpenAI o3 and o4‑mini represent a significant breakthrough in visual perception by reasoning with images in their chain of thought,” the company said in its press release. Unlike past versions, these models don’t rely on separate vision systems — instead, they natively mix image tools and text tools for richer, more accurate answers.

    How does ‘thinking with images’ work?

    The models can crop, zoom, rotate, or flip an image as part of their thinking process, just like humans would. They’re not just recognizing what’s in a photo but working with it to draw conclusions.
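To make those four operations concrete, here is a minimal sketch in plain Python. It is purely illustrative and is not how o3 or o4‑mini work internally, which OpenAI has not detailed here. It assumes an image is a grid of grayscale pixel values, and the function names (`crop`, `zoom`, `rotate90`, `flip_horizontal`) are hypothetical.

```python
from typing import List

# Assumption for illustration: an image is a list of rows of grayscale values.
Image = List[List[int]]


def crop(img: Image, top: int, left: int, height: int, width: int) -> Image:
    """Return the height x width region whose top-left corner is (top, left)."""
    return [row[left:left + width] for row in img[top:top + height]]


def zoom(img: Image, factor: int) -> Image:
    """Enlarge by an integer factor using nearest-neighbor pixel repetition."""
    return [
        [px for px in row for _ in range(factor)]
        for row in img
        for _ in range(factor)
    ]


def rotate90(img: Image) -> Image:
    """Rotate the image 90 degrees clockwise."""
    return [list(col) for col in zip(*img[::-1])]


def flip_horizontal(img: Image) -> Image:
    """Mirror the image left to right."""
    return [row[::-1] for row in img]


if __name__ == "__main__":
    picture = [
        [1, 2, 3],
        [4, 5, 6],
    ]
    print(crop(picture, 0, 1, 2, 2))
    print(zoom([[1, 2]], 2))
    print(rotate90(picture))
    print(flip_horizontal(picture))
```

A reasoning model would presumably chain steps like these, for example cropping to a region of interest and then zooming in on it, before describing what it sees.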

    The company notes that “ChatGPT’s enhanced visual intelligence helps you solve tougher problems by analyzing images more thoroughly, accurately, and reliably than ever before.”

    This means if you upload a photo of a handwritten math problem, a blurry sign, or a complicated chart, the model can not only understand it, but also break it down step by step — possibly even better than before.

    Outperforms previous models in key benchmarks

    These new abilities aren’t just impressive in theory; OpenAI says both models outperform their predecessors on leading academic and AI benchmarks.

    “Our models set new state-of-the-art performance in STEM question-answering (MMMU, MathVista), chart reading and reasoning (CharXiv), perception primitives (VLMs are Blind), and visual search (V*),” the company noted in a statement. “On V*, our visual reasoning approach achieves 95.7% accuracy, largely solving the benchmark.”

    But the models aren’t perfect. OpenAI admits they can sometimes overthink, leading to prolonged and unnecessary image manipulations. There are also cases where the AI might misinterpret what it sees, despite correctly using tools to analyze the image. The company also warned that results can be inconsistent when the same task is attempted multiple times.

    Who can use OpenAI o3 and o4-mini?

    As of April 16, both o3 and o4-mini are available to ChatGPT Plus, Pro, and Team users; they replace older models like o1 and o3-mini. Enterprise and education users will get access next week, and free users can try o4-mini through a new “Think” feature.
