
“We are excited to release a powerful new open-weight language model with reasoning in the coming months,” Altman wrote on X.
Altman said in the post that the company has been thinking about releasing an open-weight model for some time, adding, “now it feels important to do.”
The move is partly a response to the runaway success of the R1 model from Chinese company DeepSeek, as well as the popularity of Meta’s Llama models.
OpenAI may also feel the need to show that it can train the new model more cheaply, since DeepSeek’s model was purportedly trained at a fraction of the cost of most large AI models.
“This is amazing news,” Clement Delangue, cofounder and CEO of HuggingFace, a company that specializes in hosting open AI models, told WIRED. “With DeepSeek, everyone’s realizing the power of open weights.”
OpenAI currently makes its AI available through a chatbot and through the cloud. R1, Llama, and other open-weight models can be downloaded for free and modified. A model’s weights are the values inside a large neural network, which are set during training. Open-weight models are cheaper to use and can also be tailored for sensitive use cases, like handling highly confidential information.
Steven Heidel, a member of the technical staff at OpenAI, reposted Altman’s announcement and added, “we’re releasing a model this year that you can run on your own hardware.”
OpenAI today also posted a webpage inviting developers to apply for early access to the forthcoming model. Altman said in his post that the company would host events for developers with early prototypes of the new model in the coming weeks.
Meta was the first major AI company to pursue a more open approach, releasing the first version of Llama in July 2023. A growing number of open-weight AI models are now available. Some researchers note that Llama and some other models are not as transparent as they could be, because the training data and other details are still kept secret. Meta also imposes a license that limits other companies’ ability to profit from applications and tools built using Llama.
Update 3/31/25 4:21 EST: This article was updated with a comment from Clement Delangue, cofounder and CEO of HuggingFace.