Close Menu
Alan C. Moore
    What's Hot

    Here’s Why Trump Had No Choice But to Step In and Stop the Riots

    June 8, 2025

    Who’s Paying for the Violent L.A. Protests Against ICE? You Are. And It Gets Worse From There

    June 8, 2025

    Trump orders National Guard to Los Angeles amid fiery ICE protests

    June 8, 2025
    Facebook X (Twitter) Instagram
    Trending
    • Here’s Why Trump Had No Choice But to Step In and Stop the Riots
    • Who’s Paying for the Violent L.A. Protests Against ICE? You Are. And It Gets Worse From There
    • Trump orders National Guard to Los Angeles amid fiery ICE protests
    • National Guard troops arrive in Los Angeles after days of immigration protests
    • Usha Vance hosts first Camp VPR for children at vice president’s residence
    • Feud on pause? Aides of Elon Musk and Donald Trump hold talks; but ‘relationship remains uncertain’: Report
    • Unrest in LA: Trump sends in troops to control immigration protests
    • Journalist under fire: ABC’s Terry Moran targets ‘hater’ Donald Trump, Stephen Miller; Karoline Leavitt demands action for ‘rampage’
    Alan C. MooreAlan C. Moore
    Subscribe
    Sunday, June 8
    • Home
    • US News
    • Politics
    • Business & Economy
    • Video
    • About Alan
    • Newsletter Sign-up
    Alan C. Moore
    Home » Blog » Pioneers of Reinforcement Learning Win the Turing Award

    Pioneers of Reinforcement Learning Win the Turing Award

    March 5, 2025Updated:March 5, 2025 Tech No Comments
    Share
    Facebook Twitter LinkedIn Pinterest Email
    Andrew Barto and Rich Sutton were viewed as eccentric supporters of an elegant but eventually lost concept in the 1980s: having models learn from experience, as humans and animals do.

    The method they developed is now extremely important to contemporary artificial intelligence, and initiatives like ChatGPT, Barto, and Sutton have received the Turing Award, the highest distinction in the field of computer technology.

    A strategy known as validation learning, which involves coaxing a machine to perform duties through research combined with either positive or negative feedback, was pioneered by Barto, an emeritus professor at the University of Massachusetts Amherst, and Sutton, a professor at the University of Alberta.

    When Barto began this work for me, it was incredibly unfashionable, he smiles as he speaks over Zoom from his Massachusetts house. It’s amazing that it has gained some media and influence, Barto continues.

    Google DeepMind probably made the most of reinforcement learning when it created AlphaGo, a program that autonomously learned to play the very complicated and delicate board game of Go on a high level. The approach, which has since been applied in advertising, maximizing data-center energy use, finance, and chip design, was sparked by this demonstration, which gained fresh interest from viewers. The method also has a long history in automation, where it can assist machines in learning to do actual tasks through problem.

    More recently, reinforcement learning has been important in guiding the creation of extraordinarily strong chatbot programs and guiding the output of large language models ( LLMs). Similar techniques are being employed to train AI designs to imitate human reasoning and to create stronger AI providers.

    Sutton points out, however, that the techniques used to link LLMs involve people providing objectives rather than an engine learning solely through its own inquiry. He claims that having devices learn completely on their own may be more successful in the long run. The major difference is whether AI learns from other individuals or whether it is actually learning from its own experiences, he says.

    In a statement released by the Association for Computing Machinery (ACM), which distributes the Turing Award, Jeff Dean, a senior vice president at Google, stated that” Barto and Sutton’s work has been a centerpiece of development in AI over the past few years.” They have made significant advances, reiterating that the equipment they developed are still a wall of the AI increase.

    Conditioning has a turbulent and long history in AI. In his popular 1950 paper” Computing Machinery and Intelligence,” which examines the possibility that a system might one day think like a human, Alan Turing made the suggestion that equipment could learn through experience and opinions. In 1955, Arthur Samuel, a forerunner in artificial intelligence, created one of the first machine learning programs that could play chess.

    Despite initial achievement, but, efforts to create artificial neural networks and related function fell out of favor and were for ages overshadowed by efforts to create AI using images and reasonable rules rather than learning from the ground up.

    Barto, Sutton, and some remained as a result, drawing ideas from research in biology and psychology, including research conducted by Edward Thorndike in the early 1990s that demonstrated how stimulation affect pet behavior. In developing algorithms that resemble this kind of learning, they also used insights from science and control theory.

    The ACM, which awards the Turing Prize annually, president Yannis Ioannidis, said in today’s speech,” Barto and Sutton’s job is not a stepping stone that we have now moved on from.”

    The ACM award recognizes contributions made by Barto and Sutton that helped to make reinforcement learning useful, such as policy-gradient methods, a fundamental method for an engine to learn how to behave, and historical difference learning, which enables a model to learn continuously. The method” continues to expand and has tremendous potential for more advancements in computing and many other fields,” Ioannidis said.

    The development of conditioning learning has also been important in moral debates about how AI systems might behave unknowingly. According to Barto, it was obvious from the beginning that systems may exhibit strange or undesirable behavior, such as consistently crashing a robot by focusing on the wrong things.

    Barto claims that several of his former students are nowadays professors who are interested in learning about these risks. However, he claims that the view is crucially important because of the potential of AI and encouragement studying for developing technological solutions to climate change and other significant issues. It can be very helpful, he says, “if used with prudence.”

    Source credit

    Keep Reading

    New OpenAI Sora & Google Veo Competitor Focuses on Storytelling With Its Text-to-Video Tool

    Trump/Musk Feud: Possible Impact on AI Regulation, Budget Bill, Government Contracts

    Mistral’s New AI Tool Offers ‘Best-in-Class Coding Models’ to Enterprise Developers

    Mistral’s New AI Tool Offers ‘Best-in-Class Coding Models’ to Enterprise Developers

    Mistral’s New AI Tool Offers ‘Best-in-Class Coding Models’ to Enterprise Developers

    Mistral’s New AI Tool Offers ‘Best-in-Class Coding Models’ to Enterprise Developers

    Editors Picks

    Here’s Why Trump Had No Choice But to Step In and Stop the Riots

    June 8, 2025

    Who’s Paying for the Violent L.A. Protests Against ICE? You Are. And It Gets Worse From There

    June 8, 2025

    Trump orders National Guard to Los Angeles amid fiery ICE protests

    June 8, 2025

    National Guard troops arrive in Los Angeles after days of immigration protests

    June 8, 2025

    Usha Vance hosts first Camp VPR for children at vice president’s residence

    June 8, 2025

    Feud on pause? Aides of Elon Musk and Donald Trump hold talks; but ‘relationship remains uncertain’: Report

    June 8, 2025

    Unrest in LA: Trump sends in troops to control immigration protests

    June 8, 2025

    Journalist under fire: ABC’s Terry Moran targets ‘hater’ Donald Trump, Stephen Miller; Karoline Leavitt demands action for ‘rampage’

    June 8, 2025

    Sunday Thoughts: Alternate Endings

    June 8, 2025

    Newsom Blames Trump for L.A. Insurrection

    June 8, 2025
    • Home
    • US News
    • Politics
    • Business & Economy
    • About Alan
    • Contact

    Sign up for the Conservative Insider Newsletter.

    Get the latest conservative news from alancmoore.com [aweber listid="5891409" formid="902172699" formtype="webform"]
    Facebook X (Twitter) YouTube Instagram TikTok
    © 2025 alancmoore.com
    • Privacy Policy
    • Terms
    • Accessibility

    Type above and press Enter to search. Press Esc to cancel.