Close Menu
Alan C. Moore
    What's Hot

    Nearly 90 universities have rebranded their DEI offices, College Fix survey finds

    June 13, 2025

    The Morning Briefing: Israel Surgically Delivers Multiple Packages of ‘FO’ to Iran

    June 13, 2025

    Anti-ICE protests: Appeal court blocks earlier ruling; allows Trump to command California Guard

    June 13, 2025
    Facebook X (Twitter) Instagram
    Trending
    • Nearly 90 universities have rebranded their DEI offices, College Fix survey finds
    • The Morning Briefing: Israel Surgically Delivers Multiple Packages of ‘FO’ to Iran
    • Anti-ICE protests: Appeal court blocks earlier ruling; allows Trump to command California Guard
    • Tensions soar: Why did Israel strike Iran? What makes the timing crucial – Explained
    • Operation Rising Lion: Did Donald Trump know in advance of Israel’s strikes on Iran? What he said
    • Operation ‘Rising Lion’: How world leaders reacted to Israel’s strike on Iran; Netanyahu vows to continue operation
    • ‘America was invaded’: Donald Trump pushes for ‘remigration’; slams Biden, Newsom
    • Sen. Alex Padilla forcibly removed from Noem’s LA presser: Trump officials call it ‘childish’; Democrats condemn assault on free speech
    Alan C. MooreAlan C. Moore
    Subscribe
    Friday, June 13
    • Home
    • US News
    • Politics
    • Business & Economy
    • Video
    • About Alan
    • Newsletter Sign-up
    Alan C. Moore
    Home » Blog » Retrieval Augmented Generation: What It Is and Why It Matters for Enterprise AI

    Retrieval Augmented Generation: What It Is and Why It Matters for Enterprise AI

    November 21, 2024Updated:November 21, 2024 Tech No Comments
    tr datastax cto rag ai hallucinations jpg
    tr datastax cto rag ai hallucinations jpg
    Share
    Facebook Twitter LinkedIn Pinterest Email

    IT managers and businesses looking to implement generative AI have discovered that research-augmented generation (RAG ) is a necessity. Enterprises can ground an LLM in enterprise data by using a large language model ( LLM) and RAG, which increases output accuracy.

    But how does RAG job? What are some Fabric apply situations? And are there any actual choices?

    Davor Bonaci, the CEO and executive vice president of collection and AI company DataStax, spoke with TechRepublic to learn more about how relational AI is being leveraged in the market as it becomes available in 2024 and what he anticipates the technology will look like in 2025.

    What is Retrieval Augmented Generation?

    RAG adds extended or augmented perspective from an business to enhance the importance and accuracy of relational AI LLM model outputs. It enables venture use cases for relational AI LLMs.

    Bonaci explained that while LLMs have “basically been trained on all the data available on the internet”, up to a particular cut-off time, depending on the model, their vocabulary and basic awareness strengths are offset by major and well-known problems, such as AI hallucinations.

    Notice: Zetaris on why AI will be powered by united data lakes in the future.

    You may ground it in business data if you want to use it in an organization setting. Then, you get a lot of delusions”, he said. ” With RAG, instead of just asking the LLM to make things, you say,’ I want you to create something, but kindly consider these things that I know to be correct.'”

    How does RAG function in an organization building?

    RAG gives an LLM allusion to an organization data set, such as a information center, a database, or a file set. For example, DataStax’s primary product is its vectors database, Astra DB, which enterprises are using to help the creating of AI programs in enterprises.

    A user’s keyword suggestions would typically move through a matrix research, which locates the most pertinent paperwork or pieces of information from a predetermined knowledge base. This could include enterprise documents, academic papers, or FAQs.

    The retrieved information is then fed into the generative model as additional context alongside the original query, allowing the model to ground its response in real-world, up-to-date, or domain-specific knowledge. This grounding lowers the chance of hallucinations that could be bad for an organization.

    AI resources from TechRepublic Premium

    How much does RAG make generative AI models ‘ output better?

    “night and day” refers to the difference between using generative AI with and without RAG, according to Bonaci. For an enterprise, the propensity for an LLM to hallucinate essentially means they are “unusable” or only for very limited use cases. The RAG approach opens the door for generative AI in businesses.

    ” At the end of the day, they]LLMs] have knowledge from seeing things on the internet”, Bonaci explained. They will, however, give you a very confident response that may be completely incorrect if you ask a question that is kind of outside the left field.

    SEE: Generative AI has become a source of costly mistakes for enterprises

    According to Bonaci, RAG techniques can increase the accuracy of non-reasoning tasks ‘ LLM outputs to over 90 %, depending on the models and benchmarks used. For complex reasoning tasks, they are more likely to deliver between 70-80 % accuracy using RAG techniques.

    What are some RAG use cases?

    RAG is used across several typical generative AI use cases for organisations, including:

    Automation

    Enterprises can automate repeatable tasks by combining LLMs with RAG. A common use case for automation is customer support, where the system can be empowered to search documentation, provide answers, and take actions like canceling a ticket or making a purchase.

    Personalisation

    RAG can be used to synthesize and summarize a lot of data. Bonaci gave the example of customer reviews, which can be summarised in a personalised way that is relevant to the user’s context, such as their location, past purchases, or travel preferences.

    Search

    RAG can be applied to improve search results in an enterprise, making them more relevant and context-specific. Even if the search terms do n’t exactly match the available content, Bonaci noted how RAG assists streaming service users in finding movies or content that is relevant to their interests or location.

    What applications do knowledge graphs have for RAG?

    Using knowledge graphs with RAG is an “advanced version” of basic RAG. While a vector search in a basic RAG finds similarities between a vector database and is appropriate for general knowledge and natural language, Bonaci explained that it has limitations for specific enterprise use cases.

    A customer inquiry, such as whether international roaming is included, would be required by the AI in a situation where a mobile phone provider offers multiple-tiered plans with varying inclusions. A knowledge graph can aid in arranging the information to identify what applies.

    SEE: Digital maturity key to success in AI for cybersecurity

    The conflict between the plan documents ‘ content and one another, according to Bonaci. ” So the system does n’t know which one is true. Therefore, you could use a knowledge graph to aid in conflict resolution and proper organization.

    Are there any alternatives to RAG for enterprises?

    A generative AI model can be fine tuned, which is the main alternative to RAG. Instead of acting as a prompt, enterprise data is fine tuned to create an influenced data set that can be used to prepare the model for use.

    According to Bonaci, RAG has been the most popular method to date for advancing generative AI as a business priority.

    ” We do see people fine-tuning models, but it just solves a small niche of problems, and so it has not been widely accepted as a solution”, he said.

    Source credit

    Keep Reading

    The Meta AI App Lets You ‘Discover’ People’s Bizarrely Personal Chats

    NVIDIA Expands AI Dominance in Europe with Major Partnerships and Infrastructure Deals

    Unpacking AI Agents

    Gartner: This GenAI Apps Development Strategy Could Cut Delivery Time by 50%

    Gartner: This GenAI Apps Development Strategy Could Cut Delivery Time by 50%

    OpenAI Releases o3-pro, an Upgrade to Its ‘Most Intelligent Model’

    Editors Picks

    Nearly 90 universities have rebranded their DEI offices, College Fix survey finds

    June 13, 2025

    The Morning Briefing: Israel Surgically Delivers Multiple Packages of ‘FO’ to Iran

    June 13, 2025

    Anti-ICE protests: Appeal court blocks earlier ruling; allows Trump to command California Guard

    June 13, 2025

    Tensions soar: Why did Israel strike Iran? What makes the timing crucial – Explained

    June 13, 2025

    Operation Rising Lion: Did Donald Trump know in advance of Israel’s strikes on Iran? What he said

    June 13, 2025

    Operation ‘Rising Lion’: How world leaders reacted to Israel’s strike on Iran; Netanyahu vows to continue operation

    June 13, 2025

    ‘America was invaded’: Donald Trump pushes for ‘remigration’; slams Biden, Newsom

    June 13, 2025

    Sen. Alex Padilla forcibly removed from Noem’s LA presser: Trump officials call it ‘childish’; Democrats condemn assault on free speech

    June 13, 2025

    Operation Rising Lion: Before strike, Trump called for diplomacy; hours later, Israel hit Iran

    June 13, 2025

    Israel’s Operation Rising Lion: Iran warns ‘sworn enemies’ should expect retaliation; says US ‘responsible for consequences’

    June 13, 2025
    • Home
    • US News
    • Politics
    • Business & Economy
    • About Alan
    • Contact

    Sign up for the Conservative Insider Newsletter.

    Get the latest conservative news from alancmoore.com [aweber listid="5891409" formid="902172699" formtype="webform"]
    Facebook X (Twitter) YouTube Instagram TikTok
    © 2025 alancmoore.com
    • Privacy Policy
    • Terms
    • Accessibility

    Type above and press Enter to search. Press Esc to cancel.