Why RAG is the most accessible path to commercial AI

The barrier to entry for building practical generative AI is lower than you think, but you have to follow the right steps.
November 1, 2024

Since the public release of ChatGPT in late 2022, enterprises have been captivated by the promise of generative AI, yet many struggle to bridge the gap between concept and execution. To bridge this gap, organizations must first solve the challenge of integrating and managing their data, then optimize the way they interact with foundation models.

For instance, PGA Tour’s digital chief experienced a significant error when ChatGPT incorrectly stated how many times Tiger Woods had won the Tour. In response, the PGA now employs RAG to enhance foundation models like GPT-4 with accurate, context-rich data from sources such as internal documentation.

The benefits of RAG

For most enterprises, RAG offers a practical balance between innovation and feasibility. Other options, such as fine-tuning or building custom models, require massive investments of data, expertise and other resources. By contrast, RAG requires a modest volume of proprietary data and minor additions to a data stack.

However, to successfully implement RAG, organizations must solve two key challenges:

  • Ensuring data is reliably moved into a platform accessible by the foundation model.
  • Maximizing RAG’s capabilities to meet business needs.

Data integration: The foundation of RAG

To power RAG, data must be moved from a variety of sources – databases, applications, documents and more – into formats that foundation models can access, such as vector databases and knowledge graphs. This is where automated, fully managed data integration becomes crucial.

Fivetran’s recommended RAG architecture involves two stages: 

  1. Extracting and loading both structured and unstructured data to a data lake as a staging ground. 
  2. Transforming and embedding the relevant data into a vector database and/or knowledge graph. 

By using automated data pipelines and separating the data pipeline into these steps, a data team has a reliable, flexible and modular flow of data.

Maximizing RAG for immediate impact

Once data integration is in place, the next step is maximizing the value of RAG with a few critical strategies:

  1. Prompt engineering: Learning how to frame questions for the model can greatly improve accuracy. High specificity and clear context are crucial for getting reliable results.
  2. Data curation: Carefully selecting and refining the data used for RAG ensures the model provides relevant, accurate context. This iterative process prevents the model from producing false or "hallucinated" information.
  3. Knowledge graphs: While vector databases are commonly used with RAG, they produce results probabilistically, increasing the risk of errors. Knowledge graphs provide a more deterministic approach, encoding factual relationships to ensure accuracy, though they may require more computational resources. 

Generative AI is still evolving

Generative AI is an extremely dynamic field, with new models and architectures appearing by the month.  In the future, we may see a sudden profusion of custom foundation models for specific industries and professions, vastly improving the performance of commercial RAG deployments. New architectural developments could vastly simplify the flow of data between source systems and AI models, including live querying of production data. Public datasets could become so comprehensive, and computation so cheap, that any organization could spin up its own custom model for a modest cost. 

Here and now, however, organizations can best commercialize AI by focusing on data integration and managing RAG effectively. 

[CTA_MODULE]

Start for free

Join the thousands of companies using Fivetran to centralize and transform their data.

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Data insights
Data insights

Why RAG is the most accessible path to commercial AI

Why RAG is the most accessible path to commercial AI

November 1, 2024
November 1, 2024
Why RAG is the most accessible path to commercial AI
The barrier to entry for building practical generative AI is lower than you think, but you have to follow the right steps.

Since the public release of ChatGPT in late 2022, enterprises have been captivated by the promise of generative AI, yet many struggle to bridge the gap between concept and execution. To bridge this gap, organizations must first solve the challenge of integrating and managing their data, then optimize the way they interact with foundation models.

For instance, PGA Tour’s digital chief experienced a significant error when ChatGPT incorrectly stated how many times Tiger Woods had won the Tour. In response, the PGA now employs RAG to enhance foundation models like GPT-4 with accurate, context-rich data from sources such as internal documentation.

The benefits of RAG

For most enterprises, RAG offers a practical balance between innovation and feasibility. Other options, such as fine-tuning or building custom models, require massive investments of data, expertise and other resources. By contrast, RAG requires a modest volume of proprietary data and minor additions to a data stack.

However, to successfully implement RAG, organizations must solve two key challenges:

  • Ensuring data is reliably moved into a platform accessible by the foundation model.
  • Maximizing RAG’s capabilities to meet business needs.

Data integration: The foundation of RAG

To power RAG, data must be moved from a variety of sources – databases, applications, documents and more – into formats that foundation models can access, such as vector databases and knowledge graphs. This is where automated, fully managed data integration becomes crucial.

Fivetran’s recommended RAG architecture involves two stages: 

  1. Extracting and loading both structured and unstructured data to a data lake as a staging ground. 
  2. Transforming and embedding the relevant data into a vector database and/or knowledge graph. 

By using automated data pipelines and separating the data pipeline into these steps, a data team has a reliable, flexible and modular flow of data.

Maximizing RAG for immediate impact

Once data integration is in place, the next step is maximizing the value of RAG with a few critical strategies:

  1. Prompt engineering: Learning how to frame questions for the model can greatly improve accuracy. High specificity and clear context are crucial for getting reliable results.
  2. Data curation: Carefully selecting and refining the data used for RAG ensures the model provides relevant, accurate context. This iterative process prevents the model from producing false or "hallucinated" information.
  3. Knowledge graphs: While vector databases are commonly used with RAG, they produce results probabilistically, increasing the risk of errors. Knowledge graphs provide a more deterministic approach, encoding factual relationships to ensure accuracy, though they may require more computational resources. 

Generative AI is still evolving

Generative AI is an extremely dynamic field, with new models and architectures appearing by the month.  In the future, we may see a sudden profusion of custom foundation models for specific industries and professions, vastly improving the performance of commercial RAG deployments. New architectural developments could vastly simplify the flow of data between source systems and AI models, including live querying of production data. Public datasets could become so comprehensive, and computation so cheap, that any organization could spin up its own custom model for a modest cost. 

Here and now, however, organizations can best commercialize AI by focusing on data integration and managing RAG effectively. 

[CTA_MODULE]

Want to learn more? We have a primer on generative AI for you.
Download the ebook
Topics
Share

Related blog posts

How generative AI will change the nature of work
Data insights

How generative AI will change the nature of work

Read post
Assembling a RAG architecture using Fivetran
Product

Assembling a RAG architecture using Fivetran

Read post
Build your own RAG-based GenAI application in 30 minutes
Product

Build your own RAG-based GenAI application in 30 minutes

Read post
Lyra Health leads mental health innovation with data lakes & AI
Blog

Lyra Health leads mental health innovation with data lakes & AI

Read post
FivetranChat: A homebrewed generative AI story
Blog

FivetranChat: A homebrewed generative AI story

Read post
How CIOs can drive AI success with a strong data foundation
Blog

How CIOs can drive AI success with a strong data foundation

Read post
Unlock AI-powered search with Fivetran and Milvus
Blog

Unlock AI-powered search with Fivetran and Milvus

Read post
The case for using structured and semi-structured data in generative AI
Blog

The case for using structured and semi-structured data in generative AI

Read post
How to use Fivetran and Snowflake to move data and innovate
Blog

How to use Fivetran and Snowflake to move data and innovate

Read post

Start for free

Join the thousands of companies using Fivetran to centralize and transform their data.

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.