Trusq

factual analysis · traceable to primary sources

Explainer

RAG and enterprise AI: governing proprietary generative AI

Adopted 2026-06-22 ยท ≈ 2 min read ยท Dirk Baaijen

RAG connects a generative model to your own sources so answers come from company documents. That lowers fabrication but shifts the risk to access, confidentiality and provenance. Governance turns on source scoping, authorisation, logging and human oversight.

Short answer: Retrieval-Augmented Generation (RAG) lets a generative AI model answer from your own company documents rather than only from its training knowledge. That lowers fabrication and keeps sensitive knowledge inside your own sources, but it shifts the risk to access control, confidentiality and the provenance of answers. Good governance turns on source scoping, authorisation, logging and human oversight.

What RAG changes about the risk profile

In RAG the system retrieves relevant passages from a proprietary knowledge source and passes them to the model as context. The model therefore does not "know" your documents โ€” it is handed them per query. The benefit: answers are traceable to a source, and the model need not be trained on your data, which reduces the risk of eroding trade secrets. But quality and safety now depend on which documents are retrievable and who may see them.

The biggest pitfall: authorisation

A RAG system that makes all company documents searchable without filtering can expose sensitive information to employees who normally have no access to it. The retrieval layer must respect existing access rights: a user must not see more through the chatbot than through the source system. This is the difference between a handy assistant and an uncontrolled data leak.

Confidentiality and the choice of model

Whether you use a hosted model or your own environment determines where your context data ends up. Choose an offering where retrieved passages are not used for training and not retained. If the source contains personal data, GDPR requirements apply: legal basis, processor agreement and clarity on the processing location.

Provenance, hallucination and human oversight

RAG reduces fabrication but does not eliminate it: the model can mis-summarise retrieved passages or ignore context. Always show the source alongside the answer so a user can verify. For decisions with legal effect or high risk, human oversight remains required โ€” a RAG assistant is a tool, not a decision-maker.

Anchoring governance

Treat an internal RAG system like any other AI application: it belongs in your AI governance framework, with an owner, a risk assessment and logging. If you use a general model as the base, the GPAI regime may apply to the provider, while you as a user have your own obligations.

What to do

  • Mirror access rights: the retrieval layer must never expose more than the source system allows.
  • Choose a model/environment in which context data is not used for training or retained.
  • Show sources with every answer and keep human oversight for weighty decisions.
  • Curate the knowledge source: outdated or wrong documents lead to wrong answers.
  • Log use and outcomes and give the system an owner within your governance.

Sources

  1. https://eur-lex.europa.eu/eli/reg/2024/1689/oj
    Regulation (EU) 2024/1689 (AI Act); transparency, human oversight and GPAI obligations that also touch internal AI applications.
  2. https://eur-lex.europa.eu/eli/reg/2016/679/oj
    Regulation (EU) 2016/679 (GDPR); lawful processing and access control where personal data sits in internal sources.

Share on LinkedIn

Read next

A

Singapore: the Model AI Governance Framework and AI Verify

Singapore regulates AI not with a binding law but with voluntary instruments: the Model AI Governance Framework (with a separate version for generative AI) and the AI Verify testing toolkit. The aim is trust through testable practice rather than legal obligations up front.

U

AI, trade secrets and confidentiality

Feeding confidential information into an external AI model can undermine trade-secret status and breach confidentiality or GDPR obligations. Protection depends on secrecy measures; uncontrolled sharing erodes them. Manage it with policy, contract and access rules.

W

An AI use policy for employees: generative AI at work

Employees already use generative AI โ€” often without rules. A use policy bounds the risks: leakage of confidential or personal data, unreliable output, IP questions and transparency. The AI literacy duty (Art. 4) also makes such a policy part of compliance.

Dirk Baaijen

About this knowledge base

Compiled and maintained by YRproject โ€” programme and project direction at the intersection of digital transformation, AI and regulation. Every factual claim is traceable to its primary source. YRproject is led by Dirk Baaijen About & method โ†’

A project or programme? Work with YRproject โ†’

The monthly briefing

AI regulation in five minutes: what changed, what is coming and what it means. No spam, unsubscribe anytime.

Your address is used for this only and stored on our own servers.