feat(datafabric): standalone ontology tool grounded on OWL + R2RML by sankalp-uipath · Pull Request #911 · UiPath/uipath-langchain-python

sankalp-uipath · 2026-06-16T12:50:47Z

What

Adds a standalone Data Fabric ontology tool for low-code agents. The agent selects ontologies (not entities); the tool derives the entities it may query from the ontology's R2RML, resolves their schemas, and runs the existing inner SQL sub-graph grounded on both the OWL (semantic schema) and the R2RML (ontology→table/column mapping).

An ontology context (contextType: "datafabricontology") now becomes a tool on its own — no entitySet required in agent.json.

How it works (first invocation, cached)

Fetch the ontology's R2RML (critical — it is the entity allow-list) and OWL (grounds the prompt) via EntitiesService.get_ontology_file_async.
Parse R2RML → the closed (entity_name, folder_path) allow-list, reading rr:tableName + a uipath:folderPath annotation per rr:TriplesMap.
Resolve each entry: folderPath → folder_key (folders.retrieve_key_async, cached per path) then name → Entity schema (entities.retrieve_by_name_async); build a folder-scoped EntitiesService via the public folders_map constructor param (no SDK change).
Build a dedicated ontology system prompt (OWL + R2RML + entity schemas) and compile the shared DataFabricGraph — the inner agent still has a single tool, execute_sql.

Everything from the sub-graph down (execute_sql → query_entity_records_async) is reused unchanged from the entity tool.

Key modules

New

ontology_r2rml.py — dependency-free parser: parse_r2rml_entities() → the (entity_name, folder_path) allow-list (block-based, R2RMLParseError on contract violations).
datafabric_ontology_tool.py — resolve_ontology_entities() (the resolver), DataFabricOntologyQueryHandler, create_datafabric_ontology_tool().
datafabric_ontology_prompt_builder.py — the ontology tool's inner prompt (OWL + R2RML + entity schemas), fully self-contained so the entity-tool builder is left untouched by this feature.

Changed

ontology_fetcher.py — generalized to fetch_ontology_file (raw content + media type; raises) + fence_ontology_block.
datafabric_subgraph.py — DataFabricGraph is now prompt-agnostic (takes a pre-built system_prompt).
datafabric_tool.py / context_tool.py — removed all ontology grounding from the entity-tool path; the ontology context now builds the standalone tool (flag-gated).
datafabric_prompt_builder.py — not modified by this PR (kept byte-identical to main); it remains the entity tool's builder only.

Design decisions

Closed allow-list (Option A): the entities the tool can touch are exactly those declared in the ontology's R2RML — the LLM can never widen scope. execute_sql's blast radius = the resolved set.
uipath:folderPath (a folder path, not a GUID): R2RML has no folder concept and is deployment-agnostic; rr:tableName stays a valid SQL table name. Folder identity is resolved through the trusted folder service, never from mapping content. (Authoring contract documented for the ontology-authoring skill.)
No SDK change for the core feature — all new code is in uipath-langchain-python over existing public uipath-platform methods.

Feature flag

Gated by DataFabricOntologyEnabled (default off), a single shared constant.
Two guards (defense in depth): the tool-factory entry (context_tool — off ⇒ no tool created, feature fully inert) and the handler's lazy init (re-checks before any OWL/R2RML fetch/parse/resolve). Off ⇒ the agent runs exactly as before; the entity tool is flag-independent.

Security

Entity/folder scope is bounded by the R2RML allow-list; folder keys come only from trusted folder-path resolution (not the LLM).
execute_sql's single-statement sqlparse guard is unchanged.
OWL/R2RML are injected as prompt context; a hostile ontology can only reference entities it itself declares (Option A), which still pass folder-scoped resolution.

Testing

Unit tests for the parser, resolver (folder-key caching, folders_map), the ontology prompt builder, the factory, and the flag guard.
Live e2e on alpha (datafabric/ab, gpt-5.4, california-schools ontology): flag prefetched on → R2RML + OWL fetched → folderPath resolved → 3 entities resolved by name → OWL+R2RML-grounded SQL (join on the R2RML FK) executed folder-scoped → correct answer. Confirmed retrieve_by_name (/metadata) returns populated field schemas.

Notes / dependencies

Depends on SDK #1728 (ontology models: ontologySet on the context, get_ontology_file_async). Pinned to uipath>=2.12.5, <2.13.0 and uipath-platform>=0.1.91, <0.2.0 — the SDK versions that carry the ontology binding. uipath-platform 0.1.91 is not yet on PyPI (only testpypi dev builds), so CI dependency resolution stays red until #1728 merges and publishes; no .dev pin is committed.
Runtime flag delivery also requires DataFabricOntologyEnabled in uipath-agents-python's _ALL_FLAGS prefetch + the gitops flag deployed for the target tenants (both already in place from the prior work).
Authoring dependency: ontologies must be published with a uipath:folderPath per rr:TriplesMap (authoring-skill guidelines provided) — otherwise resolution fails loudly by design.
Supersedes the earlier "inject ontology into the entity tool" approach and folds in the R2RML grounding work.

Copilot

Pull request overview

Adds an optional fetch_ontology inner tool to the Data Fabric SQL sub-agent so the inner LLM can retrieve a configured ontology’s OWL schema from the QueryEngine REST API and use it to generate semantically-correct SQL.

Changes:

Introduces an ontology REST client (fetch_ontology_owl) with name validation and size limiting.
Adds a fetch_ontology leaf tool with an instance-level cache and wires it into the inner Data Fabric subgraph alongside execute_sql.
Threads ontology_name / folder_key into the Data Fabric tool construction path (with an env-var fallback).

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
`src/uipath_langchain/agent/tools/datafabric_tool/ontology_fetch_tool.py`	New leaf tool (`fetch_ontology`) and cached fetcher wrapper for inner SQL agent use.
`src/uipath_langchain/agent/tools/datafabric_tool/ontology_client.py`	New client helper to fetch OWL content via `EntitiesService.request_async`, including name validation and payload cap.
`src/uipath_langchain/agent/tools/datafabric_tool/models.py`	Adds an intentionally-empty args schema (`OntologyFetchInput`) for the new tool.
`src/uipath_langchain/agent/tools/datafabric_tool/datafabric_tool.py`	Plumbs `ontology_name` / `folder_key` into the query handler creation (currently with env-var fallback).
`src/uipath_langchain/agent/tools/datafabric_tool/datafabric_subgraph.py`	Adds optional `fetch_ontology` tool binding and dispatch-by-tool-name inside the inner subgraph.

…lder)

…logy_file (drop local client)

…ogy-fetch-tool

…y fallback

…ctory mapping

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated 4 comments.

…age.status to match host node

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated 4 comments.

…ather)

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated 6 comments.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Copilot reviewed 9 out of 10 changed files in this pull request and generated 2 comments.

UIPath-Harshit · 2026-06-26T06:14:22Z

+        # Inner toolset: always execute_sql; optionally an LLM-decided
+        # fetch_ontology tool when one or more ontologies are configured.
+        inner_tools: list[BaseTool] = [self._execute_sql_tool]
+        if ontologies:


EnabledNewLlmClients <- check for the feature flag impl of this to ensure out feature is behind the feature flag.

yes, will add it.

UIPath-Harshit · 2026-06-26T06:16:50Z

+        # fetch_ontology tool when one or more ontologies are configured.
+        inner_tools: list[BaseTool] = [self._execute_sql_tool]
+        if ontologies:
+            inner_tools.append(


This doesnt update the subgraph ? correct?

UIPath-Harshit · 2026-06-26T06:20:38Z

+    entity set) as ``ontologySet`` items. Each carries its own ``folderId``, so
+    it is fetched from its own folder.
+    """
+    items = getattr(resource, "ontology_set", None) or []


Same as other PR. ontology_set?

…d flag

andreitava-uip · 2026-06-30T08:53:33Z

+def test_fetch_ontology_bound_only_when_ontologies(make_graph):
+    without = make_graph(None)
+    assert "execute_sql" in without._tools_by_name
+    assert "fetch_ontology" not in without._tools_by_name
+
+    with_onto = make_graph([("library", None)])
+    assert "fetch_ontology" in with_onto._tools_by_name


nit: splitting this test into two(should bind when present/should not bind when absent) is trivial and allows instantly knowing what failed from the test name alone without checking the assertion message.

andreitava-uip · 2026-06-30T09:28:35Z

+    # An ontology context is not a standalone tool — it only grounds the Data
+    # Fabric entity tool, which gathers it via resolve_context_ontologies.
+    if resource.context_type == AgentContextType.DATA_FABRIC_ONTOLOGY:
+        return None


If it is not a standalone tool at runtime, I think it is confusing to model it as a top level resource at design time. So far, all "resource nodes" in a lowcode agent (either standalone or part of flow), are independently executable and show up in traces. This is now a different paradigm, it is an optional helper tool that will be part of another tool's subgraph.

That being said this only applies to how it's modeled today. If we indeed plan to expand ontology support in the future such that they will actually allow queries (via something like SPARQL statements for instance); then it will be better for future proofing to define them top level (at least in the package mapping). We can figure out a less confusing design time experience for now

Yes, we have plan to expand ontology support to make it a primary design experience i.e, user will select the ontologies and then it will resolve the entities internally, thus decision of making it top level resource as a part of iterative development.

andreitava-uip · 2026-06-30T09:35:10Z

+        lines.append("## Available Ontology (authoritative semantic schema)")
+        lines.append("")
+        lines.append(
+            f"This agent has a semantic ontology attached for these entities: "
+            f"{names}. It is the authoritative source for the exact column names, "
+            "value formats (date formats, codes, zero-padding), allowed values, "
+            "and the relationships between entities — richer and more reliable "
+            "than the field list below, which omits value formats and semantics."
+        )
+        lines.append("")
+        lines.append(
+            "**Before writing any SQL, call the `fetch_ontology` tool once** to "
+            "load it, then base your column names, filter values, and joins on "
+            "what it says. The entity tables below are a quick reference only; "
+            "the ontology is the source of truth when they disagree."
+        )
+        lines.append("")
+


nit: could be cleaner to have this as a single formatted string depending on names instead of individually applying each line like this.

Applicable to the existing sql_expert_system_prompt as well, but that one wasn't introduced by this PR

Fixed, please review. Also there are some changes linked to your other comment in the data fabric prompt builder (adding ontology text in the prompt).

andreitava-uip · 2026-06-30T10:34:09Z

+        # When short-circuiting to END, return ONLY the terminal-success
+        # ToolMessages so the outer agent's result is the query rows — not a
+        # co-issued fetch_ontology's OWL. On a non-terminal turn keep all messages
+        # so the inner LLM can use them on its next pass.


isn't concurrent execution of a ontology retrieval + data service query an anomaly? It doesn't seem to be correct. Why not mechanically enforce ontology retrieval and injecting it in the context. When is it useful for the llm to choose not to fetch the ontology?

Agreed, earlier we were doing so as to support future use cases where llm will query the ontology (for ex using SPARQL), instead of giving the complete ontology to agent.
But I agree with you right now it makes more sense to mechanically injecting it in the system prompt.
I have made the changes please review again.

…onstant

…ogy-fetch-tool

andreitava-uip · 2026-06-30T11:06:16Z

+    ontologies: list[tuple[str, str | None]] = []
+    for resource in resources:
+        if (
+            isinstance(resource, AgentContextResourceConfig)
+            and resource.is_datafabric_ontology
+        ):
+            for item in resource.ontology_set or []:
+                ontologies.append((item.name, item.folder_key))
+    return ontologies


if I understand correctly, we implicitly assume all topologies will apply to this data service entity context. Shouldn't the link be more explicitly defined? IE either:
a) when defining an Data Service Context resource you can also specify one or more ontologies
b) when defining the Ontology Context resource you specify the list of entities it describes

I am currently working on adding the R2RML mapping which will resolve the entities from ontologies at the agent runtime by the llm node implicitly (I am working on it in separate PR and is currently in progress ).

…ring ontology prompt)

…ontology tool

…ame)" This reverts commit eebdfc2.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

…builder self-contained

…ogy-fetch-tool # Conflicts: # pyproject.toml # tests/agent/tools/test_datafabric_prompt_builder.py

Copilot

Pull request overview

Copilot reviewed 15 out of 15 changed files in this pull request and generated 3 comments.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

sonarqubecloud · 2026-07-03T07:29:29Z

Quality Gate failed

Failed conditions
0.0% Coverage on New Code (required ≥ 90%)

See analysis details on SonarQube Cloud

feat(datafabric): add fetch_ontology tool to DF inner SQL agent

c6e73eb

Copilot AI review requested due to automatic review settings June 16, 2026 12:50

Merge branch 'main' into feat/datafabric-ontology-fetch-tool

b67e170

Copilot started reviewing on behalf of sankalp-uipath June 16, 2026 12:51 View session

Copilot AI reviewed Jun 16, 2026

View reviewed changes

feat(datafabric): resolve ontology from agent.json binding (name + fo…

da19087

…lder)

milind-jain-uipath reviewed Jun 17, 2026

View reviewed changes

Comment thread src/uipath_langchain/agent/tools/datafabric_tool/ontology_client.py Outdated

milind-jain-uipath requested changes Jun 17, 2026

View reviewed changes

sankalp-uipath added 5 commits June 17, 2026 15:23

refactor(datafabric): fetch ontology via SDK EntitiesService.get_onto…

4c22b8f

…logy_file (drop local client)

feat(datafabric): support multiple ontologies per context (ontologySet)

68f7cbf

Merge remote-tracking branch 'origin/main' into feat/datafabric-ontol…

ab77d65

…ogy-fetch-tool

fix(datafabric): end loop on any successful SQL; drop env-var ontolog…

40acdec

…y fallback

test(datafabric): cover ontology fetch tool, subgraph routing, and fa…

7a5bb69

…ctory mapping

sankalp-uipath requested a review from Copilot June 22, 2026 07:26

Copilot started reviewing on behalf of sankalp-uipath June 22, 2026 07:26 View session

Copilot AI reviewed Jun 22, 2026

View reviewed changes

fix(datafabric): return only terminal tool msgs on END; drop ToolMess…

04f79c5

…age.status to match host node

sankalp-uipath requested a review from Copilot June 22, 2026 08:05

Copilot started reviewing on behalf of sankalp-uipath June 22, 2026 08:06 View session

Copilot AI reviewed Jun 22, 2026

View reviewed changes

perf(datafabric): fetch configured ontologies concurrently (asyncio.g…

0ed6210

…ather)

sankalp-uipath requested a review from Copilot June 22, 2026 19:58

Copilot started reviewing on behalf of sankalp-uipath June 22, 2026 19:59 View session

Copilot AI reviewed Jun 22, 2026

View reviewed changes

sankalp-uipath requested a review from milind-jain-uipath June 23, 2026 07:39

sankalp-uipath and others added 5 commits June 23, 2026 18:19

feat(datafabric): resolve ontologies via ontology_refs

e9c4cfb

Merge branch 'main' into feat/datafabric-ontology-fetch-tool

be5ef26

chore: consume uipath dev build (#1728) to unblock CI

1fd7a30

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

chore: revert temp dev-build pin; fix datafabric test mypy

a871a0a

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Merge branch 'main' into feat/datafabric-ontology-fetch-tool

dfdd3d6

Copilot AI reviewed Jun 23, 2026

View reviewed changes

Comment thread src/uipath_langchain/agent/tools/datafabric_tool/datafabric_subgraph.py

Comment thread src/uipath_langchain/agent/tools/datafabric_tool/ontology_fetch_tool.py Outdated

Merge branch 'main' into feat/datafabric-ontology-fetch-tool

a07adb9

sankalp-uipath mentioned this pull request Jun 24, 2026

feat(datafabric): fetch ontology R2RML alongside OWL #935

Open

sankalp-uipath added 2 commits June 25, 2026 13:43

refactor(datafabric): resolve ontologies from nested ontologySet

54db78f

refactor(datafabric): gather ontologies from datafabricontology context

941f3ff

UIPath-Harshit reviewed Jun 26, 2026

View reviewed changes

feat(datafabric): gate fetch_ontology behind DataFabricOntologyEnable…

86e5912

…d flag

sankalp-uipath force-pushed the feat/datafabric-ontology-fetch-tool branch from 8b04daa to 86e5912 Compare June 29, 2026 20:13

andreitava-uip reviewed Jun 30, 2026

View reviewed changes

sankalp-uipath added 3 commits June 30, 2026 16:06

test(datafabric): drop ontology referenceKey fixture

826f036

refactor(datafabric): gate ontology flag at every entry; share flag c…

e57d1b0

…onstant

Merge remote-tracking branch 'origin/main' into feat/datafabric-ontol…

2f41f40

…ogy-fetch-tool

andreitava-uip reviewed Jun 30, 2026

View reviewed changes

refactor(datafabric): address review nits (split bind test, single-st…

7fab6d5

…ring ontology prompt)

sankalp-uipath force-pushed the feat/datafabric-ontology-fetch-tool branch from fbb0bea to 9a4a187 Compare July 1, 2026 09:01

refactor(datafabric): inject ontology into system prompt, drop fetch_…

a35807b

…ontology tool

sankalp-uipath force-pushed the feat/datafabric-ontology-fetch-tool branch from 9a4a187 to a35807b Compare July 1, 2026 10:04

sankalp-uipath changed the title ~~feat(datafabric): add fetch_ontology tool to DF inner SQL agent~~ feat(datafabric): inject ontology schema into inner SQL agent system prompt Jul 1, 2026

sankalp-uipath changed the title ~~feat(datafabric): inject ontology schema into inner SQL agent system prompt~~ feat(datafabric): standalone ontology tool grounded on OWL + R2RML Jul 2, 2026

sankalp-uipath and others added 6 commits July 3, 2026 02:15

feat(datafabric): standalone ontology tool (R2RML-driven, flag-gated)

4edb26a

refactor(datafabric): call get_ontology_bundle_async (SDK rename)

eebdfc2

Revert "refactor(datafabric): call get_ontology_bundle_async (SDK ren…

0f99ac0

…ame)" This reverts commit eebdfc2.

build(datafabric): require uipath>=2.12.5 / uipath-platform>=0.1.91

ee0532c

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

refactor(datafabric): keep entity prompt builder untouched; ontology …

72dc9cb

…builder self-contained

Merge remote-tracking branch 'origin/main' into feat/datafabric-ontol…

28d284d

…ogy-fetch-tool # Conflicts: # pyproject.toml # tests/agent/tools/test_datafabric_prompt_builder.py

sankalp-uipath requested a review from Copilot July 3, 2026 07:07

Copilot started reviewing on behalf of sankalp-uipath July 3, 2026 07:07 View session

Copilot AI reviewed Jul 3, 2026

View reviewed changes

Comment thread src/uipath_langchain/agent/tools/datafabric_tool/ontology_fetcher.py Outdated

Comment thread src/uipath_langchain/agent/tools/datafabric_tool/datafabric_ontology_prompt_builder.py

Comment thread pyproject.toml

refactor(datafabric): drop unused logger from ontology_fetcher

89cb44e

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Uh oh!

Conversation

sankalp-uipath commented Jun 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

How it works (first invocation, cached)

Key modules

Design decisions

Feature flag

Security

Testing

Notes / dependencies

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sankalp-uipath Jun 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

sankalp-uipath commented Jun 16, 2026 •

edited

Loading

sankalp-uipath Jun 29, 2026 •

edited

Loading

sankalp-uipath Jul 1, 2026 •

edited

Loading

sankalp-uipath Jul 1, 2026 •

edited

Loading