Product Docs
  • What is Dataworkz?
  • Getting Started
    • What You Will Need (Prerequisites)
    • Create with Default Settings: RAG Quickstart
    • Custom Settings: RAG Quickstart
    • Data Transformation Quickstart
    • Create an Agent: Quickstart
  • Concepts
    • RAG Applications
      • Overview
      • Ingestion
      • Embedding Models
      • Vectorization
      • Retrieve
    • AI Agents
      • Introduction
      • Overview
      • Tools
        • Implementation
      • Type
      • Tools Repository
      • Tool Execution Framework
      • Agents
      • Scenarios
      • Agent Builder
    • Data Studio
      • No-code Transformations
      • Datasets
      • Dataflows
        • Single Dataflows:
        • Composite dataflows:
        • Benefits of Dataflows:
      • Discovery
        • How to: Discovery
      • Lineage
        • Features of Lineage:
        • Viewing a dataset's lineage:
      • Catalog
      • Monitoring
      • Statistics
  • Guides
    • RAG Applications
      • Configure LLM's
        • AWS Bedrock
      • Embedding Models
        • Privately Hosted Embedding Models
        • Amazon Bedrock Hosted Embedding Model
        • OpenAI Embedding Model
      • Connecting Your Data
        • Finding Your Data Storage: Collections
      • Unstructured Data Ingestion
        • Ingesting Unstructured Data
        • Unstructured File Ingestion
        • Html/Sharepoint Ingestion
      • Create Vector Embeddings
        • How to Build the Vector embeddings from Scratch
        • How do Modify Existing Chunking/Embedding Dataflows
      • Response History
      • Creating RAG Experiments with Dataworkz
      • Advanced RAG - Access Control for your data corpus
    • AI Agents
      • Concepts
      • Tools
        • Dataset
        • AI App
        • Rest API
        • LLM Tool
        • Relational DB
        • MongoDB
        • Snowflake
      • Agent Builder
      • Agents
      • Guidelines
    • Data Studio
      • Transformation Functions
        • Column Transformations
          • String Operations
            • Format Operations
            • String Calculation Operations
            • Remove Stop Words Operation
            • Fuzzy Match Operation
            • Masking Operations
            • 1-way Hash Operation
            • Copy Operation
            • Unnest Operation
            • Convert Operation
            • Vlookup Operation
          • Numeric Operations
            • Tiles Operation
            • Numeric Calculation Operations
            • Custom Calculation Operation
            • Numeric Encode Operation
            • Mask Operation
            • 1-way Hash Operation
            • Copy Operation
            • Convert Operation
            • VLookup Operation
          • Boolean Operations
            • Mask Operation
            • 1-way Hash Operation
            • Copy Operation
          • Date Operations
            • Date Format Operations
            • Date Calculation Operations
            • Mask Operation
            • 1-way Hash Operation
            • Copy Operation
            • Encode Operation
            • Convert Operation
          • Datetime/Timestamp Operations
            • Datetime Format Operations
            • Datetime Calculation Operations
            • Mask Operation
            • 1-way Hash Operation
            • Copy Operation
            • Encode Operation
            • Page 1
        • Dataset Transformations
          • Utility Functions
            • Area Under the Curve
            • Page Rank Utility Function
            • Transpose Utility Function
            • Semantic Search Template Utility Function
            • New Header Utility Function
            • Transform to JSON Utility Function
            • Text Utility Function
            • UI Utility Function
          • Window Functions
          • Case Statement
            • Editor Query
            • UI Query
          • Filter
            • Editor Query
            • UI Query
      • Data Prep
        • Joins
          • Configuring a Join
        • Union
          • Configuring a Union
      • Working with CSV files
      • Job Monitoring
    • Utility Features
      • IP safelist
      • Connect to data source(s)
        • Cloud Data Platforms
          • AWS S3
          • BigQuery
          • Google Cloud Storage
          • Azure
          • Snowflake
          • Redshift
          • Databricks
        • Databases
          • MySQL
          • Microsoft SQL Server
          • Oracle
          • MariaDB
          • Postgres
          • DB2
          • MongoDB
          • Couchbase
          • Aerospike
          • Pinecone
        • SaaS Applications
          • Google Ads
          • Google Analytics
          • Marketo
          • Zoom
          • JIRA
          • Salesforce
          • Zendesk
          • Hubspot
          • Outreach
          • Fullstory
          • Pendo
          • Box
          • Google Sheets
          • Slack
          • OneDrive / Sharepoint
          • ServiceNow
          • Stripe
      • Authentication
      • User Management
    • How To
      • Data Lake to Salesforce
      • Embed RAG into your App
  • API
    • Generate API Key in Dataworkz
    • RAG Apps API
    • Agents API
  • Open Source License Types
Powered by GitBook
On this page
  • Introduction
  • Pre-requisites
  • Concepts and Terminology
  • Building a RAG Application
  1. Guides
  2. How To

Embed RAG into your App

How to embed Gen AI into an existing application using the Dataworkz RAG API

PreviousData Lake to SalesforceNextAPI

Last updated 10 months ago

Introduction

Dataworkz simplifies RAG app development for businesses. Dataworkz offers RAG as a Service so that you can build GenAI applications using proprietary data, utilizing either public LLM APIs or privately hosted open source foundation models.

With Dataworkz's RAG builder, enterprises gain the ability to construct their AI stack — vector database, embedding model, and LLM — and connect with various sources of business data including SaaS, databases and more.

Dataworkz RAG Builder offers a REST API to integrate QnA capabilities into modern web applications. Using this API you can AI-enable your applications and allow users to ask free-form natural language questions of your data and receive high-quality, human-readable response along with the authoritative sources the response originated from. Read on to learn how to AI-enable your applications.

If you would like to see a sample application that you can try, Dataworkz has published a here that demonstrates API integration.

Pre-requisites

  • A Dataworkz RAG QnA system has already been configured in your account. Use .

  • You will need the host name of the Dataworkz RAG service that you are targeting (e.g. https://genai.dataworkz.com)

  • An for the target Dataworkz system.

  • Make sure all Angular CLI dependencies are up to date on your machine

Concepts and Terminology

QnA System - multiple QnA systems can be configured in a single Dataworkz account that provides separation between different systems. Each system can have its own set of documents and data transformation, parameters and configuration. This can be used to power different applications, isolate data from different departments, versions, systems, etc. When you make a query you select the QnA system you want to target.

LLMProvider - Dataworkz RAG Builder makes it easy to configure and work with multiple Large Language Models (LLM) including those from OpenAI, Google Vertex AI and privately hosted Open Source LLMs.

Questions - Users can ask questions from the QnA system. Dataworkz will leverage the power of RAG to generate answers from relevant pieces of content from the data corpus associated with a QnA system.

Monitoring - Dataworkz RAG Builder supports monitoring the performance of the RAG system, provides scores, validates LLM responses to prevent hallucinations, and allows users to provide feedback to questions.

Building a RAG Application

Dataworkz RAG APIs that you can use to build your own AI powered applications are listed below -

Authentication

For all APIs, an API Key must be passed in the Authorizationheader of the request. The value of the header should have the form SSWS <api-token>.

Listing QnA Systems

Getting a list of LLMs

Asking a question

Users can ask free-form natural language questions for the selected QnA system powered using Dataworkz RAG. Dataworkz RAG will retrieve appropriate data and augment the LLM prompt to get an easy-to-consume response back to show to the user. In addition, all sources used by the LLM to formulate its response are also returned which allows your application to list authoritative sources.

Getting previously asked questions

To summarize, Dataworkz RAG builder abstracts the operational complexity of building a GenAI app and provides an API for developers to embed the GenAI apps into existing workflows/apps.

All QnA systems in an account can be listed using the . Depending on your use-case, you can decide to show the systems to the user or pick the appropriate one for subsequent operations using the systemId returned by this API.

All can be retrieved. Questions can be asked from a specific LLM or a user-selected LLM or even from multiple LLMs in order to compare performance and select a more performant (or one with a better cost-performance ratio) LLM as per your use-case. Each LLM instance is identified by its id returned by this API.

Use this capabilities. You will need the systemId of the QnA System you are targeting as well the llmProviderId of the LLM you want to target and the question text.

You can retrieve previously asked questions, their answers and scores associated with the LLM responses using .

sample app
these steps to build a QnA system
API Key
systems listing API
LLMs configured for a system
API for Question and Answer
this API