String Calculation Operations

The following 'Calculation' operations are available for string-type transformations.


Last updated 1 year ago

Suffix

Adds a suffix to the chosen column. The suffix can be the value of another column or a fixed value (constant). The separator between the two values can be one of the following:

  1. None

  2. Space

  3. The character “-”

  4. The character “~”

  5. The character “_”

Prefix

Adds a prefix to the chosen column. The prefix can be the value of another column or a fixed value (constant). The separator between the two values can be one of the following:

  1. None

  2. Space

  3. The character “-”

  4. The character “~”

  5. The character “_”
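
The Suffix and Prefix behaviour described above can be sketched in a few lines of Python. This is only an illustration, not Dataworkz's implementation: the function names, the separator keys, and the sample row are all hypothetical.

```python
# Separator choices listed for Suffix and Prefix.
# The dictionary keys are illustrative names, not product identifiers.
SEPARATORS = {"none": "", "space": " ", "hyphen": "-", "tilde": "~", "underscore": "_"}


def add_suffix(value: str, suffix: str, separator: str = "none") -> str:
    """Append `suffix` to `value`, joined by the chosen separator."""
    return f"{value}{SEPARATORS[separator]}{suffix}"


def add_prefix(value: str, prefix: str, separator: str = "none") -> str:
    """Prepend `prefix` to `value`, joined by the chosen separator."""
    return f"{prefix}{SEPARATORS[separator]}{value}"


row = {"first_name": "Ada", "last_name": "Lovelace"}  # hypothetical row

# Suffix taken from another column, separated by a space.
print(add_suffix(row["first_name"], row["last_name"], "space"))  # Ada Lovelace
# Prefix given as a constant, separated by an underscore.
print(add_prefix(row["last_name"], "usr", "underscore"))  # usr_Lovelace
```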

Padding

Pads the string on the left or right with the specified character until it reaches the specified length. If more than one padding character is specified, only the first one is used.
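
As a rough sketch of the padding rule, the hypothetical helper below pads on the chosen side and uses only the first padding character. The function name and parameters are assumptions made for illustration. The sketch leaves values that are already at or beyond the target length unchanged, which is Python's behaviour and may differ from the product's.

```python
def pad(value: str, length: int, pad_with: str, side: str = "left") -> str:
    """Pad `value` to `length` on the given side using the first char of `pad_with`."""
    if not pad_with:
        raise ValueError("pad_with must contain at least one character")
    fill = pad_with[0]  # only the first character specified is used
    if side == "left":
        return value.rjust(length, fill)  # padding characters go on the left
    return value.ljust(length, fill)      # padding characters go on the right


print(pad("42", 5, "0"))            # 00042
print(pad("42", 5, "*#", "right"))  # 42***
```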

Concat

Concatenates the value in the selected column with the value of another column or a constant, producing a single attribute.

The separator between the two values can be one of the following:

  1. None

  2. Space

  3. The character “-”

  4. The character “~”

  5. The character “_”

Length

Returns the length of the value in the selected column.

Remove Special Characters

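| Name | Character |
|------|-----------|
| Dot | . |
| Comma | , |
| Backward slash | \ |
| Forward slash | / |
| Pound/Hash | # |
| Exclamation | ! |
| Dollar | $ |
| Percentage | % |
| Caret | ^ |
| Ampersand | & |
| Asterisk | * |
| Semicolon | ; |
| Colon | : |
| Open brace | { |
| Close brace | } |
| Equal | = |
| Hyphen | - |
| Underscore | _ |
| Back Quote | ` |
| Tilde | ~ |
| Open bracket | ( |
| Close bracket | ) |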
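
To illustrate, the sketch below strips the characters listed above from a string. It assumes the operation removes any chosen subset of those characters and leaves everything else, including spaces, untouched. The function and constant names are hypothetical.

```python
# The 22 characters from the list above; "\\" is a single backslash.
SPECIAL_CHARACTERS = ".,\\/#!$%^&*;:{}=-_`~()"


def remove_special_characters(value: str, characters: str = SPECIAL_CHARACTERS) -> str:
    """Delete every character in `characters` from `value`."""
    # str.maketrans with a third argument maps those characters to None (deletion).
    return value.translate(str.maketrans("", "", characters))


print(remove_special_characters("hello, world! (v1.0)"))  # hello world v10
# Restrict removal to a subset of the characters.
print(remove_special_characters("a_b-c#d", characters="_-"))  # abc#d
```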

Extract domain from URL

Extracts the domain from the URL in the string.
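
One way to approximate this operation is with Python's standard `urllib.parse` module, as in the sketch below. It assumes that "domain" means the full host name without port or credentials, which the documentation does not specify. The function name is hypothetical.

```python
from urllib.parse import urlparse


def extract_domain(url: str) -> str:
    """Return the host name of `url`, or an empty string if none is found."""
    # Without a scheme, urlparse treats the host as part of the path,
    # so prefix "//" to mark it as a network location.
    parsed = urlparse(url if "://" in url else f"//{url}")
    return parsed.hostname or ""


print(extract_domain("https://docs.example.com:8443/a?b=1"))  # docs.example.com
print(extract_domain("example.org/about"))                    # example.org
```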

Generate Stemming

Reduces words in the text to their base or root form. This eliminates variations of a word, such as different tenses, plurals, or derivations, which can improve text analysis and information retrieval tasks.

For example, the words “programming,” “programmer,” and “programs” can all be reduced to the common word stem “program.” In other words, “program” can stand in for all three inflected forms.

Remove Diacritics

Removes all diacritical marks from the text and returns text consisting only of standard Latin letters.

Diacritical mark: a sign, such as an accent or cedilla, which when written above or below a letter indicates a difference in pronunciation from the same letter when unmarked or differently marked.
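
A common way to strip diacritics, shown here only as an illustrative sketch and not as the product's implementation, is to decompose the text with Unicode normalization and drop the combining marks. The function name is hypothetical. Letters that have no decomposition, such as “ø” or “ß”, pass through this sketch unchanged.

```python
import unicodedata


def remove_diacritics(text: str) -> str:
    """Strip combining marks (accents, cedillas, ...) from `text`."""
    # NFD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # unicodedata.combining() is non-zero for combining marks, so keep only the rest.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(remove_diacritics("crème brûlée"))  # creme brulee
print(remove_diacritics("façade"))        # facade
```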

Split Value

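Splits an input string into two values. The following options control the split:

  • Delimiter: specified in the “Split By” textbox.

  • Direction: start from the beginning or the end of the string.

  • Occurrence: split at the first or the last occurrence of the delimiter.

  • Include pattern: optionally include the delimiter pattern in the result.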
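
The sketch below shows one possible reading of these options in Python. It is an assumption-laden illustration: direction and occurrence are collapsed into a single `from_end` flag, an included delimiter is kept at the end of the first value, and a missing delimiter yields an empty second value. The function and parameter names are hypothetical.

```python
def split_value(text: str, delimiter: str, from_end: bool = False,
                include_delimiter: bool = False) -> tuple[str, str]:
    """Split `text` into two values at one occurrence of `delimiter`."""
    # partition splits at the first occurrence, rpartition at the last one.
    head, sep, tail = text.rpartition(delimiter) if from_end else text.partition(delimiter)
    if not sep:  # delimiter not found: nothing to split
        return text, ""
    if include_delimiter:
        head += sep  # assumption: the delimiter stays with the first value
    return head, tail


print(split_value("a-b-c", "-"))                 # ('a', 'b-c')
print(split_value("a-b-c", "-", from_end=True))  # ('a-b', 'c')
print(split_value("user@example.com", "@", include_delimiter=True))  # ('user@', 'example.com')
```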

Pattern Matcher

Returns boolean true or false based on whether an input string matches the provided pattern.
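
For illustration, the sketch below treats the pattern as a regular expression that must match the whole input. Both points are assumptions, since the documentation does not state the pattern syntax or whether partial matches count. The function name is hypothetical.

```python
import re


def matches_pattern(value: str, pattern: str) -> bool:
    """Return True if the whole of `value` matches the regular expression `pattern`."""
    # fullmatch requires the entire string to match; re.search would accept partial matches.
    return re.fullmatch(pattern, value) is not None


print(matches_pattern("AB-1234", r"[A-Z]{2}-\d{4}"))   # True
print(matches_pattern("AB-12345", r"[A-Z]{2}-\d{4}"))  # False
```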

Substring

Returns a substring of an input string according to the specified start and end positions.

The substring begins at the specified start position (“Start From Index”) and extends up to, but does not include, the character at the specified end position (“End At Index”). Positions are counted from index 0.
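
These index rules (counting from 0, with the end position excluded) happen to match Python slicing, so a minimal sketch looks like this. The function and parameter names are hypothetical, and only non-negative indices are considered.

```python
def substring(value: str, start_index: int, end_index: int) -> str:
    """Return characters from `start_index` up to, but not including, `end_index`."""
    return value[start_index:end_index]


print(substring("Dataworkz", 0, 4))  # Data
print(substring("Dataworkz", 4, 9))  # workz
```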

Replace all occurrence

Replaces all occurrences of the specified input text in the string.