Data TransformationText & StringIntermediate

Find Text

Infoveave Data Automation — Text & String

Write the regex pattern once — Infoveave extracts the matching tokens from every text column row on every scheduled run.

Product descriptions, support ticket notes, log messages, and customer feedback fields frequently embed structured data — product codes, order IDs, error codes, hashtags, or numeric identifiers — inside free-form text. Extracting those structured pieces manually or with ad-hoc scripts is fragile and does not scale. Find Text extracts structured tokens from text columns using a regex pattern, placing each capture group match into its own new column. The result is a clean, structured dataset where embedded identifiers are now explicit columns that can be filtered, joined, and aggregated.

Input:Tabular dataset with one or more text columns containing unstructured or semi-structured contentOutput:Tabular dataset with new columns containing each regex match group extracted from the source column

What Find Text does

Extract structured tokens, codes, keywords, and patterns from text columns in Infoveave using regular expressions. Parse product codes, IDs, hashtags, and structured substrings from unstructured text automatically.

When to use Find Text

  • A text column contains embedded structured tokens — such as product codes, order IDs, error codes, or hashtags — that you need to extract as separate columns for filtering, joining, or reporting
  • You are parsing semi-structured log messages, support ticket text, or event descriptions where known patterns repeat across rows but the exact position varies
  • You want to validate and extract standard formats such as email addresses, postal codes, dates written as text, or numeric identifiers from a description or notes column
  • You need to produce new structured columns from a single catch-all text field in a source system that stores multiple pieces of information in one column

When to avoid it

  • You want to replace matching text rather than extract it — use Find and Replace for in-place value substitution
  • You want to split the column into fixed-position columns by a delimiter — use Split Column for delimiter-based horizontal splitting
  • The patterns you need to extract are not consistent enough for a regex — use Summarize with AI for unstructured natural language extraction that does not follow a clear pattern

Where it fits in your Infoveave automation

Find Text is one step inside a multi-step Infoveave workflow. Chain it with other activities — no code, no manual hand-offs.

ConnectRead data from log system, ERP, CRM, supplier feed, or file export with semi-structured text columns
You are hereFind TextExtract structured tokens from the text column using a regex pattern into new named columns
JoinJoin the extracted code or ID columns against reference tables to enrich the dataset
Filter or AggregateFilter by extracted values or aggregate metrics grouped by the extracted structured tokens
AutomateSchedule the workflow to extract tokens from incoming text data automatically on every run

Build this workflow visually in Infoveave Data Automation — drag, connect, and schedule with no infrastructure setup.

Infoveave — Workflow Builder
● SavedSchedule: Daily 06:00
Data SourceConnectRead data from log system,…YOU ARE HEREFind TextExtract structured tokens …JoinJoin the extracted code or…Filter or AggregateFilter by extracted values…AutomateSchedule the workflow to e…Dashboard

How teams use Find Text

Real scenarios where this transformation saves hours of manual work.

Technology

Extract Error Codes and Component IDs from Application Logs

A DevOps team processes application log records where each log message contains an error code pattern like ERR-1234 and a component identifier like COMP-56. Find Text uses a regex pattern to extract error codes and component IDs from the log message column into separate columns, allowing the team to filter by specific error codes, aggregate error frequency per component, and route alerts based on error type — without parsing the raw log text in every downstream query.

Retail

Parse Product Codes from Mixed Description Fields

A retail data team processes supplier product feeds where product codes like SKU-AB1234 are embedded inside a product description column along with free-form text. Find Text extracts the SKU code from each description into a dedicated product_code column using a regex pattern matching uppercase letters and digits with the SKU prefix. The extracted codes can then be joined against the inventory reference table without manual text parsing.

Financial Services

Extract Transaction Reference Numbers from Payment Notes

A financial operations team processes payment records where each payment notes field contains a transaction reference number in a defined format embedded within free-form text. Find Text extracts the reference number pattern into a dedicated column, enabling reconciliation against the external payment processor ledger by reference number without parsing the notes field in every reconciliation query.

See Find Text in action

Input data (left) is transformed using the configuration below. The output table (right) is ready for dashboards or downstream steps.

Columns To Find:Description
Pattern:One or more uppercase letters (3 or more characters)
Output Columns Prefix:Column_
Include Original:Enabled

Input Data

IDDescription
1This contains ABC and XYZ
2Find CODE inside this text
3No pattern matches here
4Extract INFO and DATA points
5SAMPLE test for extraction

Output Data

IDDescriptionColumn_1Column_2
1This contains ABC and XYZABCXYZ
2Find CODE inside this textCODE
3No pattern matches here
4Extract INFO and DATA pointsINFODATA
5SAMPLE test for extractionSAMPLE

Configuration

Key fields to configure in the Infoveave workflow builder. Full reference available in the documentation.

Pattern

A regular expression that defines what to extract from the text column. Use capturing groups within the pattern to capture each piece you want as a separate output column. Each capture group match produces one output column named using the prefix and a sequential number. For example, a pattern with two capture groups on a row that matches both groups produces two output columns per row.

Output Columns Prefix

All new columns generated from the regex matches are named using this prefix followed by a sequence number — for example Column_1, Column_2 for a prefix of Column_. Set a meaningful prefix that reflects what the extracted values represent, such as Code_ or ID_, to make the output columns self-describing.

Include Original

When enabled, the original source text column is retained alongside the new extracted columns. When disabled, only the extracted columns appear in the output. Retaining the original is useful for auditing the extraction results or for using the source text in subsequent parsing steps.

Frequently asked questions

Everything you need to know about Find Text in Infoveave.

Also in Text & String — and what runs before & after

Transformations in the same family as Find Text, often chained together in the same Infoveave workflow.

Part of Infoveave Data Automation

80+ transformations. Zero manual steps.

Find Text is one of over 80 transformation activities available inside Infoveave workflows. Chain transformations together — no code, no exports, no waiting for IT.

Ready to see Infoveave in action?

Book a Demo
ISO 27001ISO 27017ISO 27701GDPRHIPAACCPAAICPACSR LogoCapterra Reviews — Infoveave

© 2026 Noesys Software Pvt Ltd

Infoveave® is a product of Noesys

All Rights Reserved