Infoveave Data Automation — Numeric
jOHN DOE becomes John Doe, leading spaces vanish, and accented é becomes e — all applied per column in one step so lookups and deduplication actually work.
Text data collected from forms, imports, and APIs arrives with inconsistent formatting that breaks matching, deduplication, and classification logic. A customer name stored as JOHN DOE in one system and john doe in another will not match in a join — even though they represent the same person. Category fields imported from different regional teams may use title case in some files and uppercase in others. Addresses with trailing spaces cause lookup mismatches. Normalize Columns applies a configurable set of text standardization rules to each column independently — case conversion, whitespace removal, special character stripping, and accent normalization — so the data that enters downstream matching, grouping, and classification steps is consistently formatted across all rows and sources.
Standardize text columns in Infoveave using title case, uppercase, lowercase, whitespace removal, accent normalization, and special character stripping. Prepare consistent text data for matching, deduplication, lookups, and classification pipelines without writing cleaning scripts.
Normalize Columns is one step inside a multi-step Infoveave workflow. Chain it with other activities — no code, no manual hand-offs.
Build this workflow visually in Infoveave Data Automation — drag, connect, and schedule with no infrastructure setup.
Real scenarios where this transformation saves hours of manual work.
A retail data team ingests product catalog files from regional teams where the same category appears as IT & Tech in one file, it tech in another, and IT TECH in a third. Normalize Columns applies Title Case and Remove Special Characters to the category column, converting all three variants to It Tech after special character removal. A subsequent Find and Replace step corrects the It to IT for the final IT Tech label. The standardized category allows clean pivot and grouping operations across all regional files.
A procurement team processes supplier invoices from multiple entry points where the same supplier name is recorded as ACME CORP., Acme Corp, and acme_corp. Normalize Columns applies Title Case, Remove Whitespace, and Remove Special Characters to the supplier name column, producing AcmeCorp from all three variants. This normalized form is used as the matching key against a supplier master data table, reducing false non-matches that previously required manual review.
A financial operations team processes international wire transfer records where beneficiary names include characters like é, ü, ñ, and ç from European and Latin American source systems. SWIFT payment message standards require plain ASCII character encoding. Normalize Columns applies Normalize Accents to the beneficiary name column, converting accented characters to their unaccented ASCII equivalents — é to e, ü to u — so processed records comply with the encoding requirements of the downstream payment messaging system.
Input data (left) is transformed using the configuration below. The output table (right) is ready for dashboards or downstream steps.
Title CaseUppercaseRemove Special Characters, Title CaseEnabledInput Data
| ID | Name | Description | Category |
|---|---|---|---|
| 1 | jOHN DOE | Software Engineer | IT & Tech |
| 2 | jane SMITH | Data Scientist | Analytics |
| 3 | mark_o'leary | Machine Learning | AI & ML |
Output Data
| ID | Name | Description | Category |
|---|---|---|---|
| 1 | John Doe | SOFTWARE ENGINEER | IT Tech |
| 2 | Jane Smith | DATA SCIENTIST | Analytics |
| 3 | Mark O'Leary | MACHINE LEARNING | AI ML |
Key fields to configure in the Infoveave workflow builder. Full reference available in the documentation.
Column Map
Assign one or more normalization techniques to each text column. Techniques can be combined — for example applying Remove Special Characters first, then Title Case — and are applied in the order they are configured. Each column gets its own independent set of techniques, so Name can use Title Case while Category uses Remove Special Characters and Title Case in the same step.
Normalization Techniques
Available options include: Convert to lowercase (alice), Convert to uppercase (ALICE), Convert to title case (Alice Smith), Capitalize first letter (Alice smith), Trim whitespace (removes leading and trailing spaces), Remove whitespace (deletes all spaces), Remove special characters (strips punctuation and symbols like &, %, #, '), and Normalize accents (replaces é with e, ü with u, ñ with n). Select the combination that produces the target format for each column.
Include Original
Retain the source column alongside the normalized version. Useful for audit comparisons and when original values are needed as lookup references while normalized values are used as matching keys.
Everything you need to know about Normalize Columns in Infoveave.
Transformations in the same family as Normalize Columns, often chained together in the same Infoveave workflow.
Part of Infoveave Data Automation
Normalize Columns is one of over 80 transformation activities available inside Infoveave workflows. Chain transformations together — no code, no exports, no waiting for IT.
Ready to see Infoveave in action?