80+ built-in transformations for every data prep challenge — from pivoting and filtering to geospatial calculations and JSON reshaping. All inside your Infoveave automated workflow.
Showing 71 transformations
Group rows by any column and apply sum, average, or count inside your Infoveave workflow. Automates the data aggregation step analysts used to handle in SQL GROUP BY or Excel pivot tables.
Group rows by day, week, month, or year and apply sum, average, or count in your Infoveave workflow. Automated time-series aggregation replaces monthly report builds done by hand.
Calculate the difference between two date columns in Infoveave in years, months, days, hours, or other time units. Compute customer age, contract duration, days since last activity, delivery lead time, and any time-based KPIs from date column pairs.
Transform geographic coordinates in a geopoint column from one Coordinate Reference System to another in Infoveave. Convert between WGS84, Web Mercator, UTM, and other CRS projections so geospatial data aligns correctly across mapping tools, spatial analysis, and export formats.
Calculate the distance between geospatial locations in a dataset against a fixed reference point, a static geospatial object, or another geopoint column in Infoveave. Measure customer proximity, delivery distances, and logistics coverage without GIS software or PostGIS queries.
Merge two or more JSON array columns into a single concatenated array column in Infoveave. Combine tag lists, measurement sequences, feature vectors, or any ordered array data from multiple columns into one unified array without Python list addition, SQL ARRAY_CAT, or manual JSON string manipulation.
Merge values from two or more columns into a single combined column in Infoveave. Create composite keys, display labels, and formatted identifiers from existing fields with a configurable separator.
Convert multi-currency transaction amounts to a single target currency using live exchange rates in Infoveave. Standardize international salary, revenue, and expense columns containing ISO 4217 formatted values like 20000.00 USD to any target currency without maintaining exchange rate lookup tables.
Create copies of existing columns under new names in Infoveave workflows. Preserve original values before applying transformations, or produce parallel fields for comparison and audit.
Count how often a keyword, substring, or regex pattern appears across one or more columns in your dataset. Replaces manual COUNTIF logic with an automated, workflow-ready step in Infoveave.
Generate rectangular or circular buffer polygons around every geopoint in a dataset column in Infoveave. Create delivery zones, service coverage areas, geofences, and catchment boundaries automatically from WKT point coordinates without GIS software or PostGIS queries.
Combine separate latitude and longitude columns into a single WKT-format GeoPoint column in Infoveave. Convert raw coordinate pairs to POINT(longitude latitude) geometry for use in geospatial transformations, spatial analysis, and map visualizations without writing PostGIS or shapely code.
Combine selected columns from a tabular dataset into a single structured JSON object column in Infoveave. Package address fields, product attributes, event metadata, or any grouped columns into nested JSON for export, API payloads, and semi-structured storage — without JSON_OBJECT queries or json.dumps code.
Convert date columns from any input format to any custom output format in Infoveave with locale and timezone support. Standardize inconsistent date formats from multi-system sources into MM/DD/YYYY, DD-MM-YYYY, or any regional format for dashboards and reports.
Decrypt column values that were encrypted using the two-way symmetric Encrypt Column step in Infoveave. Recover original names, identifiers, account numbers, or any encrypted data by supplying the correct decryption password — without cryptography libraries, key management code, or database decryption functions.
Remove specific columns from a dataset in Infoveave workflows. Clean up JSON metadata, system fields, and sensitive data before downstream processing or export.
Encrypt sensitive column values using one-way irreversible hashing (SHA-256, SHA-1) or two-way symmetric reversible encryption in Infoveave. Protect PII, financial data, credentials, and personal identifiers in data pipelines without cryptography libraries or database encryption functions — and without exposing unencrypted data in intermediate pipeline steps.
Run arbitrary Python code against the current pipeline dataset in Infoveave. Apply custom transformations, calculations, conditional logic, regex operations, multi-column derivations, or any pandas operation not covered by built-in steps — by writing Python that receives the dataset as a DataFrame (df) and returns the modified DataFrame.
Extract individual date components — Year, Month, Day, Week, DayOfWeek, and more — from date columns in Infoveave as separate named columns. Build time dimensions for reporting, enable date-based grouping and filtering, and prepare date breakdowns for dashboards.
Extract a single element or a contiguous sub-array from array-type columns in Infoveave. Pull a value at a specific index or slice a range of elements from JSON array columns stored in tabular data — without Python list indexing, SPLIT_PART queries, or custom array parsing code.
Extract the centroid point, length, and area from WKT polygon and linestring geometry columns in Infoveave. Automatically derive spatial properties from POLYGON, LINESTRING, and POINT geometries without PostGIS ST_Centroid, ST_Length, or ST_Area queries, and without shapely or GIS software.
Extract latitude and longitude from a combined geopoint column into two separate numeric columns in Infoveave. Reverse-parse parenthetical coordinate strings such as (lat, lon) into typed Latitude and Longitude columns for calculations, filtering, and display without SPLIT or REGEX formulas.
Extract n-gram sequences — bigrams, trigrams, and custom-length phrase pairs — from text columns in Infoveave. Identify frequent phrase patterns in reviews, support tickets, and survey text with stop word removal and stemming for clean NLP pipeline preparation.
Extract numeric values from mixed text columns in Infoveave across four modes — several numbers, JSON array, expand notation like 1K to 1000, and decimal separator handling. Recover numeric data from unstructured descriptions, codes, and notes fields without manual parsing.
Extract specific values from JSON string columns using JSONPath expressions in Infoveave. Pull nested keys, array elements, and deep object values from semi-structured JSON data into flat output columns without PARSE_JSON queries, jq commands, or custom Python parsing code.
Fill empty or null cells in dataset columns using statistical methods or constant values in Infoveave workflows. Prepare complete datasets for reporting, ML, and downstream processing.
Detect and remove rows where columns contain the wrong type of data — URLs in text fields, IP addresses in name columns, booleans in numeric fields. Automated semantic data quality in Infoveave.
Keep, remove, or flag rows where a date column falls within a defined range in your Infoveave workflow. Replaces manual date-based filtering done in SQL WHERE clauses or Excel date filters.
Detect and handle rows containing empty strings, zeros, NaN, false, or undefined in your Infoveave workflow. Keep, remove, or flag falsy data before it distorts aggregations and reports.
Keep, remove, or flag rows using custom formula expressions in your Infoveave workflow. Write conditions like Amount divided by Boxes greater than 10 without SQL or code.
Keep, remove, or flag rows where numeric column values fall within or outside a min/max range in your Infoveave workflow. Validates numeric inputs and removes outliers without SQL or code.
Keep, remove, or flag rows matching a specific value using exact match, substring, or regex in your Infoveave workflow. Replaces manual category-based filtering in SQL WHERE or Excel filters.
Replace, standardize, and correct text values across dataset columns in Infoveave workflows. Match by substring, exact value, or regex and apply multiple replacement rules per column without code.
Extract structured tokens, codes, keywords, and patterns from text columns in Infoveave using regular expressions. Parse product codes, IDs, hashtags, and structured substrings from unstructured text automatically.
Automatically classify date column values as Holiday, Weekend, or Weekday in Infoveave using predefined and custom holiday lists. Enable business day filtering, time-series analysis with holiday exclusions, and attendance tracking pipelines.
Expand a JSON array column into individual rows in Infoveave. Convert embedded JSON arrays returned from APIs, NoSQL exports, or event logs into a flat, queryable row-per-element table.
Flatten JSON object columns into individual flat columns in Infoveave. Extract structured nested properties from JSON strings stored in dataset columns without writing parsing code.
Expand an array column into individual rows in Infoveave workflows. Normalize multi-value fields, product tags, and grouped log entries for filtering, joining, and visualization.
Transform multiple columns into key-value row pairs in Infoveave workflows. Reshape wide-form survey, time-series, or metric data into long format for charting, ML, and normalization.
Force numeric column values to stay within a user-defined minimum and maximum bound in Infoveave. Clip out-of-range values to the boundary or remove entire rows with outliers — all without writing filter formulas or manual data cleaning scripts.
Multiply a dataset's row count by a configured expansion factor to generate synthetic data for load testing, performance benchmarking, model training, and pipeline capacity validation in Infoveave. Scale a small reference dataset into a large synthetic dataset with key column uniqueness preserved — without scripting data generators or maintaining faker libraries.
Automatically fill empty or null cells in an identifier column with unique UUID v4 values in Infoveave. Generate surrogate keys for records that arrived without a primary identifier from source systems — without NEWID() queries, uuid4() Python calls, or spreadsheet GUID formulas — while leaving existing non-null identifiers unchanged.
Replace low-frequency or off-list categories with a single label like Others inside your Infoveave workflow. Cleans long-tail noise from charts and reports without manual filtering.
Reorder columns in a dataset by specifying their desired sequence in Infoveave workflows. Standardize column layouts across multiple sources for consistent downstream processing.
Invert the boolean values in selected columns in Infoveave — True becomes False and False becomes True. Flip active/inactive flags, invert eligibility indicators, reverse binary status columns, and correct inverted boolean imports without IF formulas, CASE WHEN queries, or bitwise NOT operations.
Standardize text columns in Infoveave using title case, uppercase, lowercase, whitespace removal, accent normalization, and special character stripping. Prepare consistent text data for matching, deduplication, lookups, and classification pipelines without writing cleaning scripts.
Convert continuous numeric data into labeled intervals like 20-40 or 40-60 inside your Infoveave workflow. Multiple binning modes from automatic statistical rules to fully custom ranges.
Apply display formatting to numeric columns using comma separators, Indian number format, percentage notation, and auto formatting in Infoveave. Prepare clean numeric outputs for dashboards, Excel exports, and presentation-ready reports without formula maintenance.
Convert long-format data into wide cross-tab tables inside your Infoveave workflow. No SQL, no Excel pivots — Pivot Columns reshapes data automatically on every scheduled run.
Remove duplicate records from your dataset by deduplicating on a key column in your Infoveave workflow. Keeps the first occurrence and drops all subsequent duplicates automatically.
Rename dataset column headers using a visual mapping in Infoveave workflows. Standardize inconsistent field names from external sources without writing SQL or code.
Automatically fill null and blank cells in Infoveave datasets with typed default values for String, Number, Date, and Boolean columns. Ensure data completeness before filters, aggregations, joins, and ML pipelines that require non-null inputs.
Round numeric column values using Round, Ceil, or Floor methods with configurable decimal precision in Infoveave. Prepare clean financial figures, scores, and price columns for reporting, comparison, and export without manual cell editing.
Retain only a chosen set of columns from a dataset in Infoveave workflows. Reduce dataset width, eliminate noise, and focus downstream steps on the fields that matter.
Sort dataset rows by one or more columns in ascending or descending order in Infoveave workflows. Rank, prioritize, and prepare data for consistent reporting and downstream processing.
Split a delimited column into multiple rows in Infoveave workflows. Normalize comma-separated tags, categories, or multi-value fields for filtering, aggregation, and analysis.
Turn a delimited multi-value column into binary indicator columns in Infoveave. One-hot encode tags, labels, or categories stored as comma-separated text for pivot analysis and ML features.
Split a single delimited column into multiple separate columns in Infoveave. Extract hierarchical paths, composite identifiers, and structured multi-part fields into individual named columns for filtering and analysis.
Split combined monetary value strings like 75000 USD or 85000 Rs into separate Amount and Currency columns in Infoveave. Enable numeric aggregation, currency filtering, and multi-currency financial reporting from raw monetary data columns.
Split email address columns into local part and domain part in Infoveave. Separate the username from the domain for domain-level analysis, organization segmentation, and email validation workflows.
Extract individual HTTP query string key-value pairs from URL columns into named columns in Infoveave. Turn raw query strings like id=E001&name=John into Key_id and Key_name columns for campaign tracking, session analysis, and API log parsing.
Break full URL strings into protocol, host, port, path, query, and fragment columns in Infoveave. Extract web metadata from URL columns for traffic analysis, path-level reporting, and query parameter investigation.
Replace or annotate column values with mapped labels using a configured key-value dictionary or numeric range rules in Infoveave. Convert grade codes to descriptions, score ranges to performance bands, category codes to display labels, and status codes to readable text — without VLOOKUP, CASE WHEN, or dictionary-based Python mapping code.
Automatically condense long text columns into concise summaries using Fovea AI inside your Infoveave workflow. No Python, no manual steps — runs on every scheduled execution.
Tokenize text columns into individual word tokens in Infoveave with NLP options including stop word removal, word stemming, and output as one token per row, one per column, or JSON. Prepare text data for word frequency, sentiment, and text analytics pipelines.
Describe any data transformation in plain English. Fovea AI generates and executes the code on your dataset automatically — no Python environment, no script management, no manual runs.
Apply column-level JavaScript expressions to tabular data inside your Infoveave workflow. Derive new calculated columns, obfuscate sensitive values, compute date differences, and handle conditional logic — using inline JavaScript without deploying custom code.
Apply SQL SELECT, GROUP BY, HAVING, and JOIN queries to tabular data inside your Infoveave workflow using the Transform using SQL activity. Filter, aggregate, derive columns, and reshape datasets with standard SQL syntax — no database required.
Collapse session or event-style rows into one structured record per session using a trigger value in Infoveave. Flatten log-format event streams into analyzable wide-format session records without code.
Convert unique category values into binary indicator columns in Infoveave workflows. One-hot encode status, type, or tag columns for machine learning, filtering, and pivot analysis.
Zip two or more JSON array columns into a single combined array column in Infoveave. Merge parallel array columns returned from APIs or event systems by index into structured objects without code.
See how Infoveave transformations work end-to-end in a live demo.
Ready to see Infoveave in action?