Data Mapping

Warehouse Sync can be used to ingest two types of data:

User attribute data: data describing your users
Event data: data describing actions your users take

If you want to ingest event data, you must use the Field Transformations API to map the source data coming from your warehouse to the mParticle event data schema.

What is a field transformation?

A field transformation maps external data (such as a column, row, field, or more complex data object) to an event attribute or field within the mParticle platform. When ingesting data through a warehouse sync pipeline, a field transformation tells mParticle exactly where to store each new piece of data within the context of the JSON schema, the overarching definition for how data is organized in mParticle.

Field transformations are JSON-formatted specifications created using the Field Transformations API, a subcomponent of the mParticle Platform API. The Field Transformations API is grouped with the Platform API instead of the Warehouse Sync API because its functionality is not limited to Warehouse Sync, and future mParticle features may use field transformations. The Field Transformations API provides a structured method of mapping one data object to another.

Example field transformation

Imagine the following simple data table and mParticle JSON data schema:

Example source database table:

eventId	sessionId	timeStamp	eventType
1234	5678	1402521613976	screen_view
…	…	…	…

Example mParticle JSON data schema:

{
  "events": [
    {
      "data": {
        "event_id": 1234,
        "session_id": 5678,
        "timestamp_unixtime_ms": 1402521613976
      },
      "event_type": "screen_view"
    }
  ]
}

When ingesting this data through a warehouse sync pipeline, we need to map each source column of our table to the appropriate fields in the mParticle JSON schema:

Source column name	Destination field name
`eventId`	`event_id`
`sessionId`	`session_id`
`timeStamp`	`timestamp_unixtime_ms`
`eventType`	`event_type`

The field transformation would be:

{
  "id": "example-field-transformation-id",
  "name": "Example Field Transformation",
  "destination_type": "event_batch",
  "mappings": [
    {
      "mapping_type": "column",
      "source": "$eventId",
      "destination": "events[].data.event_id"
    },
    {
      "mapping_type": "column",
      "source": "$sessionId",
      "destination": "events[].data.session_id"
    },
    {
      "mapping_type": "column",
      "source": "$timeStamp",
      "destination": "events[].data.timestamp_unixtime_ms"
    },
    {
      "mapping_type": "column",
      "source": "$eventType",
      "destination": "events[].event_type"
    }
  ],
  "created_on": "2023-11-14T21:15:43.182Z",
  "created_by": "developer@example.com",
  "last_modified_on": "2023-11-14T21:15:43.182Z",
  "last_modified_by": "developer@example.com"
}

Note that each field mapping is listed as an individual item within the array called mappings.

When we refer to a destination field in the mParticle JSON schema, we use a simplified JSON path that reflects the nested structure of events data. For example, events[].data.event_id refers to the field called event_id in the data object that sits within the events array. You can find a detailed explanation of the JSON path format in the Field Transformation API reference.
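To make the path notation concrete, the following is a minimal Python sketch of how column mappings could be applied to one source row. It is an illustration only, not mParticle's implementation: the helper names set_path and apply_column_mappings are hypothetical, the sketch always writes into the first element of an array segment, and stripping the leading $ from source is an assumption based on the example above.

```python
def set_path(batch, path, value):
    """Write value into batch at a simplified JSON path like 'events[].data.event_id'."""
    node = batch
    parts = path.split(".")
    for part in parts[:-1]:
        if part.endswith("[]"):
            # Array segment: this sketch always uses the first element.
            items = node.setdefault(part[:-2], [{}])
            node = items[0]
        else:
            node = node.setdefault(part, {})
    node[parts[-1]] = value


def apply_column_mappings(row, mappings):
    """Build an event batch from one source row using 'column' mappings only."""
    batch = {}
    for mapping in mappings:
        if mapping["mapping_type"] != "column":
            continue
        # Assumption: a leading '$' marks a column reference and is not part of the name.
        column = mapping["source"].lstrip("$")
        set_path(batch, mapping["destination"], row[column])
    return batch


row = {"eventId": 1234, "sessionId": 5678,
       "timeStamp": 1402521613976, "eventType": "screen_view"}
mappings = [
    {"mapping_type": "column", "source": "$eventId",
     "destination": "events[].data.event_id"},
    {"mapping_type": "column", "source": "$sessionId",
     "destination": "events[].data.session_id"},
    {"mapping_type": "column", "source": "$timeStamp",
     "destination": "events[].data.timestamp_unixtime_ms"},
    {"mapping_type": "column", "source": "$eventType",
     "destination": "events[].event_type"},
]
batch = apply_column_mappings(row, mappings)
```

With the example row, batch has the same shape as the example mParticle JSON data schema shown earlier.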

Mappings array

Source data fields and their destinations in mParticle are expressed using the mappings array in a field transformation. For every data object (either a column, single value, or array) you want to map, include a separate configuration object in the mappings array.

Each mapping object in the mappings array can be configured with the following settings:

mapping_type: specifies the way the source data is mapped. Options include:
- column: maps a column in your database to a destination in mParticle
- static: maps the value given to the value property to a destination field in mParticle
- ignore: excludes the source data defined for source from being ingested
source: the name of the column or field being mapped from
destination: the name of the field in mParticle being mapped to
value: used with a static or column mapping type. The value assigned to this property will be mapped directly to the mParticle field set in destination
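
As a rough aid for checking mappings before sending them, the sketch below validates a single mapping object against the settings listed above. The function name check_mapping is hypothetical, and the per-type required keys are an assumption inferred from the descriptions in this section, not rules confirmed by the API reference.

```python
VALID_MAPPING_TYPES = {"column", "static", "ignore"}

# Assumption: keys each mapping type needs, inferred from the settings list above.
REQUIRED_KEYS = {
    "column": ("source", "destination"),
    "static": ("destination", "value"),
    "ignore": ("source",),
}


def check_mapping(mapping):
    """Return a list of problems found in one mapping object (empty if none)."""
    mapping_type = mapping.get("mapping_type")
    if mapping_type not in VALID_MAPPING_TYPES:
        return [f"unknown mapping_type: {mapping_type!r}"]
    return [
        f"{mapping_type} mapping is missing {key!r}"
        for key in REQUIRED_KEYS[mapping_type]
        if key not in mapping
    ]


ok = check_mapping({"mapping_type": "column", "source": "$eventId",
                    "destination": "events[].data.event_id"})
bad = check_mapping({"mapping_type": "static",
                     "destination": "source_info.channel"})
```

Here ok is an empty list, while bad reports that the static mapping has no value.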

How to create a field transformation

There are five steps to creating and using a field transformation:

Identify your source data: these are the names of the columns, fields, or rows of data in your data warehouse
Identify your field destinations: these are the field names as they exist in the mParticle JSON schema
Write your mappings: create the individual mapping objects for each data object you want to map
Create your field transformation: send an API request to the Field Transformations API with your new mappings
Add your field transformation to a pipeline: reference the ID of your field transformation when creating your warehouse sync pipeline

Step 1: Identify your source data

This first step is accomplished when setting up a warehouse sync pipeline. Part of the warehouse sync configuration requires you to create a data model. This data model is a SQL query that mParticle sends to your warehouse to retrieve the names of the data columns and fields in your warehouse that your pipeline will ingest.

These column and field names are what you supply for the values of the source setting of a mapping.

For more information about writing SQL queries for your data model, see the Warehouse Sync SQL Reference.

Step 2: Identify your field destinations

Your field destinations are the names of the fields you want to map your source data to, as specified in the mParticle JSON data structure.

Since data in mParticle is structured as a series of nested JSON objects and arrays, mappings refer to these fields using simplified JSON paths.

If a field name within your destination contains a period (.) character, then the field name must be escaped using curly braces and single quotes. For example: {{'field.name'}}. This is due to how Warehouse Sync parses through nested objects and fields when creating your mappings. If a period is found that doesn't indicate a nested object or field, you will receive an error.

This is commonly encountered when mapping to an mParticle custom flag, such as TikTok.URL. Because URL isn't a field nested within an object called TikTok, Warehouse Sync returns an error unless you enclose the entire field name in curly braces and single quotes: {{'TikTok.URL'}}.
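
If you generate destination paths programmatically, a small helper can apply this escaping for you. The sketch below is only an illustration: escape_field_name and join_destination are hypothetical helper names, and the parent path used in the usage line is chosen for demonstration rather than taken from this guide.

```python
def escape_field_name(name):
    """Wrap a field name in {{'...'}} when it contains a period."""
    if "." in name:
        return "{{'" + name + "'}}"
    return name


def join_destination(parent_path, field_name):
    """Append a (possibly escaped) field name to a simplified JSON path."""
    return parent_path + "." + escape_field_name(field_name)


# The parent path here is an illustrative assumption.
destination = join_destination("events[].data.custom_flags", "TikTok.URL")
```

Field names without a period are returned unchanged, so the helper can be applied to every field name.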

Step 3: Write your mappings

For every column of data you plan to ingest, you must create a mapping so that mParticle can determine where to put the data in that column.

Mappings are configured using a combination of the mapping_type, source, destination, and value settings.

Example mapping:

{
  "mapping_type": "column",
  "source": "$eventId",
  "destination": "events[].data.event_id"
}

To learn about the specific mapping settings, see Mapping object settings in the Field Transformations API reference.

Step 4: Create your field transformation

After you have written a mapping object for each of your source data fields, create a field transformation by sending a POST request to the following endpoint:

https://api.mparticle.com/platform/v2/workspaces/{workspaceId}/transformations/fields

The body of your API request must contain:

A unique id for your field transformation. This ID is referenced when creating your warehouse sync pipeline.
A unique name for your field transformation.
The destination_type for your field transformation. Currently, the only valid value is event_batch.
The mappings array you created in step 3.

Example field transformation request body:

{
  "id": "unique-id",
  "name": "your-field-transformation-name",
  "destination_type": "event_batch",
  "mappings": [
    {
      "mapping_type": "column",
      "source": "your-column-name",
      "destination": "mparticle-field"
    }
  ]
}
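
The request described above could be assembled in Python roughly as follows. This is a sketch under stated assumptions: build_create_request is a hypothetical helper name, and sending the credential as a Bearer token in the Authorization header is an assumption, so check the Platform API documentation for the authentication it actually requires. The sketch only builds the request and does not send it.

```python
import json
import urllib.request


def build_create_request(workspace_id, token, transformation):
    """Build (but do not send) the POST request that creates a field transformation."""
    url = (
        "https://api.mparticle.com/platform/v2/workspaces/"
        f"{workspace_id}/transformations/fields"
    )
    return urllib.request.Request(
        url,
        data=json.dumps(transformation).encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Assumption: bearer-token authentication.
            "Authorization": f"Bearer {token}",
        },
    )


transformation = {
    "id": "unique-id",
    "name": "your-field-transformation-name",
    "destination_type": "event_batch",
    "mappings": [
        {"mapping_type": "column", "source": "your-column-name",
         "destination": "mparticle-field"}
    ],
}
request = build_create_request("1234", "example-token", transformation)
```

To send the request, pass it to urllib.request.urlopen(request) and read the response body.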

Step 5: Add your field transformation to a warehouse sync pipeline

When creating a warehouse sync pipeline, you can use the ID of the field transformation you just created as the value for field_transformation_id.

Mark historical batches with `source_info.is_historical`

If your pipeline is ingesting batches of older data (data that’s over 30 days old), you should explicitly mark those batches as historical so that Warehouse Sync applies historical-processing rules.

Add a boolean column to your SQL data model (for example, is_historical) that evaluates to TRUE when the event timestamp is beyond your chosen threshold:

SELECT
  *,
  CASE
    WHEN DATEDIFF('day', event_timestamp, CURRENT_TIMESTAMP()) > 30 THEN TRUE ELSE FALSE
  END AS is_historical
FROM your_source_table;

Create two mappings in your field transformation:

[
  {
    "mapping_type": "column",
    "source": "is_historical",
    "destination": "source_info.is_historical"
  },
  {
    "mapping_type": "static",
    "destination": "source_info.channel",
    "value": "server_to_server"
  }
]

Setting source_info.is_historical to true tells mParticle to treat the batch as historical: events are not forwarded in real time to any connected event, data warehouse, or audience outputs, timestamp validation is relaxed, and the data remains available for long-term use cases such as extended-lookback audiences.
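
To sanity-check the threshold logic outside your warehouse, the same rule can be sketched in Python. This is an approximation only: the function name is_historical is hypothetical, and it compares exact elapsed time, whereas a SQL DATEDIFF on days may count calendar-day boundaries depending on your warehouse.

```python
from datetime import datetime, timedelta, timezone


def is_historical(event_time, now, threshold_days=30):
    """Return True when event_time is more than threshold_days before now."""
    return now - event_time > timedelta(days=threshold_days)


now = datetime(2024, 3, 1, tzinfo=timezone.utc)
old_event = datetime(2024, 1, 1, tzinfo=timezone.utc)      # 60 days earlier
recent_event = datetime(2024, 2, 20, tzinfo=timezone.utc)  # 10 days earlier
flags = [is_historical(old_event, now), is_historical(recent_event, now)]
```

Only the first event in this example would be flagged as historical.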

Grouping commerce and crash event data

All data in mParticle is stored in one of two field types:

List fields: contain lists of data objects, such as ecommerce products, ecommerce impressions, or error data
Single-value fields: contain things like transaction IDs, email addresses, or product names. This includes everything NOT on a product, impression, or error datum

When mParticle ingests commerce or crash event data from your warehouse, data that is mapped to a list field can be grouped within a single “event” according to a shared unique ID that you specify (like a transaction ID for a commerce product action).

To group commerce or crash data into a list field in mParticle, you must:

Find an ID, which we will refer to as a message ID, that uniquely identifies your commerce or crash event. For example: an event ID or transaction ID.
Map your unique message ID to the events[].data.source_message_id field in mParticle.
Map the remaining columns in your source data to their respective fields in mParticle. Note the following:
- Columns mapped to lists in mParticle may have multiple values across rows, even if each row has the same message ID value.
- Columns mapped to single-value fields must have the same value across all rows that share a source message ID. If a row has the same source message ID but a different value for a single-value field, that row cannot be mapped to mParticle.

Columns mapped to single-value fields like transaction and event data MUST have the same value for all rows with the same source message ID. This includes values set to NULL, if values are not ignored when NULL for that column.
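
One way to catch violations of this rule before running a pipeline is to scan your source rows for conflicting single-value columns. The sketch below is a hypothetical pre-check, not part of Warehouse Sync: find_conflicts and its parameters are names introduced for illustration, and NULL is represented as Python's None.

```python
from collections import defaultdict


def find_conflicts(rows, id_column, single_value_columns):
    """Map each message ID to the single-value columns whose values differ across its rows."""
    first_seen = {}
    conflicts = defaultdict(set)
    for row in rows:
        message_id = row[id_column]
        # Compare every row against the first row seen for the same message ID.
        reference = first_seen.setdefault(message_id, row)
        for column in single_value_columns:
            if row.get(column) != reference.get(column):
                conflicts[message_id].add(column)
    return {message_id: sorted(cols) for message_id, cols in conflicts.items()}


rows = [
    {"transaction_id": 1, "action": "purchase", "name": "shirt"},
    {"transaction_id": 1, "action": "purchase", "name": "hat"},
    {"transaction_id": 1, "action": "refund", "name": "scarf"},
    {"transaction_id": 2, "action": "purchase", "name": "socks"},
]
conflicts = find_conflicts(rows, "transaction_id", ["action"])
```

In this example, transaction 1 is reported because its rows disagree on action; name is not checked because it maps to a list field.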

The list fields in the mParticle JSON schema that commerce and crash event data can be mapped to are:

events[].data.product_impressions
events[].data.product_impressions[].products
- You can create an additional mapping between any unique string ID and events[].data.product_impressions[].product_impressions_list to further segment product impressions into product impression lists.
events[].data.promotion_action.promotions
events[].data.product_action.products
events[].data.shopping_cart.products
events[].data.breadcrumbs
- This is the source_message_id of the crash_report event type.

How is commerce and crash data grouped?

mParticle groups event data into list fields if, and only if, a shared unique ID (such as a transaction ID or promotion ID) is mapped from your source data to the events[].data.source_message_id field in mParticle.

As mParticle ingests data, any time a unique set of source field values is mapped to a list field, a new object is added to that list.

For example, if a source field is mapped to events[].data.product_action.products[].name, then a new product object will be added to events[].data.product_action.products[] whenever a unique value for name is found.

Example product action grouping

Imagine that an ecommerce customer purchases three different items in the same transaction. If Warehouse Sync were to ingest that event data without the shared transaction ID, it would create three separate events with product lists in mParticle, one for each of the three purchased items.

However, if a mapping exists between the source transaction ID and the source_message_id, Warehouse Sync creates a single product list containing all three items.

Each row sharing the same events[].data.source_message_id must be able to be grouped within the same event, otherwise your pipeline will return an error. In the source data below, each row sharing the same transaction_id of 1 must have the same value for action, which is purchase.

Example source data

transaction_id	action	name	price
1	`purchase`	shirt	20.00
1	`purchase`	hat	10.00
1	`purchase`	scarf	5.00

Example mapping

[
  {
    "mapping_type": "column",
    "source": "transaction_id",
    "destination": "events[].data.source_message_id"
  },
  {
    "mapping_type": "column",
    "source": "action",
    "destination": "events[].data.product_action.action"
  },
  {
    "mapping_type": "column",
    "source": "name",
    "destination": "events[].data.product_action.products[].name"
  },
  {
    "mapping_type": "column",
    "source": "price",
    "destination": "events[].data.product_action.products[].price"
  }
]

Example output

{
  "events": [
    {
      "data": {
        "source_message_id": "1",
        "product_action": {
          "action": "purchase",
          "transaction_id": 1,
          "products": [
            {
              "name": "shirt",
              "price": "20.00"
            },
            {
              "name": "hat",
              "price": "10.00"
            },
            {
              "name": "scarf",
              "price": "5.00"
            }
          ]
        }
      }
    }
  ]
}

Without mapping transaction_id to source_message_id, the output would instead be:

{
  "events": [
    {
      "data": {
        "product_action": {
          "action": "purchase",
          "products": [
            {
              "name": "shirt",
              "price": "20.00"
            }
          ]
        }
      }
    },
    {
      "data": {
        "product_action": {
          "action": "purchase",
          "products": [
            {
              "name": "hat",
              "price": "10.00"
            }
          ]
        }
      }
    },
    {
      "data": {
        "product_action": {
          "action": "purchase",
          "products": [
            {
              "name": "scarf",
              "price": "5.00"
            }
          ]
        }
      }
    }
  ]
}
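
The grouping behavior shown in these two outputs can be approximated with a short Python sketch. It is a simplified illustration, not Warehouse Sync's actual algorithm: group_product_actions is a hypothetical function, the column names come from the example source data, and prices are kept as strings to match the example output.

```python
def group_product_actions(rows):
    """Group rows that share a transaction_id into one event with a product list."""
    events = {}
    for row in rows:
        message_id = str(row["transaction_id"])
        event = events.setdefault(
            message_id,
            {
                "data": {
                    "source_message_id": message_id,
                    "product_action": {"action": row["action"], "products": []},
                }
            },
        )
        product = {"name": row["name"], "price": row["price"]}
        products = event["data"]["product_action"]["products"]
        # Only a unique set of list-field values adds a new product object.
        if product not in products:
            products.append(product)
    return {"events": list(events.values())}


rows = [
    {"transaction_id": 1, "action": "purchase", "name": "shirt", "price": "20.00"},
    {"transaction_id": 1, "action": "purchase", "name": "hat", "price": "10.00"},
    {"transaction_id": 1, "action": "purchase", "name": "scarf", "price": "5.00"},
]
grouped = group_product_actions(rows)
```

Because all three rows share the same transaction_id, grouped contains a single event whose product list holds the shirt, hat, and scarf.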

Example product impression grouping

Example source data

event_id	impression_group_id	product_impressions	product_name	product_price
1	1	“banner ad”	“shirt”	20.00
1	1	“banner ad”	“hat”	10.00
1	2	“video ad”	“scarf”	5.00

Example mapping

[
  {
    "mapping_type": "column",
    "source": "event_id",
    "destination": "events[].data.source_message_id"
  },
  {
    "mapping_type": "column",
    "source": "product_impressions",
    "destination": "events[].data.product_impressions[].product_impressions_list"
  },
  {
    "mapping_type": "column",
    "source": "product_impressions",
    "destination": "events[].data.product_impressions"
  },
  {
    "mapping_type": "column",
    "source": "product_name",
    "destination": "events[].data.product_impressions[].products[].name"
  },
  {
    "mapping_type": "column",
    "source": "product_price",
    "destination": "events[].data.product_impressions[].products[].price"
  }
]

Example output

{
  "events": [
    {
      "data": {
        "source_message_id": "1",
        "product_impressions": [
          {
            "product_impression_list": "banner ad",
            "products": [
              {
                "name": "shirt",
                "price": "20.00"
              },
              {
                "name": "hat",
                "price": "10.00"
              }
            ]
          },
          {
            "product_impression_list": "video ad",
            "products": [
              {
                "name": "scarf",
                "price": "5.00"
              }
            ]
          }
        ]
      }
    }
  ]
}
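
The two-level grouping in this output (rows into events, then products into impression lists) can be approximated with the Python sketch below. It is a hypothetical illustration rather than mParticle's actual logic: `group_product_impressions` is an invented name, and the sketch assumes that rows are grouped into events by `event_id` and that products within an event are grouped by the value mapped to `product_impression_list`.

```python
def group_product_impressions(rows):
    """Group flat rows into events, then into impression lists (sketch only)."""
    events = {}
    for row in rows:
        # First level: one event per distinct event_id.
        event = events.setdefault(
            row["event_id"],
            {
                "data": {
                    "source_message_id": str(row["event_id"]),
                    "product_impressions": [],
                }
            },
        )
        impressions = event["data"]["product_impressions"]
        # Second level: reuse the impression list with the same name, if any.
        group = next(
            (g for g in impressions
             if g["product_impression_list"] == row["product_impressions"]),
            None,
        )
        if group is None:
            group = {
                "product_impression_list": row["product_impressions"],
                "products": [],
            }
            impressions.append(group)
        group["products"].append(
            {"name": row["product_name"], "price": row["product_price"]}
        )
    return {"events": list(events.values())}


# Rows mirroring the example source data (prices kept as strings).
impression_rows = [
    {"event_id": 1, "product_impressions": "banner ad",
     "product_name": "shirt", "product_price": "20.00"},
    {"event_id": 1, "product_impressions": "banner ad",
     "product_name": "hat", "product_price": "10.00"},
    {"event_id": 1, "product_impressions": "video ad",
     "product_name": "scarf", "product_price": "5.00"},
]

impression_output = group_product_impressions(impression_rows)
```

Under these assumptions, `impression_output` contains a single event with a "banner ad" list holding two products and a "video ad" list holding one.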

Field Transformation examples

The following field transformation examples cover common use cases:

Ignoring fields with null values
Ignoring empty fields
Mapping multiple fields to an mParticle array
Mapping multiple fields in an object to multiple mParticle fields

Ignore null values

Example source data

first_name	premium_start_date	premium_discounts_applied	free_trial_expiration	free_trial_ad_impressions
“Bob”	`2023-11-29`	`["blackfriday20"]`	null	null

Example mapping

[
  {
      "mapping_type": "column",
      "source": "premium_*",
      "destination": "user_attributes.*",
      "ignore_when": "$null"
  },
  {
      "mapping_type": "column",
      "source": "free_trial_*",
      "destination": "user_attributes.*",
      "ignore_when": "$null"
  }
]

Example output

{
  "user_attributes": {
    "premium_start_date": "2023-11-29",
    "premium_discounts_applied": ["blackfriday20"]
  }
}

Ignore empty fields

Example source data

first_name	premium_start_date	premium_discounts_applied	free_trial_expiration	free_trial_ad_impressions
“Bob”	`2023-11-29`	`["cybermonday24"]`

Example mapping

[
  {
      "mapping_type": "column",
      "source": "premium_*",
      "destination": "user_attributes.*",
      "ignore_when": "$empty"
  },
  {
      "mapping_type": "column",
      "source": "free_trial_*",
      "destination": "user_attributes.*",
      "ignore_when": "$empty"
  }
]

Example output

{
  "user_attributes": {
    "premium_start_date": "2023-11-29",
    "premium_discounts_applied": ["cybermonday24"]
  }
}
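
The effect of `ignore_when` in the two examples above can be sketched in Python. This is a rough model, not the service's implementation: `apply_wildcard_mappings` is a hypothetical helper, and the sketch assumes that `$null` skips values that are null, that `$empty` skips empty strings, and that an empty warehouse cell arrives as an empty string.

```python
from fnmatch import fnmatchcase


def apply_wildcard_mappings(row, mappings):
    """Copy columns matching each wildcard source into user_attributes."""
    user_attributes = {}
    for mapping in mappings:
        rule = mapping.get("ignore_when")
        for column, value in row.items():
            if not fnmatchcase(column, mapping["source"]):
                continue
            # Assumption: "$null" skips None, "$empty" skips empty strings.
            if rule == "$null" and value is None:
                continue
            if rule == "$empty" and value == "":
                continue
            # The destination "*" is modeled as the full matched column name.
            user_attributes[column] = value
    return {"user_attributes": user_attributes}


null_row = {
    "first_name": "Bob",
    "premium_start_date": "2023-11-29",
    "premium_discounts_applied": ["blackfriday20"],
    "free_trial_expiration": None,
    "free_trial_ad_impressions": None,
}
empty_row = {
    "first_name": "Bob",
    "premium_start_date": "2023-11-29",
    "premium_discounts_applied": ["cybermonday24"],
    "free_trial_expiration": "",
    "free_trial_ad_impressions": "",
}


def mappings_for(rule):
    return [
        {"source": "premium_*", "ignore_when": rule},
        {"source": "free_trial_*", "ignore_when": rule},
    ]


null_output = apply_wildcard_mappings(null_row, mappings_for("$null"))
empty_output = apply_wildcard_mappings(empty_row, mappings_for("$empty"))
```

In both cases only the two `premium_*` attributes survive, matching the example outputs.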

Mapping multiple fields to an array

Example source data

first_name	favorite_store_1	favorite_store_2	favorite_store_3
“Bob”	“target”	“old navy”	“walmart”

Example mapping

{
  "mapping_type": "column",
  "source": "favorite_store_*",
  "destination": "user_attributes.favorite_stores[]",
  "value": "{{ value | upcase }}"
}

Example output

{
  "user_attributes": {
    "favorite_stores": [
      "TARGET",
      "OLD NAVY",
      "WALMART"
    ]
  }
}
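
As a rough Python equivalent of this mapping, the sketch below collects every column matching the wildcard into one list and uppercases each value. The helper `collect_into_array` is hypothetical, `str.upper` stands in for the `upcase` filter, and the sketch assumes values are appended in column order.

```python
from fnmatch import fnmatchcase


def collect_into_array(row, source_pattern, transform=str.upper):
    """Return transformed values of all columns matching source_pattern."""
    return [
        transform(value)
        for column, value in row.items()
        if fnmatchcase(column, source_pattern)
    ]


store_row = {
    "first_name": "Bob",
    "favorite_store_1": "target",
    "favorite_store_2": "old navy",
    "favorite_store_3": "walmart",
}

store_output = {
    "user_attributes": {
        "favorite_stores": collect_into_array(store_row, "favorite_store_*")
    }
}
print(store_output["user_attributes"]["favorite_stores"])
# ['TARGET', 'OLD NAVY', 'WALMART']
```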

Mapping fields from an object

Example source data

Suppose a column called profile_data contains the following object:

profile_data
`{"customer_id": "12345", "first_name": "Bob", "ua_shirt_size": "M", "ua_favorite_color": "blue"}`

Using the column mapping type and the * wildcard, we create the following mapping:

Example mapping

[
  {
    "mapping_type": "column",
    "source": "profile_data.customer_id",
    "destination": "user_identities.customer_id"
  },
  {
    "mapping_type": "column",
    "source": "profile_data.first_name",
    "destination": "user_attributes.$firstname"
  },
  {
    "mapping_type": "column",
    "source": "profile_data.ua_*",
    "destination": "user_attributes.*"
  }
]

Example output

{
  "user_identities": {
    "customer_id": "12345"
  },
  "user_attributes": {
    "$firstname": "Bob",
    "ua_shirt_size": "M",
    "ua_favorite_color": "blue"
  }
}
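
The same object mapping can be sketched in Python to show which keys land where. This is an illustration under stated assumptions, not the service's behavior: `map_profile_data` is a hypothetical helper, and the sketch assumes the `profile_data` column holds a JSON string.

```python
import json
from fnmatch import fnmatchcase


def map_profile_data(raw_json):
    """Map keys of a profile_data JSON object onto an mParticle-style batch."""
    profile = json.loads(raw_json)
    output = {"user_identities": {}, "user_attributes": {}}
    # Direct, single-key mappings.
    output["user_identities"]["customer_id"] = profile["customer_id"]
    output["user_attributes"]["$firstname"] = profile["first_name"]
    # Wildcard mapping: every ua_* key keeps its full name as the attribute key.
    for key, value in profile.items():
        if fnmatchcase(key, "ua_*"):
            output["user_attributes"][key] = value
    return output


profile_json = (
    '{"customer_id": "12345", "first_name": "Bob", '
    '"ua_shirt_size": "M", "ua_favorite_color": "blue"}'
)
profile_output = map_profile_data(profile_json)
```

Note that the wildcard keeps the `ua_` prefix in the attribute names, as in the example output.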

Last Updated: August 6, 2025