User Identification in GA4: The Issue of “Artificial” Sessions

Problem

Recently, several Sublime clients have noticed significant discrepancies between the session data reported by Sublime and that collected by Google Analytics 4 (GA4). These discrepancies, which ranged between 30-40%, were particularly alarming given that Sublime bases its analyses on raw GA4 data. A thorough investigation revealed that GA4 was recording additional sessions, users, and events not reflected in the raw data. Furthermore, deeper dives into GA4’s data exploration tab often showed that data broken down by selected dimensions did not match the total presented (in the same GA4 reports!).

Cause

The root cause was found in GA4’s user identification settings, which utilize machine-learning models to estimate the behaviors of users who switch browsers/devices or do not consent to tracking (Consent Mode). As a result, GA4 was effectively adding “artificial” data. Returning to user identification via GA cookies retrospectively (since changes in GA4 generally do not affect historical data) drastically altered the results, aligning them with those recorded in raw data.

User Identification Methods in GA4

GA4 has introduced more advanced identification options compared to its predecessor, Universal Analytics, which primarily relied on cookies tied to a specific browser:

  • Device-based: Similar to the method used in Universal Analytics, this approach relies solely on device identifiers or first-party cookies.
  • Observed: This method goes a step further by integrating data from cookies, Google Signals (if enabled), and user identifiers.
  • Blended: The most advanced option, combining all previous methods and employing machine learning to model user behavior.

User Identification in Sublime

To accurately identify users while avoiding tracking errors or opaque modeling in GA4, Sublime uses GA4 raw data (Device-Based Reporting Identity) and links it to customer identifiers derived from sales data. As a result, Sublime is capable of both retrospective and future attribution of user actions on the site, regardless of the browser or device used by the customer.

Conclusion

The implementation of these methods in GA4, particularly in conjunction with Google Consent Mode, can lead to data distortions. Some clients have reported not just an increase in user and session numbers, but also significant challenges in correctly attributing sessions to their real sources, resulting in an unexpected surge in direct traffic at the expense of other channels, especially paid ones. This underscores the importance of understanding and properly implementing new identification technologies in data analysis.

If your analytics adds sessions that never happened, can you really trust it? Schedule a free demo and see how to get back.

Read our latest articles

Sublime-Attomy
How to Grow on Data: Analytics in Shopify
GUIDE
Read more
Boosting Campaigns with CAPI
Every Transaction Counts: Boosting Campaigns with CAPI
FEATURES
Read more
The Ultimate KPI Dashboard for DTC Growth Leaders
The Ultimate KPI Dashboard for DTC Growth Leaders
FEATURES
Read more
Personalization in E-commerce: Balance Between Scale and Experience
Personalization in E-commerce: Balance Between Scale and Experience
FEATURES
Read more

BUSINESS AREAS

SUBLIME IS A PERFECT MATCH FOR

Marketing teams

Sublime connects every campaign to real business outcomes, from first click to long-term retention and LTV. With first-party attribution and margin-based performance tracking, it becomes clear which channels truly drive profitable growth.

  • True impact of Meta, TikTok & influencers
  • Optimize ad spend based on margin
  • Retention & LTV by acquisition source
  • New vs returning users per campaign
  • Best-selling products to marketing channels

E-commerce teams

Sublime helps e-commerce teams understand how customers shop, why they come back and which products truly build long-term value. All in one structured, user-level customer journey, without stitching reports from GA4, Shopify or incomplete reports spread across platforms.

  • Retention by cohort or customer segment
  • Behaviors driving churn and repeat purchases
  • Discount impact from revenue to Margin II
  • High-value product pairs & cross-sell opportunities
  • Activate segments in Meta, Klaviyo, TikTok

Data analysts

Sublime gives analysts a trusted foundation to work with: raw user-level data, unified into one consistent model. With a consistent model and full SQL access, data teams can build dashboards from custom metrics and segments to advanced attribution and retention models.

You can:

  • Build custom dashboards on unified data
  • Explore raw events with SQL
  • Integrate Sublime with your existing BI tools
  • Define new metrics and custom segments
  • Full customer journeys across channels

BUSINESS AREAS

SUBLIME IS A PERFECT MATCH FOR

FEATURES

DATA-DRIVEN CAPABILITIES

True value of Meta and TikTok campaigns.

Sublime Analytics Platform connects customer journeys across devices and channels to show which campaigns truly drive valuable users.

Unlike GA4 (Google Analytics 4) or marketing platform reports, it captures the full path to conversion and beyond.

Optimize marketing spend based on profit

Revenue doesn’t always reflect true business performance – what really matters is profit.

Sublime lets you evaluate campaign effectiveness based on contribution margin and optimize for POAS (Profit on Ad Spend), instead of just ROAS (Return on Ad Spend).

Influencer campaigns tracked and measured like channels

Traffic and conversions from influencer links and coupon codes are attributed directly in the model.

This puts creators on the same playing field as paid, organic, or affiliate channels.

LTV broken down by acquisition channels.

You can see how lifetime value evolves over time by channel – Meta, Google, email, influencer, and more. Focus the budget on sources that bring in loyal, profitable customers.

True value of Meta and TikTok campaigns.

Sublime Analytics Platform connects customer journeys across devices and channels to show which campaigns truly drive valuable users.

Unlike GA4 (Google Analytics 4) or marketing platform reports, it captures the full path to conversion and beyond.

Revenue doesn’t always reflect true business performance – what really matters is profit.

Sublime lets you evaluate campaign effectiveness based on contribution margin and optimize for POAS (Profit on Ad Spend), instead of just ROAS (Return on Ad Spend).

Traffic and conversions from influencer links and coupon codes are attributed directly in the model.

This puts creators on the same playing field as paid, organic, or affiliate channels.

You can see how lifetime value evolves over time by channel – Meta, Google, email, influencer, and more. Focus the budget on sources that bring in loyal, profitable customers.

Unlock your business potential with Sublime