-
Differences in reference source aggregation between GA4 sessions and exported BQ data
Hey there,
I’ve got this problem that’s driving me nuts! You see, I run this e-commerce site, and I’ve been using GA4. I’m also exporting data from GA4 to BigQuery using their handy export function (link: https://support.google.com/analytics/answer/9358801).
Now here’s where it gets tricky. I used GA4’s search tool to whip up a report counting how many sessions each referrer got us. Next, I went to BigQuery, crunched the GA4 data, and did the same counting for each session referrer. But when I compared the GA4 report with the BigQuery results, the session count for some referrers was way lower in the BigQuery results.
Basically, I used BigQuery to tally the session numbers for each referrer, using the string value of the source as the key to group by. Wrote a SQL query on BigQuery for this – I can share it if that could help.
There’s this weird thing though – depending on which referrer directed the session, the numbers can be almost identical. Like, BQ’s app showed some big differences, but Instagram was almost spot-on. And the total session counts matched up – it’s just that there were a lot of sessions with null referrers.
So I’m scratching my head wondering if GA4 is categorizing what should be ‘app’ and others as null on BigQuery. I even tried using REGEX to pull the utm_source from the page_location myself, but the same discrepancy appeared.
Do you have any idea why this might be happening, or how I can get around it? I could really use some advice here. Thanks in advance!
Log in to reply.