-
Divergent Session Counts: GA4 vs BigQuery's Events_Intraday Table
We’ve run into a bit of a puzzle here. The session number from GA4 for November, at 559,555, is way more than what we’re calculating from the BigQuery table, which is 468,991. A bit odd, right? We’re leaning more towards BigQuery’s number as it aligns better with our actual traffic and Google Analytics 360 data. A thing to note – we started noticing these discrepancies when we introduced the eCommerce event on our site. Not sure if there’s a connection or not.
Here’s what our questions boil down to:
1. Wouldn’t you expect the GA4’s screen numbers and the BigQuery data to be matched up or at least along similar lines?
2. What’s the best way to get these numbers to align more closely? We really need them to make sense.To give you the full picture, we’ve been calculating session numbers in BigQuery with this formula:
SELECT HLL_COUNT.EXTRACT( HLL_COUNT.INIT( CONCAT( user_pseudo_id, (SELECT
value
FROM UNNEST(event_params) WHERE key = 'ga_session_id' LIMIT 1).int_value), 12)) AS session_count, FROMbigquery-public-data.ga4_obfuscated_sample_ecommerce.events_*
If you’re interested, here’s the link to the HLL guide we’ve been following: https://developers.google.com/analytics/blog/2022/hll.
You guys are the pros here. We’re all ears for any advice or insights you can provide so we can sort out this issue.
Log in to reply.