-
Troubleshooting Timeout Issue Running GA4 on Spark / Databricks
“Hi there, I’m running the GA4 Java libraries and have encountered a problem when running it in Databricks, even though the same code is running perfectly fine locally on my machine. It seems that this issue arises from going about it within the Spark cluster environment. I tried installing it on the cluster as a normal JAR and as a fat JAR using SBT in the clustered environment, but both methods resulted in a DEADLINE_EXCEEDED error, which appears to be a timeout after about 60 seconds. The odd thing is that the same cluster has successfully fetched data from the universal analytics API, which means the cluster has an internet connection. The same credentials are used for both my local machine and the GA4 that’s working. Also, I got an error when simply fetching metadata of a property, so it doesn’t seem to tie back to the size of the query. Can’t seem to understand the issue here. Do you think it might have something to do with dependency issues? Appreciate your input on this. Thanks! Eric”
Log in to reply.