It’s hard to believe, but Universal Analytics had a lifespan of 11 years, from its announcement in March 2012. Despite occasional criticism, this service established standards for the entire web analytics industry. Many metrics and reports became benchmarks for a whole generation of marketers. It truly was an era.
For instance, a lot of marketers got used to starting each workday by inspecting dashboards and standard traffic reports in the Universal Analytics web interface. There were so, so many of those days. They became so accustomed to Universal Analytics that they would enter reports, manipulate numbers, and play with metrics almost on autopilot, without much thought.
However, six months have passed since the sunset of Universal Analytics – precisely on July 1, 2023, when Google stopped processing requests for resources using the previous version of Google Analytics. The time when data about visitors and their interactions with the website were more clearly structured within the UA paradigm is now in the past. GA4 has brought a plethora of opportunities to marketers, but along with those opportunities came a series of complexities.
GA4 issues
Since its initial announcement in 2020, GA4 has been plagued with errors and inconsistencies. It still has poor and sometimes illogical documentation, numerous restrictions, and peculiar interface solutions. But more importantly, the barrier to entry into web analytics has significantly increased.
If you diligently follow GA4 updates, read the documentation, and possess skills in working with data (SQL and basic statistics), you probably won’t feel any problems – you know how to set up a convenient and efficient environment for your product and marketing data. But what if you’re not that proficient? That’s when issues arise.
In this article, we try to address a series of straightforward questions that less experienced users – marketers, project managers, SEO specialists, and others – want answers to. They have no time to delve into the intricacies of GA4 but seek access to the fundamentals crucial for their functionality.
Previously, in Universal Analytics, they could quickly and conveniently address their issues. Now, the situation has become, to put it mildly, more complex. We’ve identified 8 such questions for which the current version of GA4 either fails to provide answers or implies that answers would require significant enhancements. So, let’s dive into them one by one.
Question 1: What are the most popular traffic sources on my website?
Seemingly a straightforward question. What does GA4 tell us? It responds with a question: « Which traffic source parameter are you interested in? »
Wait, what?
People just want to know which resources bring them the most traffic. Is that really an issue?
Unfortunately, yes. In GA4, there are not one, not two, but three traffic source parameters:
- Session source.
- First User Source – the source of the first session for each user.
- Just the source – determined at the event or conversion level.
If you wanted to open a report and draw conclusions quickly, we have bad news for you. Before you start ranking your traffic sources by popularity, you need to do some mental work on which parameter and in what context you will look. And even when you decide, you’ll need to make a choice in the selection of standard reports: work with the User Acquisition Report or Traffic Acquisition.
Yes, there is a difference between them: the first uses the First User Source parameter, and the second uses the session source. And you need to figure that out too.
Question 2: What is my conversion rate?
This question concerns everyone, and it should be simple, implying a straightforward answer. But no.
In GA4, there are three conversion metrics (yes, three):
- Session conversion – the percentage of sessions with a conversion.
- User conversion – the percentage of users who completed a conversion.
- First-time Purchaser Conversion – the share of active users who made their first purchase.
If the last metric doesn’t interest us much, GA4 users can still choose something from the remaining two. But what’s next? Which parameters to use for comparison? Session source or user source? What if you want to see the conversion rate for a specific event? And how do you do this in analyses rather than in standard reports?
In the end, instead of an answer to a simple question, marketers get a bunch of new questions.
Question 3. Can I trust user and session metrics?
Unfortunately, no. This may boggle the mind of those not well-versed in the mechanics of calculating user and session metrics, but it’s the plain truth: the numbers in GA4 and those in reality may and will differ.
The reason is that GA4 uses the HyperLogLog++ statistical algorithm to count unique values. Without delving into details, it’s a mechanism for approximate estimation of a metric with a certain level of error.
This error level is quite well-documented. For instance, for the Total Users metric, the error level is +/-1.63% (for a 95% confidence interval). In simple terms, this means that 100,000 users in the GA4 interface equate to 100,000 +/- 1.63% in reality.
Furthermore – but this is no surprise to anyone – GA4 samples data. This means that with too large a sample size or when using a large number of parameters, the application will assess your metrics based on a partial sample – let’s say 5, 10, or 30% of the entire population.
It’s a reasonable assumption, but it can (and probably will) surprise marketers – the metrics will deviate from reality. All end-users can do (excluding delving into raw data methodologies) is to take this error level into account in their conclusions.
Question 4. How do I calculate First Click attribution?
You can’t. Unfortunately, as of late, GA4 offers only three attribution models available in the Attribution tab: Last Click, Last Click For Google Ads, and Data Driven. First Click attribution is essential for understanding where and when demand is generated. In the previous version of Google Analytics (and until recently, in the current one), users could quickly apply First Click and other attribution models, compare them, and gain insights. Now, this capability is gone.
Certainly, you can look at the conversion distribution considering the First User Source parameter – this will be some proxy for First Click attribution. However, comparing it with others in the Model Comparison tab won’t be possible. In the context of the GA4 interface, it makes sense to forget about non-standard attribution models.
Question 5. How do I account for intra-session traffic?
Intra-session traffic essentially refers to a change in traffic sources within a session. Imagine a scenario where a user comes to your site organically from Google and, within a minute, comes from an email campaign. In the previous version of Google Analytics, a new session with the traffic source « e-mail » would be created in such a case. But now, the situation has changed.
A session now only ends in the case of a timeout – say, 30 minutes without interaction. This means a session will always have a source from which it started. If a user changes the source within a session (clicks on an ad, from email campaigns, and so on), you won’t know anything about it until they convert. This is a significant blow to intra-session traffic since their contribution to traffic remains virtually unnoticed.
Question 6. How can I account for users who have not consented to the use of third-party cookies?
You can’t. Google Consent Mode settings imply several options when a user rejects the use of 3rd party cookies. In GA4 and BigQuery, depersonalized cookieless pings will be sent. These pings do not contain specific client_id, session_id, or other custom dimensions. As a result, you won’t be able to consider them as users or link the actions of such users together.
Question 7. How can I compare data in explorations with the previous year?
The maximum data retention period for a free GA4 account is 14 months. This means that if the date range is wider, you can only use standard reports. You won’t be able to compare or view cohorts or funnels for periods more than 14 months ago. This makes the product functionality less rich because various report formats in explorations are very convenient for comparing specific metrics in easily digestible reports.
Of course, you always have the option to connect BigQuery and store raw data without limitations, but this process usually requires the involvement of an advanced analyst. And precisely this option is unavailable to most marketers in small teams.
Question 8. Is the data for yesterday accurate?
Unknown. Google declares that data processing in GA4 takes up to 48 hours. And although this process is faster, most users still have room for frustration. And they can be understood.
What does « data processing takes 24-48 hours » mean? When will the data in reports be complete? For yesterday? Or the day before yesterday? Or for all days that were more than two days ago? Unclear. What should marketers tell their managers when they were asked if all the data is in this report? Well, probably all of it… or maybe not… Let’s wait for 48 hours…
Undoubtedly, computational resources and time are needed for data preprocessing and aggregation. It’s okay that data for today will not be up-to-date. And probably not for yesterday either. But people just want to know when they can trust their data. Are they asking for too much: just a note that this report contains all the data sent and processed by Google Analytics?
What should you do?
Credit should be given to the Google team – they have done a lot to enable users to answer these questions in one form or another. For example, you can use data streaming in BigQuery and work with raw data. The entry threshold for this functionality has been significantly lowered. In fact, if you are dissatisfied with the GA4 interface, you can organize your export to BigQuery and create your own reports without (almost) any restrictions.
Another strong option is the widespread launch of GTM Server Side. This allows you to quite freely modify the event model and essentially enrich each hit with various parameters, doing this in a first-party context. This, of course, reduces the harmful impact of most of the limitations described in this text.
But this is not a solution.
The users in question – marketers, managers, developers – they do not want or do not have the time for a deep dive into the issue. And they want simple answers to simple (it seemed) questions. And for now, unfortunately, GA4 is more of a professional tool for analysts than a convenient instrument for generating insights for not very advanced users.
Why is this such a serious issue?
The thing is – and this is crucial – over the past 10 years, Google has managed to create a sort of GA-bubble for marketers. Many of them have become so accustomed to Google Analytics that when faced with another issue, they don’t venture to explore alternative solutions but attempt to solve it on their own. And almost always, this turns out to be expensive and inconvenient.
However, with the latest updates to GA4, it is becoming increasingly evident that this application is struggling to address even the most basic questions from users. And these questions are not fantastically complex. Much of what was described in this article is not an unsolvable mystery and is successfully addressed by other analytics services.
Let’s try to answer some of the questions described from the perspective of Matomo.
Question 1: What are the most popular traffic sources? [Solved]
In the Acquisition panel, you will find at least three easily identifiable reports – for traffic channels (All Channels), sources (Websites), and campaigns (Campaigns).
With these, you can quickly and easily answer the question about the most popular traffic sources, and if needed, delve into more detailed information, such as landing pages.
Question 2: What is my conversion rate? [Solved]
Under Goals in Matomo, you’ll easily find the overall conversion rate for your site. Below that you’ll have access to the conversion rate of each goal you’ve set in your Matomo instance.
Question 3: Can I trust user and session metrics? [Solved]
Yes. With Matomo, you’re guaranteed 100% accurate data. Matomo does not apply sampling, does not employ specific statistical algorithms, or any analogs of threshold values. Yes, it is possible, and it’s perfectly normal. If you see a metric in the visits or users field, it accurately represents reality by 100%.
Try Matomo for Free
Get the web insights you need, without compromising data accuracy.
Question 4: How do I calculate First Click attribution? [Solved]
You can do this in the same section where the other 5 attribution models, available in Matomo, are calculated – in the Multi Attribution section.
You can choose a specific conversion and, in a few clicks, calculate and compare up to 3 marketing attribution models. This means you don’t have to spend several days digging through documentation trying to understand how a particular model is calculated. Have a question – get an answer.
Question 5: How do I account for intra-session traffic? [Solved]
Matomo creates a new visit when a user changes a campaign. This means that you will accurately capture all relevant traffic if it is adequately tagged. No campaigns will be lost within a visit, as they will have a new utm_campaign parameter.
This is a crucial point because when the Referrer changes, a new visit is not created, but the key lies in something else – accounting for all available traffic becomes your responsibility and depends on how you tag it.
Try Matomo for Free
Get the web insights you need, without compromising data accuracy.
Question 6: How can I account for users who have not consented to the use of third-party cookies? [Solved]
Google Analytics requires users to accept a cookie consent banner with « analytics_storage=granted » to track them. If users reject cookie consent banners, however, then Google Analytics can’t track these visitors at all. They simply won’t show up in your traffic reports.
Matomo doesn’t require cookie consent banners (apart from in the United Kingdom and Germany) and can therefore continue to track visitors even after they have rejected a cookie consent screen. This is achieved through a config_id variable (the user identifier equivalent which is updating once a day).
This means that virtually all of your website traffic will be tracked regardless of whether users accept a cookie consent banner or not.
Question 7: How can I compare data in explorations with the previous year? [Solved]
There is no limitation on data retention for your aggregated reports in Matomo. The essence of Matomo experience lies in the reporting data, and consequently, retaining reports indefinitely is a viable option. So you can compare data for any timeframe.
The primary motive for purging aggregated data arises only when ongoing access is unnecessary, or when the data occupies server space. To address this, Matomo provides various options, allowing you to specify the types of reports that are routinely removed from your server.
Question 8: Is the data for yesterday accurate? [Solved]
Yes, it is. By default, Matomo reports are in real time when you are viewing reports for “today” or any date range that includes today. Matomo processes the reports according to your current logs every time that you request them. By default, reports for today are processed every 15 minutes (900 seconds).
Conclusion
Yes, the latest version of Google Analytics is currently generating more questions than answers. Indeed, marketers and other users with less extensive technical experience find it challenging to gain insights in the same manner as before. However, this doesn’t mean that answers to these questions don’t exist.
Furthermore, the market has evolved to the point where other services handle seemingly typical tasks much more efficiently than GA4. One successful example for such scenarios is Matomo. You can try Matomo for free for 21-days. No credit card required.
Try Matomo for Free
21 day free trial. No credit card required.