- Data collection
- Data Maturity
- Data Modelling
- Data Ownership
- Data Warehouses
- Open Source
- People and process
- Snowplow Insights
Measuring and validating Core Web Vitals using Snowplow and Great Expectations in GCP
In this post we'll look at how to collect, enrich, model and finally validate some Core Web Vitals metrics that are critical for measuring the performance of pages on your site. Why does this matter? These measures effectively produce a...
The 2021 database showdown: BigQuery vs Redshift vs Snowflake
Last updated: 16th June 2021. For updates, changes and errata, please see the Changelog at the end of this post. What is this guide? - Designed as a fair feature comparison between the different products - An up-to-date...
Building an SQL data model: slides
Data models are a key part of the modern digital analytics stack. They allow you to build upon event-level data and interpret it through shared business rules. The models you create will evolve with you as your business changes and...
Paper over your mistakes with data models
Mistakes happen. In the data world, your ugly mistakes live on forever. It’s not just the embarrassment that’s a problem though. Gaps and obvious errors in historical data distract your stakeholders from more important matters. Explaining the anomalies and getting...
Quantifying content velocity in Snowplow
Adam Greco is something of a legend in the Adobe Analytics space. I’ve been reading his blog posts and learning from him since I first started using Omniture back in 2007 or so. He literally wrote the book on Omniture and then Adobe...
Modelling your Snowplow event data: Part 5 Automation
In the first four parts of this series, we modelled out: Pageviews (accurate time spent incorporating page pings and maximum scroll depth); Sessions (traffic sources and initial landing details); Users (filtering internal traffic and looking up things we know about the users). Now it’s...
Modelling your Snowplow event data: Part 4 Users
In the first three parts of this series, we looked at modelling out pageview events to include accurate time spent and scroll depth, then classifying sessions based on what we know about where the user came from. Now we’re going to look...
Modelling your Snowplow event data: Part 3 Sessions
In the first two parts of this series, we looked at modelling out pageview events to include accurate time spent and scroll depths. Now we’ll roll up sessions. A session or visit to a web site is when a user comes to...
Modelling your Snowplow event data: Part 2 Pageviews
In the first part of this series on data modelling we went through the background for building a data model. In this edition we’ll go through the steps to create a basic pageview model that incorporates page pings so we can see accurate...