Read Our Blog

View category:

Measuring and validating Core Web Vitals using Snowplow and Great Expectations in GCP

In this post we'll look at how to collect, enrich, model and finally validate some core web vital metrics that are critical in measuring the performance of pages on your site. Why does this matter? These measures effectively produce a...

Three

The 2021 database showdown: BigQuery vs Redshift vs Snowflake

Last updated: 16th June 2021 For updates, changes and errata please see the Changelog at the end of this post. What is this guide? - Designed as a fair feature comparison between the different products - An up to date...

Simon presenting

Building an SQL data model: slides

Data models are a key part of the modern digital analytics stack. They allow you to build upon event-level data and interpret it through shared business rules. The models you create will evolve with you as your business changes and...

Paper over your mistakes with data models

Mistakes happen. In the data world, your ugly mistakes live on forever. It’s not just the embarrassment that’s a problem though. Gaps and obvious errors in historical data distract your stakeholders from more important matters. Explaining the anomalies and getting...

Quantifying content velocity in Snowplow

Adam Greco is something of a legend in the Adobe Analytics space. I’ve been reading his blog posts and learning from him since I first started using Omniture back in 2007 or so. He literally wrote the book on Omniture and then Adobe...

Modelling your Snowplow event data: Part 5 Automation

In the first four parts of this series, we modelled out: Pageviews: accurate time spent incorporating page pings and maximum scroll depthSessions: traffic sources and initial landing detailsUsers: filtering internal traffic and looking up things we know about the users Now it’s...

Modelling your Snowplow event data: Part 4 Users

In the first three parts of this series, we looked at modelling out pageview events to include accurate time spent and scroll depth, then classifying sessions based on what we know about where the user came from. Now we’re going to look...

Modelling your Snowplow event data: Part 3 Sessions

In the first two parts of this series, we looked at modelling out pageview events to include accurate time spent and scroll depths. Now we’ll roll up sessions. A session or visit to a web site is when a user comes to...

Modelling your Snowplow event data: Part 2 Pageviews

In the first part of this series on data modelling we went through the background for building a data model. In this edition we’ll go through the steps to create a basic pageview model that incorporates page pings so we can see accurate...

Older Entries