Author Profile

Mike Robins

CTO at Poplin Data

More from Mike Robins:

BigQuery query notifications using Cloud Logging and Monitoring

Recently in Measure Slack a question was asked about how to send notifications from a data quality check in BigQuery for a job that was being executed by a service account. This question - or variants of it - seem...

Better bot detection analytics data using reCaptcha

Snowplow already provides a few tools for identifying events from non-human devices ("crawlers", "bots", etc.) however most of these function based on metadata collected with events, like the User Agent string and IP address (e.g., the IAB Bots and Spiders...

Snowplow Chrome Inspector major version release

Snowplow Inspector Chrome extension version 0.2.18 released After a bit of a hiatus, we are pleased to announce a new release of our Snowplow Inspector extension for Chrome - and slightly less pleased to announce some additional fixes for regressions...

What is data quality? Part 1

Data quality is a battle against entropy over time. More often than not, quality is a function of both sheer determination and the unwavering conviction that better data leads to improved and more consistent outcomes. Although data quality isn’t by...

Measuring and validating Core Web Vitals using Snowplow and Great Expectations in GCP

In this post we'll look at how to collect, enrich, model and finally validate some core web vital metrics that are critical in measuring the performance of pages on your site. Why does this matter? These measures effectively produce a...

Data QnA: An exploration of a conversational query interface through the work of Stanley Kubrick

Data QnA is a recent feature from Google that builds on some underlying research from the Analyza - a system designed to parse, converse and interpret natural language queries across data. You may have seen this in action under the...

What does BigQuery Omni mean for the future of data?

GCP recently announced the private alpha of BigQuery Omni, a service that allows you to run BigQuery (Dremel) on data that resides in another cloud (S3 in AWS at the moment, Azure soon) using Anthos. It’s an interesting foray into...

Three

The 2021 database showdown: BigQuery vs Redshift vs Snowflake

Last updated: 16th June 2021 For updates, changes and errata please see the Changelog at the end of this post. What is this guide? - Designed as a fair feature comparison between the different products - An up to date...

Establishing data lineage practices with Google Tag Manager analytics events

In this post we'll look at how to capture meta data from Google Tag Manager (GTM) from the tags that fire analytics requests. This post focuses on Snowplow specifically, however the same concepts can be easily adapted to Google Analytics...

Older Entries