What does BigQuery Omni mean for the future of data?

What does BigQuery Omni mean for the future of data?

GCP recently announced the private alpha of BigQuery Omni, a service that allows you to run BigQuery (Dremel) on data that resides in another cloud (S3 in AWS at the moment, Azure soon) using Anthos. It’s an interesting foray into what looks like one of the earliest approaches to multi cloud analytics. In this post […]

The 2020 database showdown: BigQuery vs Redshift vs Snowflake

The 2020 database showdown: BigQuery vs Redshift vs Snowflake

Last updated: 11th July 2020 For updates, changes and errata please see the Changelog at the end of this post. What is this guide? – Designed as a fair feature comparison between the different products – An up to date guide (hopefully with regular updates as new features are released or changed) – I’ve attempted […]

Establishing data lineage practices with Google Tag Manager analytics events

Establishing data lineage practices with Google Tag Manager analytics events

In this post we’ll look at how to capture meta data from Google Tag Manager (GTM) from the tags that fire analytics requests. This post focuses on Snowplow specifically, however the same concepts can be easily adapted to Google Analytics or your other tool of choice through custom fields / dimensions / properties. Although it’s […]

Adventures with Event Bridge

Adventures with Event Bridge

What is Amazon Event Bridge?   AWS describes Event Bridge as: … a serverless event bus that makes it easy to connect applications together using data from your own applications, integrated Software-as-a-Service (SaaS) applications, and AWS services.  EventBridge delivers a stream of real-time data from event sources, such as Zendesk, Datadog, or Pagerduty, and routes […]

Snowplow Inspector v0.2.15 Released

Snowplow Inspector v0.2.15 Released

What’s New? This small release adds support for custom endpoint paths in POST requests. What does that mean? As Snowplow grows in popularity, it gets used more and more for different use-cases. Some of those use-cases involve using Snowplow as a third-party analytics solution, tracking user behaviour across multiple sites, rather than as just a […]

Server Side Tracking – What’s Old is New Again

Server Side Tracking – What’s Old is New Again

Introduction: The Importance of Data Collection When we work with our clients on data and analytics; we strongly believe that the process of data collection is fundamental. Data quality should be the highest priority, and this can only be achieved through a robust and efficient workflow. It might seem obvious, but if you’re not collecting […]

The life of Python: Highlights from PyCon AU 2019

The life of Python: Highlights from PyCon AU 2019

Opportunities to actively engage and learn are vital to our work at Poplin Data so it was great to attend the recent  PyCon AU 2019 with other members of the engineering team. The event brought together professionals, students and enthusiast developers to discuss the many joys and challenges of programming in Python, which is literally […]

The importance of owning your own data

The importance of owning your own data

As Equifax prepares to pay out as much as $US700 million in compensation for its spectacular 2017 security breach, and cosmetics retailer Sephora apologises for a leak of Asia Pacific customer data, now is a good time to consider the advantages of data ownership. Peace of mind around security isn’t the only advantage, there are […]