Last updated: 11th July 2020 For updates, changes and errata please see the Changelog at the end of this post. What is this guide? – Designed as a fair feature comparison between the different products – An up to date guide (hopefully with regular updates as new features are released or changed) – I’ve attempted […]
A tutorial on setting up the AWS Athena driver and datasource in Jetbrains DataGrip
A tutorial on setting up the BigQuery driver and datasource in Jetbrains DataGrip
Business people get excited about the latest buzzwords: Big Data, Artificial Intelligence, Deep Learning… Before you can break out TensorFlow and start doing bleeding-edge data science, you need to ensure you’re working with data that reflects reality.
Reduce the storage size of your shredded Redshift tables.
A new compression option in Redshift allows you to make big storage savings, up to two-thirds in our tests, over the standard Snowplow setup. This guide shows how it works and how to get it happening.
Step by step instructions in Python to first decode and then deserialize bad rows data that comes out of Snowplow real-time.
Monitor your Snowplow Analytics bad rows using Amazon Lambda and Amazon Cloudwatch to track the number of bad rows turning up in Elasticsearch over time.