Setting up AWS Athena datasource in JetBrains DataGrip

Download the JDBC driver from AWS and place it in the DataGrip JDBC driver directory. On Linux this was ~/.DataGrip2018.1/config/jdbc-drivers/.

File > Data Sources to open the Data Sources panel and click ‘+ > Driver’. Name it AWS Athena. Here’s the confusing bit: skip down to Additional Files and click + and find the location of your driver. Now back up above that select the driver Class com.simba.athena.jdbc.Driver. Click OK.

Okay so you now have the JDBC driver installed, so you need to create a connection to your Athena instance.

File > Data Sources to open the Data Sources panel and click + > AWS Athena to create a new connection using the driver. Enter the access key and secret in the User and Password fields respectively.

Set the URL to jdbc:awsathena://athena.ap-YOURREGION.amazonws.com:443/ with your region string in there.

Now click Advanced and set the two required keys, AwsRegion and S3OutputLocation. The region should be the same as you used above.

S3OutputLocation is available in the AWS Console for Athena by clicking Settings.

Now click Test and you should get Success. You can now use Athena in DataGrip.

Popular posts like this

Monitoring Snowplow bad rows using Lambda and Cloudwatch

Decoding Snowplow real-time bad rows (Thrift)

Make big data small again with Redshift ZSTD compression