Quicksight Executing queries in Athena and saving output to a location

jhmayank · July 8, 2024, 8:46am

Hi All, Whenever Quicksight is refreshing a dataset or executing any queries for Athena datasets, it is writing outputs to a S3 bucket named : aws-athena-query-results-AWSREGION-AWSACCOUNTID. I understand that this is the default behaviour. With time this S3 bucket has become bigger in size.

We wanted to ask if we deleted the csv files from this bucket, will it impact quicksight datasets with Athena connections.
Once dataset created and spice dataset refreshed, does it matter if it is there in S3 bucket, is it ever used again ?

Naveed · July 8, 2024, 9:36am

Hi,

There should be no issue if you are not using the incremental refreshes of of datasets.
S3 is cost effective.
Regards,
Naveed Ali

jhmayank · July 8, 2024, 9:57am

I am using incremental refresh in most of my datasets. What if i keep only recent csv file for a dataset. Will it have impact on refresh. My understanding is that QS store datasets in spice and not in S3. It is creating S3 files only to use it for incremental refresh and after it is done, its no more required. Please let me know if my understanding is correct or wrong ?

Naveed · July 8, 2024, 10:09am

Hi,

Yes. You are correct. This will create a problem when you create need a complete refresh in future. QS sync data on S3 I think if you remove the big file from there spice will delete the old data and sync the rest of the files.

Regards,
Naveed Ali

jhmayank · July 8, 2024, 10:13am

Thanks a lot Naveed. I will test it out and see if its hampering our datasets integrity.