Understanding hash joins in Redshift query plan

dylanbaker · September 17, 2018, 2:04pm

Curiosity question: Can someone explain why Redshift goes through the step of hashing the join columns in some joins? I’m not sure I computationally understand what happens there and would be interested in knowing more. Couldn’t find anything good online.

Topic		Replies	Views
What are the best practices to partition big tables in Redshift In-Depth Discussions redshift , partitioning	2	31389	April 28, 2020
How to create efficient timeseries models (or how to join efficiently on inequalities) In-Depth Discussions incremental	3	5652	September 21, 2018
Tackling the complexity of joining snapshots In-Depth Discussions devblog	2	1801	February 20, 2024
Choosing a data warehouse In-Depth Discussions	5	12974	July 1, 2018
Unioning identically-structured data sources Show and Tell jinja , best-practice	12	36177	December 5, 2023

Understanding hash joins in Redshift query plan

Related topics