Curiosity question: Can someone explain why Redshift goes through the step of hashing the join columns in some joins? I’m not sure I computationally understand what happens there and would be interested in knowing more. Couldn’t find anything good online.
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
What are the best practices to partition big tables in Redshift | 2 | 31389 | April 28, 2020 | |
How to create efficient timeseries models (or how to join efficiently on inequalities) | 3 | 5652 | September 21, 2018 | |
Tackling the complexity of joining snapshots | 2 | 1801 | February 20, 2024 | |
Choosing a data warehouse | 5 | 12974 | July 1, 2018 | |
Unioning identically-structured data sources | 12 | 36177 | December 5, 2023 |