Curiosity question: Can someone explain why Redshift goes through the step of hashing the join columns in some joins? I’m not sure I computationally understand what happens there and would be interested in knowing more. Couldn’t find anything good online.
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| What are the best practices to partition big tables in Redshift | 2 | 31701 | April 28, 2020 | |
| How to create efficient timeseries models (or how to join efficiently on inequalities) | 3 | 5750 | September 21, 2018 | |
| Tackling the complexity of joining snapshots | 2 | 1838 | February 20, 2024 | |
| Unioning identically-structured data sources | 12 | 37454 | December 5, 2023 | |
| Choosing a data warehouse | 5 | 13137 | July 1, 2018 |