Performance Effects of Skewed Row Distribution
Uneven distribution of table rows among AMPs (skew) can prevent efficient query processing.
Skewed distribution results in:
The effects of a skewed table appear in several types of operations. For example:
For example, assume you have a 5 million row table, with 5,000 rows having the same NUPI value. You are inserting 100,000 rows into that table, with 100 of those insert rows having the same NUPI value. The AMP holding the 5,000 rows with that NUPI value has to perform one half million duplicate row checks (5,000 * 100) for this NUPI. This operation results in poor parallel efficiency.