sortByKey, as described in the official documentation: Sort the RDD by key, so that each partition contains a sorted range of the elements in ascen…
Tag: sortbykey
Spark operators 1: repartitionAndSortWithinPartitions
repartitionAndSortWithinPartitions counts as an efficient operator because it is faster than calling repartition and then sortByKey: its sorting is done as part of the shuffle, sorting while shu…
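To make the result of the operator concrete, here is a minimal plain-Python model of what it produces. It is not Spark code and not Spark's implementation. The function name `repartition_and_sort_within_partitions`, the sample data, and the choice of non-negative integer keys routed by `key % num_partitions` are all assumptions made for illustration (the modulo rule is meant to mimic a hash partitioner on integer keys). The model only shows the outcome: every record lands in the partition chosen by its key, and each partition is sorted by key on its own, with no global ordering across partitions.

```python
def repartition_and_sort_within_partitions(pairs, num_partitions):
    """Model of the operator's result on (key, value) pairs with integer keys.

    Assumption: keys are routed with key % num_partitions, imitating a
    hash partitioner for integer keys. Real Spark sorts during the shuffle;
    this sketch sorts afterwards, which yields the same final layout.
    """
    partitions = [[] for _ in range(num_partitions)]
    for key, value in pairs:
        # Route the record to its target partition by key.
        partitions[key % num_partitions].append((key, value))
    # Sort each partition independently by key only.
    return [sorted(part, key=lambda kv: kv[0]) for part in partitions]


if __name__ == "__main__":
    data = [(5, "e"), (2, "b"), (4, "d"), (1, "a"), (3, "c"), (0, "z")]
    for index, part in enumerate(repartition_and_sort_within_partitions(data, 2)):
        print(index, part)
```

With two partitions, the even keys end up together and the odd keys end up together, each group ordered by key. Calling repartition followed by a separate sort would reach a comparable layout, but as a second pass over data that has already been shuffled, which is the extra cost the excerpt refers to.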