Spark RDD countByKey

countByKey

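Much like count, but it operates on an RDD whose items are [K, V] pairs, and it returns a map from each key to the number of values associated with that key.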

Note: this is an action, so it triggers the actual computation and returns the resulting map to the driver.

Function signature

def countByKey(): Map[K, Long]

Example:



val c = sc.parallelize(List((3, "Gnu"), (3, "Yak"), (5, "Mouse"), (3, "Dog")), 2)
c.countByKey
res3: scala.collection.Map[Int,Long] = Map(3 -> 3, 5 -> 1)
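
To make the semantics concrete, here is a minimal sketch in plain Scala (no Spark required) of what countByKey computes for the example above. Note that countByKeyLocal is a hypothetical helper introduced only for illustration and is not part of the Spark API; the real method computes the counts across the RDD's partitions and collects the resulting map to the driver.

```scala
// Hypothetical local stand-in for RDD.countByKey, using only the Scala standard library.
// It counts how many pairs carry each key and ignores the values themselves.
def countByKeyLocal[K, V](pairs: Seq[(K, V)]): Map[K, Long] =
  pairs.foldLeft(Map.empty[K, Long]) { case (acc, (k, _)) =>
    // Increment the running count for this key, starting from 0 if it is new.
    acc.updated(k, acc.getOrElse(k, 0L) + 1L)
  }

// Same data as the Spark example above.
val pairs = List((3, "Gnu"), (3, "Yak"), (5, "Mouse"), (3, "Dog"))
val counts = countByKeyLocal(pairs)

// Key 3 appears three times, key 5 once.
println(s"3 -> ${counts(3)}, 5 -> ${counts(5)}")
```

Since the whole result map ends up in driver memory, countByKey is best suited to RDDs with a modest number of distinct keys.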