Spark Tutorial for Beginners

Spark实战练习之一

Guru    Tuesday, December 10, 2019    1499


#以下是以Java语言为主

点击下面看视频。


val strRDD=sc.parallelize(Array("Java","Scala","Python","Spark","JavaScript","Java"))

#Return another RDD

val filteredRDD = strRDD.filter(s =>s.startsWith("S"))

#collect operation returned an array of strings

val list = filteredRDD.collect

list


#word count


#The pairRDD consists of pairs of the word

val pairRDD=strRDD.map( s => (s,1))


val countRDD=pairRDD.reduceByKey((x,y) =>x+y)


countRDD.collect



#


val intRDD = sc.parallelize(Array(1,4,5,6,7,10,17))