首页scala 对List(“hello tom hello jerry”, “hello jerry”, “hello kitty”)进行单词计数

scala 对List(“hello tom hello jerry”, “hello jerry”, “hello kitty”)进行单词计数

时间: 2024-04-16 20:23:25 浏览: 165

在Scala中，可以使用flatMap和groupBy函数对List中的字符串进行词计数。首先，我们需要将每个字符串拆分成单词，并将它们扁平化为一个列表。然后，我们可以使用groupBy函数按照单词进行分组，并计算每个单词的数量。下面是一个示例代码： ```scala val list = List("hello tom hello jerry", "hello jerry", "hello kitty") val wordCount = list .flatMap(_.split(" ")) // 将每个字符串拆分成单词并扁平化 .groupBy(identity) // 按照单词进行分组 .mapValues(_.size) // 计算每个单词的数量 wordCount.foreach(println) ``` 输出结果为： ``` (hello,3) (tom,1) (jerry,2) (kitty,1) ```

2、使用scala代码完成下面词汇统计的案例 2.1需求，实现单词计数：统计里面出现的相同的单词，进行计数，取出单词频数排名倒数前三的结果，给出scala代码

以下是使用Scala实现单词计数的代码： ```scala val text = "Hello World, Hello Scala, Scala is cool" val wordCounts = text .toLowerCase() .split("\\W+") .groupBy(identity) .mapValues(_.length) .toList .sortBy(-_._2) .take(3) println(wordCounts) ``` 代码解释： 1. 定义一个字符串`text`，表示要进行单词计数的文本。 2. 将文本转换为小写，并使用正则表达式`\W+`分割为单词数组。 3. 使用`groupBy`方法按照单词分组，并使用`mapValues`方法计算每组单词的数量。 4. 将结果转换为列表，并按照单词数量降序排序。 5. 取出前三个单词数量最多的单词及其数量，并打印输出。输出结果为： ``` List((hello,2), (scala,2), (world,1)) ``` 表示在文本中，单词`hello`和`scala`各出现了2次，而单词`world`只出现了1次。前三个单词数量最多的单词是`hello`和`scala`。

阅读全文