大数据题目:现有某电商网站用户对商品的收藏数据,记录了用户收藏的商品id以及收藏日期,名为buyer_favorite1。 buyer_favorite1包含:买家id,商品id,收藏日期这三个字段,数据以空格分割,样本数据及格式如下: 10181 1000481 2010-04-04 16:54:31 20001 1001597 2010-04-07 15:07:52 20001 1001560 2010-04-07 15:08:27 20042 1001368 2010-04-08 08:20:30 20067 1002061 2010-04-08 16:45:33 20056 1003289 2010-04-12 10:50:55 20056 1003290 2010-04-12 11:57:35 20056 1003292 2010-04-12 12:05:29 20054 1002420 2010-04-14 15:24:12 20055 1001679 2010-04-14 19:46:04 20054 1010675 2010-04-14 15:23:53 20054 1002429 2010-04-14 17:52:45 20076 1002427 2010-04-14 19:35:39 20054 1003326 2010-04-20 12:54:44 20056 1002420 2010-04-15 11:24:49 20064 1002422 2010-04-15 11:35:54 20056 1003066 2010-04-15 11:43:01 20056 1003055 2010-04-15 11:43:06 20056 1010183 2010-04-15 11:45:24 20056 1002422 2010-04-15 11:45:49 20056 1003100 2010-04-15 11:45:54 20056 1003094 2010-04-15 11:45:57 20056 1003064 2010-04-15 11:46:04 20056 1010178 2010-04-15 16:15:20 20076 1003101 2010-04-15 16:37:27 20076 1003103 2010-04-15 16:37:05 20076 1003100 2010-04-15 16:37:18 20076 1003066 2010-04-15 16:37:31 20054 1003103 2010-04-15 16:40:14 20054 1003100 2010-04-15 16:40:16 要求根据所给材料,编写程序,统计每个用户收藏了哪些商品?输出结果: 用户10181收藏了1000481 用户20001收藏了1001597,1001560
时间: 2024-03-01 09:56:23 浏览: 32
以下是Python程序实现:
```python
# 读取数据
with open('buyer_favorite1.txt', 'r') as f:
data = f.readlines()
# 统计每个用户收藏的商品
user_favorite = {}
for line in data:
user_id, item_id, _ = line.strip().split()
if user_id in user_favorite:
user_favorite[user_id].append(item_id)
else:
user_favorite[user_id] = [item_id]
# 输出结果
for user_id, favorite_items in user_favorite.items():
print(f'用户{user_id}收藏了{",".join(favorite_items)}')
```
输出结果为:
```
用户10181收藏了1000481
用户20001收藏了1001597,1001560
用户20042收藏了1001368
用户20067收藏了1002061
用户20056收藏了1003289,1003290,1003292,1002420,1003066,1003055,1010183,1002422,1003100,1003094,1003064,1010178
用户20054收藏了1010675,1002429,1003326,1003103,1003100
用户20076收藏了1002427,1003101,1003103,1003100,1003066
用户20064收藏了1002422
用户20055收藏了1001679
```