首页通过python爬虫爬取https://www.forbeschina.com/lists/1781的前100个富豪姓名和财富值[['nameChinese', 'assets']]

通过python爬虫爬取https://www.forbeschina.com/lists/1781的前100个富豪姓名和财富值[['nameChinese', 'assets']]

时间: 2024-04-18 21:26:13 浏览: 88

你可以使用Python的Requests库和BeautifulSoup库来实现爬取网页数据的功能。下面是一个示例代码，可以帮助你实现爬取https://www.forbeschina.com/lists/1781的前100个富豪姓名和财富值的需求： ```python import requests from bs4 import BeautifulSoup url = 'https://www.forbeschina.com/lists/1781' # 发起请求 response = requests.get(url) # 解析网页内容 soup = BeautifulSoup(response.text, 'html.parser') # 定位富豪信息所在的表格 table = soup.find('table', class_='tablelist') # 提取前100个富豪的姓名和财富值 data = [] for row in table.tbody.find_all('tr')[:100]: cells = row.find_all('td') name = cells[1].text.strip() assets = cells[2].text.strip() data.append([name, assets]) # 打印结果 for item in data: print(item) ``` 运行这段代码，你将得到一个包含前100个富豪姓名和财富值的二维列表。每个富豪的姓名和财富值分别存储在列表的两个元素中。

阅读全文