Scraping 400 Lianjia second-hand housing listings with Python and saving them to house.xlsx

Sure, the code below carries out this scraping task. Note that the site's anti-scraping measures are fairly strict: requesting pages too frequently can get your IP blocked, so it is advisable to add delays between requests and take similar precautions.

```python
import time

import openpyxl
import requests
from bs4 import BeautifulSoup

# Listing pages are addressed as .../pg1, .../pg2, ...
url = "https://bj.lianjia.com/ershoufang/pg"
headers = {
    "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
                  "(KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.3"
}
MAX_ROWS = 400  # the task asks for exactly 400 listings

wb = openpyxl.Workbook()
sheet = wb.active
sheet.title = "house"
# Header row
sheet.append(["Community", "Layout", "Area", "Orientation", "Floor",
              "Year built", "Total price (10k CNY)", "Unit price (CNY/m2)"])

count = 0
for i in range(1, 21):
    print("Scraping page {}".format(i))
    res = requests.get(url + str(i), headers=headers, timeout=10)
    res.raise_for_status()  # stop early if the request was rejected
    res.encoding = "utf-8"
    soup = BeautifulSoup(res.text, "html.parser")
    house_list = soup.find_all("div", {"class": "info clear"})
    for house in house_list:
        # The class names and field positions below depend on the page
        # layout; check them against the live page before relying on them.
        house_info = house.find("div", {"class": "houseInfo"}).get_text().split("|")
        position_info = house.find("div", {"class": "positionInfo"}).get_text().split("-")
        name = house.find("div", {"class": "title"}).a.get_text()
        room = house_info[1].strip()
        square = house_info[2].strip()
        direction = house_info[3].strip()
        floor = position_info[1].strip()
        year = position_info[0].strip()
        total_price = house.find("div", {"class": "totalPrice"}).span.get_text()
        # Strip the leading label and trailing unit text around the number
        unit_price = house.find("div", {"class": "unitPrice"}).get_text().strip()[2:-4]
        sheet.append([name, room, square, direction, floor, year,
                      total_price, unit_price])
        count += 1
        if count >= MAX_ROWS:
            break
    if count >= MAX_ROWS:
        break
    time.sleep(1)  # pause between pages to avoid hammering the site

wb.save("house.xlsx")
print("Scraping finished!")
```
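The advice about delays can go one step further than a fixed `time.sleep(1)`: when a request fails, wait progressively longer before trying again. The following is a minimal sketch of that idea, not part of the original answer. The names `backoff_delay` and `fetch_with_retry` are hypothetical, and the base delay, cap, and jitter values are arbitrary assumptions to tune for your own use.

```python
import random
import time


def backoff_delay(attempt, base=1.0, cap=30.0, jitter=0.5):
    """Seconds to wait before retrying after failed attempt `attempt` (0-based).

    The delay doubles with each attempt, is capped at `cap`, and gets a
    small random jitter so requests are not sent at perfectly regular
    intervals.
    """
    delay = min(cap, base * (2 ** attempt))
    return delay + random.uniform(0, jitter)


def fetch_with_retry(fetch, url, max_attempts=3, sleep=time.sleep):
    """Call fetch(url) until it succeeds or max_attempts is exhausted.

    `fetch` is any callable that returns the page text or raises an
    exception. `sleep` is injectable so the function can be tested
    without actually waiting.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    last_error = None
    for attempt in range(max_attempts):
        try:
            return fetch(url)
        except Exception as exc:  # broad on purpose: this is only a sketch
            last_error = exc
            if attempt < max_attempts - 1:
                sleep(backoff_delay(attempt))
    raise last_error
```

In the scraper above, `fetch` could be something like `lambda u: requests.get(u, headers=headers, timeout=10).text`, with the call inside the page loop replaced by `fetch_with_retry(fetch, url + str(i))`. Note that the lambda as written only triggers a retry on connection errors and timeouts; to retry on HTTP error responses as well, it would need to call `raise_for_status()` before returning the text.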
