首页当当网爬虫保存到本地

当当网爬虫保存到本地

时间: 2024-02-11 20:02:05 浏览: 120

今日头条新闻信息爬虫保存到本地

以下是一个将当当网的书名、链接、价格和评论数爬取并保存到本地的爬虫示例： ```python import requests from lxml import etree import csv # 定义爬取函数 def crawl_dangdang(): url = "http://search.dangdang.com/?key=python&act=input&page_index=1" headers = { "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36" } response = requests.get(url, headers=headers) html = response.text # 使用lxml解析网页源码 tree = etree.HTML(html) # 获取书名、链接、价格和评论数 book_names = tree.xpath('//ul[@class="bigimg"]/li//a[@class="pic"]/@title') book_links = tree.xpath('//ul[@class="bigimg"]/li//a[@class="pic"]/@href') book_prices = tree.xpath('//ul[@class="bigimg"]/li//p[@class="price"]/span[@class="search_now_price"]/text()') book_comments = tree.xpath('//ul[@class="bigimg"]/li//p[@class="search_star_line"]/a/text()') # 将数据写入CSV文件 with open('books.csv', 'w', encoding='utf-8', newline='') as csvfile: writer = csv.writer(csvfile) writer.writerow(['书名', '链接', '价格', '评论数']) for i in range(len(book_names)): writer.writerow([book_names[i], book_links[i], book_prices[i], book_comments[i]]) # 调用爬取函数 crawl_dangdang() ``` 这个爬虫会将爬取到的书名、链接、价格和评论数保存到名为`books.csv`的CSV文件中。

阅读全文

最新推荐

当当网爬虫保存到本地

相关推荐

python 网络爬虫并保存text

Python3爬虫学习之将爬取的信息保存到本地的方法详解

爬虫爬取图片保存到本地

python中多图片爬虫保存在本地磁盘

go语言爬虫将图片和视频保存到本地

网络爬虫爬取数据保存到数据库过程

使用Python爬虫怎么将网页图片保存到本地

python爬虫当当网

python爬虫当当网可视化

python爬虫爬取图片并保存到本地的代码

python爬虫爬取当当网

当当网新书热卖榜爬虫

python爬虫当当网畅销书籍

python爬虫当当网图书信息

天眼查爬虫并保存到excel

Python爬虫抓取网页数据并保存到本地数据文件中

使用Python爬虫怎么将多个网页图片保存到本地

python爬虫保存图片

python爬虫爬取网站正文并保存在本地

最新推荐

python制作爬虫并将抓取结果保存到excel中

网络爬虫.论文答辩PPT

Python发展史及网络爬虫

java实现一个简单的网络爬虫代码示例

hadoop中实现java网络爬虫(示例讲解)

深入浅出：自定义 Grunt 任务的实践指南

管理建模和仿真的文件

数据可视化在缺失数据识别中的作用

ABB机器人在自动化生产线中是如何进行路径规划和任务执行的？请结合实际应用案例分析。

网络物理突变工具的多点路径规划实现与分析