首页python selenium多线程爬虫案例

python selenium多线程爬虫案例

时间: 2023-11-03 21:57:13 浏览: 197

下面是一个简单的Python Selenium多线程爬虫案例： ```python import threading from selenium import webdriver class MyThread(threading.Thread): def __init__(self, url): threading.Thread.__init__(self) self.url = url def run(self): driver = webdriver.Chrome() driver.get(self.url) # 在这里处理你的爬虫逻辑 driver.quit() urls = ["http://www.example.com/page1", "http://www.example.com/page2", "http://www.example.com/page3"] threads = [] for url in urls: threads.append(MyThread(url)) for thread in threads: thread.start() for thread in threads: thread.join() ``` 在这个例子中，我们创建了一个自定义的线程类MyThread。这个类继承了Python threading模块中的Thread类，并重写了它的run()方法。在线程启动时，它将使用Selenium webdriver启动一个Chrome浏览器，并访问指定的URL。你可以在run()方法中添加你的爬虫逻辑。最后，我们创建了一个列表来存储所有的线程，并依次启动它们。我们使用join()方法等待所有线程的执行完成。

阅读全文