seleniumbasic 获取html 为乱码
时间: 2024-02-16 20:02:19 浏览: 20
如果您使用Selenium Basic获取的HTML为乱码,这可能是因为您的网页编码与Selenium Basic默认编码不同所致。您可以尝试指定网页编码并重新获取HTML。以下是一些示例代码:
VBScript:
```
Dim driver
Set driver = CreateObject("Selenium.ChromeDriver")
driver.Start "chrome", "http://www.example.com"
driver.Get "/"
Dim html
html = driver.ExecuteScript("return document.documentElement.outerHTML")
html = FixEncoding(html, "utf-8") '指定网页编码为UTF-8
MsgBox(html)
Function FixEncoding(str, encoding)
Dim stream, bytes
Set stream = CreateObject("ADODB.Stream")
stream.Open
stream.Charset = encoding
stream.WriteText str
stream.Position = 0
stream.Type = 2 'adTypeText
stream.Charset = "utf-8"
bytes = stream.Read
stream.Close
Set stream = Nothing
FixEncoding = bytes
End Function
```
Python:
```
from selenium import webdriver
import requests
driver = webdriver.Chrome()
driver.get("http://www.example.com")
html = driver.execute_script("return document.documentElement.outerHTML")
html = html.encode('utf-8') #指定网页编码为UTF-8
html = html.decode('utf-8')
print(html)
```
请注意,您需要将“http://www.example.com”更改为您要获取HTML的网页地址,还需要根据需要更改网页编码。