Python Playwright: getting the current URL
Date: 2023-10-15 08:08:09
To get the current URL with Python Playwright, you can use the following code:
```python
import asyncio

from playwright.async_api import async_playwright


async def get_current_url():
    async with async_playwright() as p:
        browser = await p.chromium.launch()
        page = await browser.new_page()
        await page.goto('https://www.example.com')
        # page.url is a plain property, so it is read without await
        current_url = page.url
        await browser.close()
        return current_url


# Run the async function with asyncio
current_url = asyncio.run(get_current_url())
print(current_url)
```
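In this example, the async function `get_current_url` retrieves the current URL. Inside the function, Playwright launches a Chromium browser and opens an address in a new page. We then read the current URL from `page.url`, which is a property rather than a coroutine and therefore needs no `await`, and close the browser. Finally, the function returns the URL it read.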
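The value read from `page.url` is the address the browser ended up on, so it may differ slightly from the string passed to `page.goto()` (for instance, browsers typically report `https://www.example.com/` with a trailing slash). The following is a minimal sketch, not part of the original answer, of one way to compare the two; the helper name `same_page` is hypothetical and it only uses the standard library.

```python
from urllib.parse import urlsplit


def same_page(requested: str, current: str) -> bool:
    """Compare two URLs while ignoring host case and an empty vs. "/" path.

    Hypothetical helper for illustration; fragments are ignored.
    """
    a, b = urlsplit(requested), urlsplit(current)
    key_a = (a.scheme, a.netloc.lower(), a.path or "/", a.query)
    key_b = (b.scheme, b.netloc.lower(), b.path or "/", b.query)
    return key_a == key_b
```

With a live page, this could be called as `same_page('https://www.example.com', page.url)` to check whether navigation was redirected elsewhere.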
Related questions
Python Playwright: getting binary content
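`page.content()` has no `binary` option: it always returns the page's HTML as a `str`. To get the page as `bytes`, one approach is to keep the `Response` object returned by `page.goto()` and call its `body()` method, which returns the raw response body. Here is a simple example:
```python
import asyncio

from playwright.async_api import async_playwright


async def main():
    async with async_playwright() as p:
        browser = await p.chromium.launch()
        page = await browser.new_page()
        # page.goto() returns the Response for the main document
        response = await page.goto('https://example.com')
        # Response.body() returns the raw response body as bytes
        content = await response.body()
        print(content)
        await browser.close()


asyncio.run(main())
```
In the code above, `content` holds the bytes the server sent for the main document, that is, the HTML before any script has modified the page. If you need the current DOM instead, call `await page.content()` and encode the resulting string yourself.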
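Because `page.content()` yields the serialized HTML as a `str`, turning it into bytes is an ordinary string encoding step. A minimal sketch follows; the helper name `html_to_bytes` is hypothetical, and UTF-8 is an assumed default rather than something Playwright mandates.

```python
def html_to_bytes(html: str, encoding: str = "utf-8") -> bytes:
    """Encode an HTML string (e.g. the result of page.content()) as bytes.

    Hypothetical helper; the encoding default is an assumption.
    """
    return html.encode(encoding)
```

With a live page this might be used as `html_to_bytes(await page.content())`.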
Python Playwright: selecting a ul tag whose class is "python playwright"
To select the `ul` element whose class is "python playwright" with Python Playwright, follow these steps:
1. Import the Playwright library:
```python
from playwright.sync_api import sync_playwright
```
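2. Start Playwright and create a browser instance. The snippets in the remaining steps belong inside this `with` block, indented one level, because the browser is only usable while the block is open:
```python
with sync_playwright() as playwright:
    browser = playwright.chromium.launch()
    page = browser.new_page()
```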
3. Open the target page:
```python
page.goto("https://example.com")
```
4. Select the target element with a CSS selector:
```python
ul_element = page.query_selector('ul.python.playwright')
```
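The CSS selector `ul.python.playwright` matches a `ul` element that carries both the `python` and the `playwright` class. A class attribute of "python playwright" is two space-separated class names, so each one gets its own `.` prefix in the selector.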
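5. Get the element's text content (`query_selector` returns `None` when nothing matches, so check the result first on pages where the element may be absent):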
```python
ul_text = ul_element.inner_text()
print(ul_text)
```
6. Close the browser instance:
```python
browser.close()
```
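The selector-building and text-reading steps above can be sketched as two small helpers. This is an illustration under assumptions, not part of the original answer: the names `class_selector` and `ul_inner_text` are hypothetical, `page` is expected to be a Playwright sync `Page` (anything with a compatible `query_selector` method works), and class names containing characters that need CSS escaping are not handled.

```python
def class_selector(tag: str, class_attr: str) -> str:
    """Build a CSS selector for `tag` elements carrying every class in `class_attr`.

    `class_attr` is a space-separated class attribute value such as
    "python playwright". Hypothetical helper; no CSS escaping is applied.
    """
    classes = class_attr.split()
    return tag + "".join(f".{name}" for name in classes)


def ul_inner_text(page, class_attr: str):
    """Return the inner text of the first matching <ul>, or None if none matches."""
    element = page.query_selector(class_selector("ul", class_attr))
    # query_selector returns None when no element matches the selector
    if element is None:
        return None
    return element.inner_text()
```

Inside the `with sync_playwright()` block from step 2, after `page.goto(...)`, this could be called as `print(ul_inner_text(page, "python playwright"))`.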