首页用list1=html.select('body > div.box > div.list > ul > li >a')怎么提取[<a href="/50416/1" title="第1章师傅">第1章师傅</a>中的数字？

用list1=html.select('body > div.box > div.list > ul > li >a')怎么提取[<a href="/50416/1" title="第1章师傅">第1章师傅</a>中的数字？

时间: 2024-05-17 17:17:47 浏览: 120

提取字符串中的数字

可以将字符串中的数字提取出来并赋给一个整形数组欢迎大家参考

可以使用正则表达式来提取其中的数字，示例如下： ```python import re from bs4 import BeautifulSoup import requests url = 'http://www.biquge.info/10_10231/' response = requests.get(url) html = BeautifulSoup(response.content, 'html.parser') list1 = html.select('body > div.box > div.list > ul > li > a') for item in list1: title = item['title'] pattern = r'\d+' match = re.search(pattern, title) if match: print(match.group()) ``` 其中，`item['title']`表示获取`<a>`标签中的`title`属性值，即章节标题。然后使用正则表达式`\d+`匹配其中的数字，`match.group()`返回匹配到的字符串。上述代码会输出所有章节的数字编号。

阅读全文