[<a href="/50416/1" title="第1章 师傅">第1章 师傅</a> 如何提取其中的数字?
时间: 2023-08-14 13:16:48 浏览: 31
可以使用正则表达式来提取其中的数字,示例如下:
```python
import re
text = '<a href="/50416/1" title="第1章 师傅">第1章 师傅</a>'
pattern = r'\d+'
match = re.search(pattern, text)
if match:
print(match.group())
```
输出结果为:
```
1
```
其中,`\d+`表示匹配一个或多个数字,`re.search(pattern, text)`表示在`text`中搜索符合`pattern`正则表达式的字符串,`match.group()`表示返回匹配到的字符串。
相关问题
<!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8"> <meta name="viewport" content="width=device-width, initial-scale=1.0"> <link rel=icon href=/static/dist/favicon.ico> <title>Document</title> <link href=/static/dist/css/chunk-13070ec1.ccda3c25.css rel=prefetch> <link href=/static/dist/css/chunk-1f6eb24a.5552800c.css rel=prefetch> <link href=/static/dist/css/chunk-2450c4ac.37f7ca9b.css rel=prefetch> <link href=/static/dist/css/chunk-24a27c0c.d809b953.css rel=prefetch> <link href=/static/dist/css/chunk-25dec777.b68c08db.css rel=prefetch> <link href=/static/dist/css/chunk-3a7e7ac7.61f67a30.css rel=prefetch> <link href=/static/dist/css/chunk-3ac3afd8.98bc23e9.css rel=prefetch> <link href=/static/dist/css/chunk-3b4a96bb.a0ee3bc1.css rel=prefetch> <link href=/static/dist/css/chunk-42b28a6b.64434a61.css rel=prefetch> <link href=/static/dist/css/chunk-517ab105.39040074.css rel=prefetch> <link href=/static/dist/css/chunk-56490945.643cad5c.css rel=prefetch> <link href=/static/dist/css/chunk-63b82705.d2b7ad58.css rel=prefetch> <link href=/static/dist/css/chunk-716622da.8a497f1a.css rel=prefetch> <link href=/static/dist/js/chunk-13070ec1.cc5aaa8f.js rel=prefetch> <link href=/static/dist/js/chunk-1f6eb24a.bca948d6.js rel=prefetch> <link href=/static/dist/js/chunk-2450c4ac.58e1bc6a.js rel=prefetch> <link href=/static/dist/js/chunk-24a27c0c.0ab7f6d8.js rel=prefetch> <link href=/static/dist/js/chunk-25dec777.2148f1f7.js rel=prefetch> <link href=/static/dist/js/chunk-3a7e7ac7.513dffb8.js rel=prefetch> <link href=/static/dist/js/chunk-3ac3afd8.6c148bd8.js rel=prefetch> <link href=/static/dist/js/chunk-3b4a96bb.73517657.js rel=prefetch> <link href=/static/dist/js/chunk-42b28a6b.1e8780b2.js rel=prefetch> <link href=/static/dist/js/chunk-517ab105.1e512cbc.js rel=prefetch> <link href=/static/dist/js/chunk-56490945.c3e3cef6.js rel=prefetch> <link href=/static/dist/js/chunk-63b82705.f1066fe6.js rel=prefetch> <link href=/static/dist/js/chunk-716622da.244a901e.js rel=prefetch> <link href=/static/dist/css/app.a627b381.css rel=preload as=style> <link href=/static/dist/css/chunk-vendors.3fe6fb1a.css rel=preload as=style> <link href=/static/dist/js/app.a15d8424.js rel=preload as=script> <link href=/static/dist/js/chunk-vendors.eac65f44.js rel=preload as=script> <link href=/static/dist/css/chunk-vendors.3fe6fb1a.css rel=stylesheet> <link href=/static/dist/css/app.a627b381.css rel=stylesheet> </head> <body><noscript><strong>We're sorry but iview-admin doesn't work properly without JavaScript enabled. Please enable it to continue.</strong></noscript> <div id=app></div> <script src=/static/dist/js/chunk-vendors.eac65f44.js></script> <script src=/static/dist/js/app.a15d8424.js></script> </body> </html> 帮我整理一下代码
这是一个 HTML 页面的代码。以下是格式整理后的代码:
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<link rel=icon href=/static/dist/favicon.ico>
<title>Document</title>
<link href=/static/dist/css/chunk-13070ec1.ccda3c25.css rel=prefetch>
<link href=/static/dist/css/chunk-1f6eb24a.5552800c.css rel=prefetch>
<link href=/static/dist/css/chunk-2450c4ac.37f7ca9b.css rel=prefetch>
<link href=/static/dist/css/chunk-24a27c0c.d809b953.css rel=prefetch>
<link href=/static/dist/css/chunk-25dec777.b68c08db.css rel=prefetch>
<link href=/static/dist/css/chunk-3a7e7ac7.61f67a30.css rel=prefetch>
<link href=/static/dist/css/chunk-3ac3afd8.98bc23e9.css rel=prefetch>
<link href=/static/dist/css/chunk-3b4a96bb.a0ee3bc1.css rel=prefetch>
<link href=/static/dist/css/chunk-42b28a6b.64434a61.css rel=prefetch>
<link href=/static/dist/css/chunk-517ab105.39040074.css rel=prefetch>
<link href=/static/dist/css/chunk-56490945.643cad5c.css rel=prefetch>
<link href=/static/dist/css/chunk-63b82705.d2b7ad58.css rel=prefetch>
<link href=/static/dist/css/chunk-716622da.8a497f1a.css rel=prefetch>
<link href=/static/dist/js/chunk-13070ec1.cc5aaa8f.js rel=prefetch>
<link href=/static/dist/js/chunk-1f6eb24a.bca948d6.js rel=prefetch>
<link href=/static/dist/js/chunk-2450c4ac.58e1bc6a.js rel=prefetch>
<link href=/static/dist/js/chunk-24a27c0c.0ab7f6d8.js rel=prefetch>
<link href=/static/dist/js/chunk-25dec777.2148f1f7.js rel=prefetch>
<link href=/static/dist/js/chunk-3a7e7ac7.513dffb8.js rel=prefetch>
<link href=/static/dist/js/chunk-3ac3afd8.6c148bd8.js rel=prefetch>
<link href=/static/dist/js/chunk-3b4a96bb.73517657.js rel=prefetch>
<link href=/static/dist/js/chunk-42b28a6b.1e8780b2.js rel=prefetch>
<link href=/static/dist/js/chunk-517ab105.1e512cbc.js rel=prefetch>
<link href=/static/dist/js/chunk-56490945.c3e3cef6.js rel=prefetch>
<link href=/static/dist/js/chunk-63b82705.f1066fe6.js rel=prefetch>
<link href=/static/dist/js/chunk-716622da.244a901e.js rel=prefetch>
<link href=/static/dist/css/app.a627b381.css rel=preload as=style>
<link href=/static/dist/css/chunk-vendors.3fe6fb1a.css rel=preload as=style>
<link href=/static/dist/js/app.a15d8424.js rel=preload as=script>
<link href=/static/dist/js/chunk-vendors.eac65f44.js rel=preload as=script>
<link href=/static/dist/css/chunk-vendors.3fe6fb1a.css rel=stylesheet>
<link href=/static/dist/css/app.a627b381.css rel=stylesheet>
</head>
<body>
<noscript>
<strong>We're sorry but iview-admin doesn't work properly without JavaScript enabled. Please enable it to continue.</strong>
</noscript>
<div id=app></div>
<script src=/static/dist/js/chunk-vendors.eac65f44.js></script>
<script src=/static/dist/js/app.a15d8424.js></script>
</body>
</html>
用list1=html.select('body > div.box > div.list > ul > li >a')怎么提取[<a href="/50416/1" title="第1章 师傅">第1章 师傅</a>中的数字?
可以使用正则表达式来提取其中的数字,示例如下:
```python
import re
from bs4 import BeautifulSoup
import requests
url = 'http://www.biquge.info/10_10231/'
response = requests.get(url)
html = BeautifulSoup(response.content, 'html.parser')
list1 = html.select('body > div.box > div.list > ul > li > a')
for item in list1:
title = item['title']
pattern = r'\d+'
match = re.search(pattern, title)
if match:
print(match.group())
```
其中,`item['title']`表示获取`<a>`标签中的`title`属性值,即章节标题。然后使用正则表达式`\d+`匹配其中的数字,`match.group()`返回匹配到的字符串。上述代码会输出所有章节的数字编号。