
Counting text in Python without counting symbols

Time: 2024-05-11 21:17:49 Views: 133

Two ways to count Chinese characters in Python


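Method 1:

```python
import string

def str_count(str):
    '''Count the Chinese and English characters, spaces, digits, and punctuation marks in a string'''
    count_en = count_dg = count_sp = count_zh = count_pu = 0
    for s in str:
        # English letters
        if s in string.ascii_letters:
            count_en += 1
        # Digits
        elif s.isdigit():
            count_dg += 1
        # Whitespace
        elif s.isspace():
            count_sp += 1
        # Chinese characters
        elif s.isalpha():
            ...  # the excerpt is cut off at this point
```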
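Because the excerpt stops at the `isalpha()` branch, the rest of that function is unknown. The following is a minimal sketch of one way such a per-character classifier might be completed, not the original author's code. The function name `count_chars`, the dictionary keys, and the catch-all `other` bucket are assumptions made for illustration. It also swaps the excerpt's `isalpha()` test for an explicit check against the CJK Unified Ideographs block (U+4E00 to U+9FFF), so that non-Chinese letters such as `é` are not counted as Chinese.

```python
import string


def count_chars(text):
    """Classify each character of `text` and return the per-category counts.

    Hypothetical helper: the categories and key names are illustrative.
    """
    counts = {"en": 0, "digit": 0, "space": 0, "zh": 0, "other": 0}
    for ch in text:
        if ch in string.ascii_letters:
            counts["en"] += 1          # ASCII letters a-z, A-Z
        elif ch.isdigit():
            counts["digit"] += 1       # digits
        elif ch.isspace():
            counts["space"] += 1       # spaces, tabs, newlines
        elif "\u4e00" <= ch <= "\u9fff":
            counts["zh"] += 1          # CJK Unified Ideographs (assumed range)
        else:
            counts["other"] += 1       # punctuation, symbols, anything else
    return counts


def count_without_symbols(text):
    """Number of letters, digits, and Chinese characters; symbols and whitespace are skipped."""
    counts = count_chars(text)
    return counts["en"] + counts["digit"] + counts["zh"]
```

For example, `count_without_symbols("Hi, 世界 42!")` adds two English letters, two Chinese characters, and two digits, and ignores the comma, the exclamation mark, and the spaces.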

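If you want to count the words in a text without counting the symbols, you can use Python's regular expression module `re`. Use `re.sub()` to replace every symbol in the text with a space, then use `split()` to break the text into a list of words, and finally take the length of that list. Here is an example:

```python
import re

text = "Hello, world! This is an example text. It contains symbols such as .,!? - but we don't want to count them."

# Replace symbols with spaces
text = re.sub(r'[^\w\s]', ' ', text)

# Split into a list of words
words = text.split()

# Count the words
word_count = len(words)

print("Word count:", word_count)
```

The output is:

```
Word count: 20
```

Note that the regular expression `[^\w\s]` matches any character that is neither a word character (letter, digit, or underscore) nor whitespace, which covers punctuation marks and other special characters. The apostrophe is one of them, so `don't` is split into `don` and `t` and counted as two words. If you want to keep certain symbols, modify the character set in the regular expression.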
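As an illustration of adjusting the character set, the sketch below keeps apostrophes that sit inside a word and also counts individual characters while skipping symbols. It is one possible variant rather than the answer's own method: the names `WORD_RE`, `CHAR_RE`, `count_words`, and `count_text_chars` are hypothetical, and it matches the wanted text with `re.findall()` instead of deleting the unwanted symbols with `re.sub()`. The class `[^\W_]` means "a word character other than the underscore".

```python
import re

# A word: letters/digits, optionally joined by apostrophes (e.g. "don't").
WORD_RE = re.compile(r"[^\W_]+(?:'[^\W_]+)*")
# A single letter or digit in any script; symbols, underscores, whitespace excluded.
CHAR_RE = re.compile(r"[^\W_]")


def count_words(text):
    """Count words, treating an apostrophe inside a word as part of it."""
    return len(WORD_RE.findall(text))


def count_text_chars(text):
    """Count letters and digits (including Chinese characters), skipping symbols."""
    return len(CHAR_RE.findall(text))


sample = ("Hello, world! This is an example text. It contains symbols "
          "such as .,!? - but we don't want to count them.")
print("Word count:", count_words(sample))
print("Character count:", count_text_chars("你好，世界！"))
```

With the apostrophe kept, `don't` is counted as a single word. For Chinese text, which has no spaces between words, `count_words` only counts runs of characters between symbols, so `count_text_chars` is usually the more meaningful measure; real word counts would need a word segmentation step.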
