首页python怎么实时语音识别文字

python怎么实时语音识别文字

时间: 2024-10-03 22:02:18 浏览: 39

在Python中实现实时语音识别并将文字化，可以使用开源库如SpeechRecognition和Google Cloud Speech-to-Text API。以下是基本步骤： 1. 安装必要的库： - `pip install SpeechRecognition` - 如果需要Google API支持，还需安装`google-cloud-speech`库，并设置Google Cloud SDK。 2. 导入所需模块： ```python import speech_recognition as sr from google.cloud import speech_v1p1beta1 as speech ``` 3. 实例化语音识别器： ```python r = sr.Recognizer() client = speech.SpeechClient() ``` 4. 配置音频输入（麦克风或其他录音设备）： ```python # 录音函数 def record_audio(): with sr.Microphone() as source: print("请开始说话...") audio = r.listen(source) return audio # Google API的音频文件识别 def transcribe_google(audio): config = speech.RecognitionConfig( encoding=speech.RecognitionConfig.AudioEncoding.LINEAR16, language_code="zh-CN", sample_rate_hertz=16000 ) response = client.recognize(config=config, audio=audio) text = "" for result in response.results: text += result.alternatives[0].transcript + " " return text.strip() # 示例 audio_data = record_audio() transcribed_text = transcribe_google(audio_data) print(f"你说的是: {transcribed_text}") ``` 请注意，实时语音识别可能会有延迟，而且网络连接对Google API的访问至关重要。同时，免费额度有限，超出后可能需要付费。

阅读全文