我正在尝试制作一个Python应用程序,使用PyAudio、SpeechRecognition和PocketSphinx可以记录音频并将其转换为英文文本。我在Mac OS X El Capitan,版本10.11.2上运行。
像这样的教程和其他教程一样,我下载了PyAudio 0.2.9版本,以及SpeechRecognition和PocketSphinx,然后安装到Conda环境中。我按照这个网站的说明,在我的OS X上使用brew install swig git python
,希望它能够帮助。
这是我的代码:
# Load packages
import speech_recognition as sr
import sphinxbase
import pocketsphinx
# obtain audio from the microphone
r = sr.Recognizer()
with sr.Microphone() as source:
print("Say something!")
audio = r.listen(source)
# write audio to a WAV file
with open("microphone-results.wav", "wb") as f:
f.write(audio.get_wav_data())
目前为止一切运作良好。我可以顺利地录制和播放我的WAV文件。但问题出在这里...
r = sr.Recognizer()
with sr.AudioFile('microphone-results.wav') as source:
audio = r.record(source) # read the entire audio file
try:
print("You said " + r.recognize_sphinx(audio))
except LookupError: # speech is unintelligible
print("Could not understand audio")
当我运行这段代码时,我获得以下错误信息:RequestError: missing PocketSphinx module: ensure that PocketSphinx is set up correctly.
我已经在全局和虚拟conda环境中都安装了PocketSphinx和Sphinxbase,但是问题仍然存在。在Google / SO帖子上关于此错误的文档接近于零,所以我不确定出了什么问题。非常感谢您能提供任何帮助/建议。以下是我在conda虚拟环境中安装的软件包及其版本列表:
# packages in environment at /Users/nathancheever/anaconda/envs/audio_2:
#
appnope 0.1.0 py27_0
backports-abc 0.4 <pip>
backports.ssl-match-hostname 3.4.0.2 <pip>
backports_abc 0.4 py27_0
decorator 4.0.9 py27_0
freetype 2.5.5 0
ipykernel 4.3.1 py27_0
ipython 4.1.2 py27_2
ipython-genutils 0.1.0 <pip>
ipython_genutils 0.1.0 py27_0
ipywidgets 4.1.1 py27_0
jinja2 2.8 py27_0
jsonschema 2.4.0 py27_0
jupyter 1.0.0 py27_2
jupyter-client 4.2.2 <pip>
jupyter-console 4.1.1 <pip>
jupyter-core 4.1.0 <pip>
jupyter_client 4.2.2 py27_0
jupyter_console 4.1.1 py27_0
jupyter_core 4.1.0 py27_0
libpng 1.6.17 0
markupsafe 0.23 py27_0
mistune 0.7.2 py27_1
nbconvert 4.1.0 py27_0
nbformat 4.0.1 py27_0
notebook 4.1.0 py27_2
openssl 1.0.2g 0
path.py 8.1.2 py27_1
pexpect 4.0.1 py27_0
pickleshare 0.5 py27_0
pip 8.1.1 py27_1
ptyprocess 0.5 py27_0
pyaudio 0.2.9 <pip>
pygments 2.1.3 py27_0
pyqt 4.11.4 py27_1
python 2.7.11 0
pyzmq 15.2.0 py27_0
qt 4.8.7 1
qtconsole 4.2.1 py27_0
readline 6.2 2
setuptools 20.6.7 py27_0
simplegeneric 0.8.1 py27_0
singledispatch 3.4.0.3 py27_0
sip 4.16.9 py27_0
six 1.10.0 py27_0
speechrecognition 3.4.2 <pip>
sphinxbase 0.8 <pip>
sqlite 3.9.2 0
ssl_match_hostname 3.4.0.2 py27_0
terminado 0.5 py27_1
tk 8.5.18 0
tornado 4.3 py27_0
traitlets 4.2.1 py27_0
wheel 0.29.0 py27_0
zlib 1.2.8 0