如何在Python中将立体声WAV文件转换为单声道？

Question

如何在Python中将立体声WAV文件转换为单声道？

pythonaudiowav

28

我不想使用其它应用程序（如sox） - 我想在纯Python中完成此操作。安装所需的Python库是可以的。

- Matt

3个回答

11

如果WAV文件是PCM编码的话，您可以使用wave。打开源文件和目标文件，读取样本，对通道进行平均处理，然后将其写出。

- Ignacio Vazquez-Abrams

1

似乎我的尝试中无法安装它，但我能够让Jiaaro的pydub工作。 - Shane

不需要安装任何东西。它随附于Python中。 - Ignacio Vazquez-Abrams

您不需要设置参数。它们将从文件中读取。 - Ignacio Vazquez-Abrams

0

我能想到的最简单的方法是使用PyTorch的mean函数，就像下面的示例一样。

import torch
import torchaudio

def stereo_to_mono_convertor(signal):
    # If there is more than 1 channel in your audio
    if signal.shape[0] > 1:
        # Do a mean of all channels and keep it in one channel
        signal = torch.mean(signal, dim=0, keepdim=True)
    return signal

# Load audio as tensor
waveform, sr = torchaudio.load('audio.wav')
# Convert it to mono channel
waveform = stereo_to_mono_convertor(waveform)

- Ladislav Vašina

网页内容由stack overflow 提供, 点击上面的

可以查看英文原文，
原文链接

- Jiaaro · Accepted Answer

我维护一个开源库pydub，它使这件事相当简单。

from pydub import AudioSegment
sound = AudioSegment.from_wav("/path/to/file.wav")
sound = sound.set_channels(1)
sound.export("/output/path.wav", format="wav")

提醒一点：它使用ffmpeg来处理音频格式转换，但如果您只使用wav格式，则可以纯粹使用Python。