FFmpeg/Libav音频解码示例

4
我正尝试使用代码将MP2文件转换为原始PCM并获得与ffmpeg/avconv相同的输出结果。我使用了FFmpeg/Libav文档中的教程中的audio_decode_example函数,但输出结果不同。
我需要在程序中添加哪些步骤才能获得与ffmpeg/avconv相同的输出结果?
编辑: 这是从avconv -v debug -i test.mp2 test.sw得到的输出结果:
avconv version v10_beta1-302-g606cc8a, Copyright (c) 2000-2014 the Libav developers
built on Mar 20 2014 20:28:32 with gcc 4.8 (Ubuntu/Linaro 4.8.1-10ubuntu9)
configuration: 
libavutil     53.  6. 0 / 53.  6. 0
libavcodec    55. 34. 1 / 55. 34. 1
libavformat   55. 12. 0 / 55. 12. 0
libavdevice   54.  0. 0 / 54.  0. 0
libavfilter    4.  3. 0 /  4.  3. 0
libavresample  1.  2. 0 /  1.  2. 0
libswscale     2.  1. 2 /  2.  1. 2
Splitting the commandline.
Reading option '-v' ... matched as option 'v' (set libav* logging level) with argument 'debug'.
Reading option '-i' ... matched as input file with argument 'test.mp2'.
Reading option 'test1.sw' ... matched as output file.                                  
Finished splitting the commandline.                                       
Parsing a group of options: global .                
Applying option v (set libav* logging level) with argument debug.
Successfully parsed a group of options.
Parsing a group of options: input file test.mp2.
Successfully parsed a group of options.
Opening an input file: test.mp2.
[mp3 @ 0x27e5060] Probed with size=2048 and score=51
[mp3 @ 0x27e5060] Estimating duration from bitrate, this may be inaccurate
Guessed Channel Layout for  Input Stream #0.0 : stereo
Input #0, mp3, from 'test.mp2':
  Duration: 00:00:04.80, start: 0.000000, bitrate: 64 kb/s
    Stream #0.0, 200, 1/14112000: Audio: mp2, 48000 Hz, stereo, s16p, 64 kb/s
Successfully opened the file.
Parsing a group of options: output file test1.sw.
Successfully parsed a group of options.
Opening an output file: test1.sw.
Successfully opened the file.
Detected 2 logical cores.
[abuffer @ 0x27e5860] tb:1/48000 samplefmt:s16p samplerate: 48000 ch layout:0x3
[abuffersink @ 0x2807040] auto-inserting filter 'auto-inserted fifo 0' between the filter 'audio format for output stream 0:0' and the filter 'output stream 0:0'
[aformat @ 0x2806bc0] auto-inserting filter 'auto-inserted resampler 0' between the filter 'Parsed filter 0 anull' and the filter 'audio format for output stream 0:0'
[AVAudioResampleContext @ 0x28092a0] audio_convert: found function: s16p to s16  (C)
[AVAudioResampleContext @ 0x28092a0] audio_convert: found function: s16p to s16  (SSE2)
[resample @ 0x2808560] fmt:s16p srate:48000 cl:stereo -> fmt:s16 srate:48000 cl:stereo
Output #0, s16le, to 'test1.sw':
  Metadata:
    encoder         : Lavf55.12.0
    Stream #0.0, 0, 1/90000: Audio: pcm_s16le, 48000 Hz, stereo, s16, 1536 kb/s
  Stream mapping:
    Stream #0:0 -> #0:0 (mp2 -> pcm_s16le)
Press ctrl-c to stop encoding
Input stream #0:0 frame changed from rate:48000 fmt:s16p ch:2 chl:stereo to rate:48000 fmt:s16 ch:2 chl:stereo
Detected 2 logical cores.
[abuffer @ 0x2806e40] tb:1/48000 samplefmt:s16 samplerate: 48000 ch layout:0x3
[abuffersink @ 0x2806fa0] auto-inserting filter 'auto-inserted fifo 0' between the filter 'audio format for output stream 0:0' and the filter 'output stream 0:0'
No more output streams to write to, finishing.
size=     900kB time=4.80 bitrate=1536.0kbits/s    
video:0kB audio:0kB global headers:0kB muxing overhead 0.000000%

1
原始的PCM输出不同吗?听起来一样吗?也许你可以尝试使用- loglevel verbose运行ffmpeg以获取更多信息。 - rogerdpack
程序的输出文件与avconv的大小相同,但声音不同(不连续且有更多噪音)。 - rxjsisfine
1
你是否正在使用FFmpeg和avconv的二进制文件?如果是,请提供完整的命令行和未经剪切的控制台输出。 - rogerdpack
好的,@rogerdpack 我已经按照你的要求做了。我正在使用从Github上编译的Libav源代码中获取的avconv。 - rxjsisfine
1
“auto-inserted resampler 0” 可能是一个线索,但这就是我所知道的全部了... - rogerdpack
1个回答

2
我遇到了同样的问题。似乎libav mp2解码默认输出S16P格式(平面格式),这在支持原始输入的各种音频编辑应用程序中并不常见(例如audacity)。我通过在AVCodecContext中指定request_sample_fmt为AV_SAMPLE_FMT_S16来解决我的问题。因此,没有额外需要进行软件重新采样。
在函数audio_decode_example中:
...
c = avcodec_alloc_context3(codec);
if (!c) {
    fprintf(stderr, "Could not allocate audio codec context\n");
    exit(1);
}
c->request_sample_fmt = AV_SAMPLE_FMT_S16;
...

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接