python获取音频文件采样率的方法

本文主要是介绍python获取音频文件采样率的方法，希望对大家解决编程问题提供一定的参考价值，需要的开发者们随着小编来一起学习吧！

在 Python 中，你可以使用多种库来获取音频文件的采样率（sampling rate）。常见的音频文件格式包括 .wav、.mp3、.flac 等。对于不同的音频文件格式，可以使用不同的库来处理。以下是一些常见的库和对应的方法：
1. 使用 wave 库处理 .wav 文件
对于 .wav 文件，你可以使用 Python 标准库中的 wave 模块来读取采样率。
import wave

def get_sampling_rate_wav(file_path):
with wave.open(file_path, 'rb') as wav_file:
# 获取采样率
sampling_rate = wav_file.getframerate()
return sampling_rate

# 示例
file_path = 'example.wav'
sampling_rate = get_sampling_rate_wav(file_path)
print(f"Sampling rate: {sampling_rate} Hz")

2. 使用 pydub 库处理多种音频格式
pydub 是一个轻量级的音频处理库，支持多种音频格式。你可以使用它来读取多种格式的音频文件，并获取采样率。
pip install pydub

from pydub import AudioSegment

def get_sampling_rate_pydub(file_path):
audio = AudioSegment.from_file(file_path)
return audio.frame_rate

# 示例
file_path = 'example.mp3'
sampling_rate = get_sampling_rate_pydub(file_path)
print(f"Sampling rate: {sampling_rate} Hz")

3. 使用 librosa 库处理多种音频格式
librosa 是一个强大的音频处理库，广泛应用于音乐和音频信号分析。它可以处理多种音频格式，并提供丰富的音频处理功能。
pip install librosa

import librosa

def get_sampling_rate_librosa(file_path):
y, sr = librosa.load(file_path, sr=None)
return sr

# 示例
file_path = 'example.flac'
sampling_rate = get_sampling_rate_librosa(file_path)
print(f"Sampling rate: {sampling_rate} Hz")

4. 使用 soundfile 库处理多种音频格式
soundfile 库也是一个很好的音频处理库，可以处理多种音频格式，并且非常轻量级。
pip install soundfile

import soundfile as sf

def get_sampling_rate_soundfile(file_path):
_, sr = sf.read(file_path, dtype='float32')
return sr

# 示例
file_path = 'example.aiff'
sampling_rate = get_sampling_rate_soundfile(file_path)
print(f"Sampling rate: {sampling_rate} Hz")

示例代码汇总
以下是各种方法的总结：
import wave
from pydub import AudioSegment
import librosa
import soundfile as sf

def get_sampling_rate_wav(file_path):
with wave.open(file_path, 'rb') as wav_file:
sampling_rate = wav_file.getframerate()
return sampling_rate

def get_sampling_rate_pydub(file_path):
audio = AudioSegment.from_file(file_path)
return audio.frame_rate

def get_sampling_rate_librosa(file_path):
y, sr = librosa.load(file_path, sr=None)
return sr

def get_sampling_rate_soundfile(file_path):
_, sr = sf.read(file_path, dtype='float32')
return sr

# 示例文件路径
file_path = 'example.wav'

# 使用 wave 库获取采样率
sampling_rate_wave = get_sampling_rate_wav(file_path)
print(f"Sampling rate (Wave): {sampling_rate_wave} Hz")

# 使用 pydub 库获取采样率
sampling_rate_pydub = get_sampling_rate_pydub(file_path)
print(f"Sampling rate (PyDub): {sampling_rate_pydub} Hz")

# 使用 librosa 库获取采样率
sampling_rate_librosa = get_sampling_rate_librosa(file_path)
print(f"Sampling rate (Librosa): {sampling_rate_librosa} Hz")

# 使用 soundfile 库获取采样率
sampling_rate_soundfile = get_sampling_rate_soundfile(file_path)
print(f"Sampling rate (SoundFile): {sampling_rate_soundfile} Hz")

选择合适的库
根据你的需求和音频文件的格式选择合适的库。对于 .wav 文件，wave 库是最简单的选择。对于其他格式，可以考虑使用 pydub、librosa 或 soundfile 库。这些库提供了更多的功能，并且支持多种音频格式。

使用 torchaudio 库来获取音频文件的采样率是一个非常直接的过程。torchaudio 是 PyTorch 社区开发的一个音频处理库，非常适合用于音频信号处理和深度学习研究。
首先，你需要安装 torchaudio 库。如果你还没有安装，可以使用以下命令安装：
pip install torchaudio

接下来，你可以使用 torchaudio 库来加载音频文件并获取采样率。下面是一个简单的示例代码：
import torchaudio

def get_sampling_rate_torchaudio(file_path):
# 加载音频文件
waveform, sample_rate = torchaudio.load(file_path)

# 返回采样率
return sample_rate

# 示例文件路径
file_path = 'example.wav' # 你可以替换为你自己的音频文件路径

# 使用 torchaudio 库获取采样率
sampling_rate = get_sampling_rate_torchaudio(file_path)
print(f"Sampling rate: {sampling_rate} Hz")

详细解释
1. 导入 torchaudio 库：
• import torchaudio 导入 torchaudio 库。
2. 定义函数 get_sampling_rate_torchaudio：
• 该函数接收一个参数 file_path，表示音频文件的路径。
3. 加载音频文件：
• 使用 torchaudio.load 函数加载音频文件，该函数返回一个包含波形数据和采样率的元组。
• waveform 是音频的波形数据，通常是一个二维张量（Tensor），形状为 [num_channels, num_samples]。
• sample_rate 是音频的采样率。
4. 返回采样率：
• 函数返回 sample_rate。
示例代码
假设你有一个名为 example.wav 的音频文件，你可以使用以下代码来获取它的采样率：
import torchaudio

def get_sampling_rate_torchaudio(file_path):
# 加载音频文件
waveform, sample_rate = torchaudio.load(file_path)

# 返回采样率
return sample_rate

# 示例文件路径
file_path = 'example.wav' # 替换为你的音频文件路径

# 使用 torchaudio 库获取采样率
sampling_rate = get_sampling_rate_torchaudio(file_path)
print(f"Sampling rate: {sampling_rate} Hz")

处理多种音频格式
torchaudio 支持多种音频格式，包括 .wav, .flac, .ogg, .mp3 等。如果需要处理其他格式的音频文件，可以直接使用相同的 torchaudio.load 方法。
完整示例
下面是一个完整的示例代码，展示了如何使用 torchaudio 获取不同格式的音频文件的采样率：
import torchaudio

def get_sampling_rate_torchaudio(file_path):
# 加载音频文件
waveform, sample_rate = torchaudio.load(file_path)

# 返回采样率
return sample_rate

# 示例文件路径
file_paths = [
'example.wav', # WAV 文件
'example.mp3', # MP3 文件
'example.flac', # FLAC 文件
'example.ogg' # OGG 文件
]

# 遍历每个文件路径
for file_path in file_paths:
try:
# 使用 torchaudio 库获取采样率
sampling_rate = get_sampling_rate_torchaudio(file_path)
print(f"Sampling rate of {file_path}: {sampling_rate} Hz")
except Exception as e:
print(f"Failed to load {file_path}: {e}")

通过上述代码，你可以轻松地获取不同格式音频文件的采样率。torchaudio 库提供了简洁的 API，使得音频文件的加载和处理变得非常方便。

这篇关于python获取音频文件采样率的方法的文章就介绍到这儿，希望我们推荐的文章对编程师们有所帮助！