datasets>=1.14.0
librosa
torchaudio
torch>=1.6