Skip to content

Latest commit

Β 

History

History
25 lines (18 loc) Β· 2.16 KB

File metadata and controls

25 lines (18 loc) Β· 2.16 KB

Audio & Speech Processing

Speech recognition, text-to-speech, audio classification, music generation, and audio ML.

Courses

Tools & Libraries

  • SpeechBrain - Open-source PyTorch toolkit for speech and audio processing. Intermediate
  • Mozilla Common Voice - Free open speech dataset for building voice applications. All Levels
  • AssemblyAI Tutorials - Practical guides on speech-to-text and audio AI. Beginner
  • Whisper (OpenAI) - Open-source speech recognition model with documentation and examples. Intermediate
  • Librosa Documentation - Python library for audio and music analysis with tutorials. Beginner
  • ESPnet - End-to-end speech processing toolkit with recipes and tutorials. Advanced

Reading & Research