What is the basic pipeline of a speech recognition system?
15-Apr-2025
Updated on 15-Apr-2025
ICSM Computer
1. Audio Input (.wav, .mp3, etc.)
2. Preprocessing
3. Feature Extraction
4. Acoustic Modeling
5. Language Modeling
6. Decoding
7. Post-processing
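To make the preprocessing and feature extraction steps above more concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the front end of any particular system: the function names are hypothetical, the 0.97 pre-emphasis coefficient and the 25 ms / 10 ms frame and hop sizes are common defaults rather than values from this answer, and per-frame log energy stands in for the richer features (such as MFCCs or log-Mel spectrograms) that real recognizers typically compute.

```python
import math

def pre_emphasis(samples, coeff=0.97):
    """Boost high frequencies: y[n] = x[n] - coeff * x[n-1]."""
    return [samples[0]] + [
        samples[i] - coeff * samples[i - 1] for i in range(1, len(samples))
    ]

def frame_signal(samples, frame_len, hop_len):
    """Split the signal into overlapping frames, dropping any partial tail."""
    last_start = len(samples) - frame_len
    return [samples[s:s + frame_len] for s in range(0, last_start + 1, hop_len)]

def hamming(n):
    """Hamming window of length n, used to taper the edges of each frame."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]

def log_energy(frame, window):
    """Log of the windowed frame energy (a deliberately simple feature)."""
    energy = sum((x * w) ** 2 for x, w in zip(frame, window))
    return math.log(energy + 1e-10)  # small offset avoids log(0) on silence

# Assumed input: one second of a synthetic 440 Hz tone sampled at 16 kHz,
# standing in for samples decoded from a .wav or .mp3 file.
sample_rate = 16000
signal = [math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(sample_rate)]

frame_len = 400  # 25 ms at 16 kHz (assumed)
hop_len = 160    # 10 ms at 16 kHz (assumed)

emphasized = pre_emphasis(signal)
frames = frame_signal(emphasized, frame_len, hop_len)
window = hamming(frame_len)
features = [log_energy(f, window) for f in frames]

print(len(frames), "frames,", len(features), "feature values")
```

The list of per-frame features is what the acoustic model would consume in step 4; swapping `log_energy` for a spectral feature changes the values but not the overall shape of the pipeline.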
End-to-End Models
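Modern systems such as Whisper and DeepSpeech use end-to-end deep learning models: a single neural network, trained on paired audio and transcripts, maps audio features directly to text and largely replaces the separately trained acoustic and language models of the traditional pipeline.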
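As a rough illustration of how an end-to-end model's output can become text, the sketch below shows greedy CTC decoding. DeepSpeech is trained with a CTC loss, while Whisper instead uses an encoder-decoder that generates tokens one at a time, so this applies only to the CTC family. The function name, the three-symbol alphabet, and the per-frame probabilities are all made up for the example; real systems usually run a beam search, often with a language model, rather than this greedy pass.

```python
def ctc_greedy_decode(frame_probs, alphabet, blank=0):
    """Pick the most likely symbol per frame, merge repeats, drop blanks."""
    best = [max(range(len(p)), key=p.__getitem__) for p in frame_probs]
    out = []
    prev = None
    for idx in best:
        # Emit a symbol only when it differs from the previous frame
        # and is not the blank symbol.
        if idx != prev and idx != blank:
            out.append(alphabet[idx])
        prev = idx
    return "".join(out)

# Hypothetical alphabet: index 0 is the CTC blank.
alphabet = ["_", "h", "i"]

# Hypothetical network output: one probability row per audio frame.
frame_probs = [
    [0.10, 0.80, 0.10],  # "h"
    [0.10, 0.70, 0.20],  # "h" again, merged with the previous frame
    [0.90, 0.05, 0.05],  # blank
    [0.10, 0.10, 0.80],  # "i"
    [0.20, 0.10, 0.70],  # "i" again, merged
]

print(ctc_greedy_decode(frame_probs, alphabet))  # prints "hi"
```

The blank symbol is what lets the model emit a genuinely doubled letter: two frames of the same symbol separated by a blank decode to two characters, while adjacent repeats collapse into one.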