Project Speech

Overview

Project Speech was my university dissertation project focused on building and evaluating a speech-to-text model. The main goal was to test whether training with generic speech, impaired speech, or a mixture of both would improve or worsen the model's performance.

Features

Custom Speech-to-Text Model: Developed and trained using Python and deep learning frameworks.
Dataset Experimentation: Compared model performance on generic speech, impaired speech, and a mixed dataset.
Performance Analysis: Evaluated accuracy and robustness across different data types.
Research Focus: Explored the impact of dataset composition and model architecture on speech recognition.

Findings

The results showed that model performance depended more on the type of model architecture used and the inclusion of a mixed dataset, rather than just the type of speech data.
Using a mixture of generic and impaired speech data generally improved robustness.

Technologies Used

Python
Deep Learning (e.g., TensorFlow or PyTorch)
Data preprocessing and augmentation

Challenges & Solutions

Data Collection: Sourced and prepared both generic and impaired speech datasets.
Model Selection: Experimented with different architectures to find the best fit.
Evaluation: Designed fair tests to compare model performance across datasets.