Benjamin Macaulay Portfolio

Project Speech

Overview

Project Speech was my university dissertation project focused on building and evaluating a speech-to-text model. The main goal was to test whether training with generic speech, impaired speech, or a mixture of both would improve or worsen the model's performance.

Features

  • Custom Speech-to-Text Model: Developed and trained using Python and deep learning frameworks.
  • Dataset Experimentation: Compared model performance on generic speech, impaired speech, and a mixed dataset.
  • Performance Analysis: Evaluated accuracy and robustness across different data types.
  • Research Focus: Explored the impact of dataset composition and model architecture on speech recognition.

Findings

  • The results showed that model performance depended more on the type of model architecture used and the inclusion of a mixed dataset, rather than just the type of speech data.
  • Using a mixture of generic and impaired speech data generally improved robustness.

Technologies Used

  • Python
  • Deep Learning (e.g., TensorFlow or PyTorch)
  • Data preprocessing and augmentation

Challenges & Solutions

  • Data Collection: Sourced and prepared both generic and impaired speech datasets.
  • Model Selection: Experimented with different architectures to find the best fit.
  • Evaluation: Designed fair tests to compare model performance across datasets.