About

This model was created to support experiments for evaluating phonetic transcription with the Buckeye corpus as part of https://github.com/ginic/multipa. It is a version of facebook/wav2vec2-large-xlsr-53 fine-tuned on a specific subset of the Buckeye corpus. For details about specific model parameters, see the model's config.json or the training scripts in the scripts/buckeye_experiments folder of the GitHub repository.
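As a rough illustration of how a fine-tuned wav2vec2 checkpoint like this one might be used, the sketch below wraps the Hugging Face transformers automatic-speech-recognition pipeline. The model ID is the one listed on this page; the helper name transcribe and the file name example.wav are hypothetical, and the sketch assumes the checkpoint loads with the standard pipeline and that transformers, torch, and ffmpeg are installed.

```python
# Model ID as listed on this page.
MODEL_ID = "ginic/full_dataset_train_4_wav2vec2-large-xlsr-53-buckeye-ipa"


def transcribe(audio_path, model_id=MODEL_ID):
    """Return the predicted transcription for one audio file.

    Hypothetical helper: assumes the checkpoint works with the standard
    transformers ASR pipeline. The import is done lazily so that merely
    defining this function does not require transformers to be installed.
    """
    from transformers import pipeline

    # Downloads the model on first use; decoding a file path needs ffmpeg.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    return asr(audio_path)["text"]


# Example call ("example.wav" is a hypothetical local recording):
# print(transcribe("example.wav"))
```

Building the pipeline once and reusing it would be preferable when transcribing many files; it is rebuilt on each call here only to keep the sketch short.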

Experiment Details

The entire train split of the Buckeye corpus was used to train this model. The only samples excluded are those in the train split that are too short (< 0.1 seconds) or too long (> 12 seconds) to be used for training.
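The duration filter described above can be sketched as follows. This is a minimal illustration, not the project's actual preprocessing code: the function names are hypothetical, and the 16 kHz sampling rate is an assumption based on the usual input rate for wav2vec2 models. Only the 0.1 second and 12 second thresholds come from the description above.

```python
MIN_SECONDS = 0.1   # samples shorter than this are excluded
MAX_SECONDS = 12.0  # samples longer than this are excluded


def duration_seconds(num_samples, sampling_rate):
    """Length of an audio clip in seconds."""
    return num_samples / sampling_rate


def keep_for_training(num_samples, sampling_rate=16_000):
    """Return True if a clip is neither too short nor too long.

    Hypothetical helper; the default sampling rate is an assumption.
    """
    duration = duration_seconds(num_samples, sampling_rate)
    return MIN_SECONDS <= duration <= MAX_SECONDS


# Clip lengths in samples at 16 kHz: 0.05 s, 1 s, and 13 s.
clips = [800, 16_000, 208_000]
kept = [n for n in clips if keep_for_training(n)]
```

Here only the 1 second clip passes the filter.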

Goals:

Include the largest amount of training data possible.
Allow evaluation on a different corpus (e.g. TIMIT, Speech Accent Archive) to test generalization to other dialects and language varieties.
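One common way to score transcriptions on a held-out corpus is an edit-distance-based error rate over phone symbols. The sketch below is an assumption about how such an evaluation could look, not the metric used by the multipa experiments; the function names are hypothetical, and transcriptions are assumed to be already split into lists of phone symbols.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences of phone symbols."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def phone_error_rate(ref, hyp):
    """Edit distance normalized by the reference length."""
    if not ref:
        raise ValueError("reference transcription must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# One substituted vowel out of three reference phones.
rate = phone_error_rate(["k", "æ", "t"], ["k", "ɑ", "t"])
```

Treating each phone as one list element, rather than comparing raw strings, avoids counting a diacritic or a multi-character symbol as a separate error.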

Model size: 0.3B params. Tensor type: F32 (Safetensors).

This model (ginic/full_dataset_train_4_wav2vec2-large-xlsr-53-buckeye-ipa) is part of the Wav2IPA collection: tools and models built as part of the Wav2IPA project at University of Massachusetts, Amherst.