https://github.com/facebookresearch/fairseq/tree/main/examples/mms

Latest commit: Update README.md (#5118), af12c9c, vineelpratap, May 22, 2023
# MMS: Scaling Speech Technology to 1000+ languages

The Massively Multilingual Speech (MMS) project expands speech technology from about 100 languages to over 1,000 by building a single multilingual speech recognition model supporting over 1,100 languages (more than 10 times as many as before), language identification models able to identify over 4,000 languages (40 times more than before), pretrained models supporting over 1,400 languages, and text-to-speech models for over 1,100 languages. Our goal is to make it easier for people to access information and to use devices in their preferred language.

You can find details in the paper Scaling Speech Technology to 1000+ languages and the blog post.

An overview of the languages covered by MMS can be found here.

## Pretrained models

| Model | Link |
|---|---|
| MMS-300M | download |
| MMS-1B | download |

Example commands to finetune the pretrained models can be found here.

## Finetuned models

### ASR

| Model | Languages | Dataset | Model download | Supported languages |
|---|---|---|---|---|
| MMS-1B:FL102 | 102 | FLEURS | download | download |
| MMS-1B:L1107 | 1107 | MMS-lab | download | download |
| MMS-1B-all | 1162 | MMS-lab + FLEURS + CV + VP + MLS | download | download |

### TTS

1. Download the list of iso codes of 1107 languages.
2. Find the iso code of the target language and download the checkpoint. Each folder contains 3 files: `G_100000.pth`, `config.json`, `vocab.txt`.
`G_100000.pth` is the generator trained for 100K updates, `config.json` is the training config, and `vocab.txt` is the vocabulary for the TTS model.

    # Examples:
    wget https://dl.fbaipublicfiles.com/mms/tts/eng.tar.gz # English (eng)
    wget https://dl.fbaipublicfiles.com/mms/tts/azj-script_latin.tar.gz # North Azerbaijani (azj-script_latin)

### LID

| # Languages | Dataset | Model | Dictionary | Supported languages |
|---|---|---|---|---|
| 126 | FLEURS + VL + MMS-lab-U + MMS-unlab | download | download | download |
| 256 | FLEURS + VL + MMS-lab-U + MMS-unlab | download | download | download |
| 512 | FLEURS + VL + MMS-lab-U + MMS-unlab | download | download | download |
| 1024 | FLEURS + VL + MMS-lab-U + MMS-unlab | download | download | download |
| 2048 | FLEURS + VL + MMS-lab-U + MMS-unlab | download | download | download |
| 4017 | FLEURS + VL + MMS-lab-U + MMS-unlab | download | download | download |

## Commands to run inference

### ASR

Run this command to transcribe one or more audio files:

    cd /path/to/fairseq-py/
    python examples/mms/asr/infer/mms_infer.py --model "/path/to/asr/model" --lang lang_code --audio "/path/to/audio_1.wav" "/path/to/audio_2.wav"

For more advanced configuration and to calculate CER/WER, you can prepare a manifest folder in this format:

    $ ls /path/to/manifest
    dev.tsv
    dev.wrd
    dev.ltr
    dev.uid

    # dev.tsv each line contains