https://github.com/innovatorved/whisper.api

innovatorved / whisper.api (Public, 171 stars, 3 forks)

This project provides an API with user-level access support to transcribe speech to text using a fine-tuned and processed Whisper ASR model.

Demo: innovatorved-whisper-api.hf.space/

Branch: main (1 branch, 0 tags, 53 commits). Latest commit: 3dea319 by innovatorved, Aug 22, 2023, "README.md updated".
| Name | Latest commit message | Commit time |
| --- | --- | --- |
| app | config updated | August 21, 2023 16:45 |
| audio | Added Transcribe support | August 14, 2023 17:14 |
| binary | Updated v3 | August 15, 2023 02:24 |
| models | Added Transcribe support | August 14, 2023 17:14 |
| transcribe | Added Transcribe support | August 14, 2023 17:14 |
| .env.example | things gonna change | August 13, 2023 20:36 |
| .gitignore | .gitignore updated | August 18, 2023 18:39 |
| CONTRIBUTING.md | CONTRIBUTING.md added | August 22, 2023 23:02 |
| Dockerfile | update dockerfiel | August 21, 2023 00:56 |
| Pipfile | initial setup correct | August 12, 2023 23:03 |
| README.md | README.md updated | August 22, 2023 23:03 |
| requirements.txt | Dockerfile updated for deploy | August 21, 2023 00:35 |

README.md

| title | emoji | colorFrom | colorTo | sdk | app_file | app_port |
| --- | --- | --- | --- | --- | --- | --- |
| whisper.api | [?] | purple | gray | docker | Dockerfile | 7860 |

# Whisper API - Speech to Text Transcription

This open source project provides a self-hostable API for speech-to-text transcription using a fine-tuned Whisper ASR model. The API converts audio files to text through HTTP requests, which makes it a good fit for adding speech recognition to your applications.

Key features:

* Uses a fine-tuned Whisper model for accurate speech recognition
* Simple HTTP API for audio file transcription
* User-level access with API keys for managing usage
* Self-hostable code for running your own speech transcription service
* Quantized models for fast and efficient inference
* Open source implementation for customization and transparency

This repository contains the code to deploy the API server, along with code for fine-tuning and quantizing models. Check out the documentation for getting started!
## Installation

To install the necessary dependencies, run the following commands:

    # Install ffmpeg for audio processing
    sudo apt install ffmpeg

    # Install the Python packages
    pip install -r requirements.txt

## Running the Project

To run the project, use the following command:

    uvicorn app.main:app --reload

## Get Your token

To get your token, use the following command:

    curl -X 'POST' \
      'https://innovatorved-whisper-api.hf.space/api/v1/users/get_token' \
      -H 'accept: application/json' \
      -H 'Content-Type: application/json' \
      -d '{
      "email": "example@domain.com",
      "password": "password"
    }'

## Example to Transcribe a File

The following models are available:

* tiny.en
* tiny.en.q5
* base.en.q5

To upload a file and transcribe it, use the command below.

Note: The token in this example is a dummy token and will not work. Please use the token provided by the admin.

    # Modify the token and audioFilePath
    curl -X 'POST' \
      'http://localhost:8000/api/v1/transcribe/?model=tiny.en.q5' \
      -H 'accept: application/json' \
      -H 'Authentication: e9b7658aa93342c492fa64153849c68b8md9uBmaqCwKq4VcgkuBD0G54FmsE8JT' \
      -H 'Content-Type: multipart/form-data' \
      -F 'file=@audioFilePath.wav;type=audio/wav'

## License

MIT

## Reference & Credits

* https://github.com/openai/whisper
* https://openai.com/blog/whisper/
* https://github.com/ggerganov/whisper.cpp

## Authors

* Ved Gupta

## About Me

I'm a developer. I feel the code first, then write it.

## Support

For support, email vedgupta@protonmail.com

Topics: whisper, asr, transcribe, innovatorved

Languages: Python 96.7%, Dockerfile 3.3%
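For Python callers, the two curl requests from the "Get Your token" and "Example to Transcribe a File" sections can be sketched with the standard library alone. This is a minimal sketch, not the project's own client: the function names, the `BASE_URL` default, and the assumption that both endpoints reply with JSON are illustrative and not confirmed by the README. The endpoint paths, the `Authentication` header, the `file` form field, and the model names are taken from the curl commands above.

```python
import json
import uuid
import urllib.request

# Assumption: the server was started locally with `uvicorn app.main:app --reload`.
BASE_URL = "http://localhost:8000"

# Model names listed in the README.
AVAILABLE_MODELS = ("tiny.en", "tiny.en.q5", "base.en.q5")


def build_token_request(email, password, base_url=BASE_URL):
    """Build the POST request that asks the API for a user token."""
    body = json.dumps({"email": email, "password": password}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/api/v1/users/get_token",
        data=body,
        headers={"accept": "application/json", "Content-Type": "application/json"},
        method="POST",
    )


def build_transcribe_request(token, filename, audio_bytes,
                             model="tiny.en.q5", base_url=BASE_URL):
    """Build the multipart/form-data POST request that uploads a WAV file."""
    if model not in AVAILABLE_MODELS:
        raise ValueError(f"unknown model {model!r}; expected one of {AVAILABLE_MODELS}")

    # Encode a single form field named "file", mirroring curl's -F option.
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: audio/wav\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")

    return urllib.request.Request(
        f"{base_url}/api/v1/transcribe/?model={model}",
        data=head + audio_bytes + tail,
        headers={
            "accept": "application/json",
            "Authentication": token,  # header name as used in the curl example
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
        method="POST",
    )


def send(request):
    """Send a prepared request and decode the body (assumed to be JSON)."""
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


# Example usage (needs a running server and a valid token):
#   with open("audioFilePath.wav", "rb") as f:
#       result = send(build_transcribe_request("<your token>", "audioFilePath.wav", f.read()))
#   print(result)
```

Building the request separately from sending it keeps the URL, headers, and body easy to inspect before anything goes over the network. Pass `base_url="https://innovatorved-whisper-api.hf.space"` to target the hosted instance used in the token example.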