https://github.com/innovatorved/whisper.api

Skip to content Toggle navigation
 
Sign up

  * Product
      +  
        Actions
        Automate any workflow
      +  
        Packages
        Host and manage packages
      +  
        Security
        Find and fix vulnerabilities
      +  
        Codespaces
        Instant dev environments
      +  
        Copilot
        Write better code with AI
      +  
        Code review
        Manage code changes
      +  
        Issues
        Plan and track work
      +  
        Discussions
        Collaborate outside of code
    Explore
      + All features
      + Documentation
      + GitHub Skills
      + Blog
  * Solutions
    For
      + Enterprise
      + Teams
      + Startups
      + Education
    By Solution
      + CI/CD & Automation
      + DevOps
      + DevSecOps
    Resources
      + Customer Stories
      + White papers, Ebooks, Webinars
      + Partners
  * Open Source
      +  
        GitHub Sponsors
        Fund open source developers
      +  
        The ReadME Project
        GitHub community articles
    Repositories
      + Topics
      + Trending
      + Collections
  * Pricing

Search or jump to...

Search code, repositories, users, issues, pull requests...

Search
[                    ]
Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

[                    ] [ ] Include my email address so I can be
contacted
Cancel Submit feedback

Saved searches

Use saved searches to filter your results more quickly

Name [                    ] 
Query [                    ]

To see all available qualifiers, see our documentation.

Cancel Create saved search
Sign in
Sign up
You signed in with another tab or window. Reload to refresh your
session. You signed out in another tab or window. Reload to refresh
your session. You switched accounts on another tab or window. Reload
to refresh your session.
{{ message }}
innovatorved / whisper.api Public

  * Notifications
  * Fork 3
  * Star 171

This project provides an API with user level access support to
transcribe speech to text using a finetuned and processed Whisper ASR
model.

innovatorved-whisper-api.hf.space/
171 stars 3 forks Activity
Star
Notifications

  * Code
  * Issues 0
  * Pull requests 1
  * Actions
  * Projects 0
  * Security
  * Insights

More

  * Code
  * Issues
  * Pull requests
  * Actions
  * Projects
  * Security
  * Insights

innovatorved/whisper.api

This commit does not belong to any branch on this repository, and may
belong to a fork outside of the repository.
main
Switch branches/tags
[                    ]
Branches Tags
Could not load branches
Nothing to show
{{ refName }} default View all branches
Could not load tags
Nothing to show
{{ refName }} default
View all tags

Name already in use

A tag already exists with the provided branch name. Many Git commands
accept both tag and branch names, so creating this branch may cause
unexpected behavior. Are you sure you want to create this branch?
Cancel Create
1 branch 0 tags
Code

  * Local
  * Codespaces

  *  
    Clone
    HTTPS GitHub CLI
    [https://github.com/i]

    Use Git or checkout with SVN using the web URL.

    [gh repo clone innova]

    Work fast with our official CLI. Learn more about the CLI.

  * Open with GitHub Desktop
  * Download ZIP

Sign In Required

Please sign in to use Codespaces.

Launching GitHub Desktop

If nothing happens, download GitHub Desktop and try again.

Launching GitHub Desktop

If nothing happens, download GitHub Desktop and try again.

Launching Xcode

If nothing happens, download Xcode and try again.

Launching Visual Studio Code

Your codespace will open once ready.

There was a problem preparing your codespace, please try again.

Latest commit

@innovatorved
innovatorved README.md updated
...
3dea319 Aug 22, 2023
README.md updated
3dea319

Git stats

  * 53 commits

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
app
config updated
August 21, 2023 16:45
audio
Added Transcribe support
August 14, 2023 17:14
binary
Updated v3
August 15, 2023 02:24
models
Added Transcribe support
August 14, 2023 17:14
transcribe
Added Transcribe support
August 14, 2023 17:14
.env.example
things gonna change
August 13, 2023 20:36
.gitignore
.gitignore updated
August 18, 2023 18:39
CONTRIBUTING.md
CONTRIBUTING.md added
August 22, 2023 23:02
Dockerfile
update dockerfiel
August 21, 2023 00:56
Pipfile
initial setup correct
August 12, 2023 23:03
README.md
README.md updated
August 22, 2023 23:03
requirements.txt
Dockerfile updated for deploy
August 21, 2023 00:35
View code
[                    ]
Whisper API - Speech to Text Transcription Installation Running the
Project Get Your token Example to Transcribe a File License Reference
& Credits Authors  About Me Support

README.md

   title    emoji colorFrom colorTo  sdk    app_file  app_port
whisper.api [?]  purple    gray    docker Dockerfile 7860

 Whisper API - Speech to Text Transcription

This open source project provides a self-hostable API for speech to
text transcription using a finetuned Whisper ASR model. The API
allows you to easily convert audio files to text through HTTP
requests. Ideal for adding speech recognition capabilities to your
applications.

Key features:

  * Uses a finetuned Whisper model for accurate speech recognition
  * Simple HTTP API for audio file transcription
  * User level access with API keys for managing usage
  * Self-hostable code for your own speech transcription service
  * Quantized model optimization for fast and efficient inference
  * Open source implementation for customization and transparency

This repository contains code to deploy the API server along with
finetuning and quantizing models. Check out the documentation for
getting started!

 Installation

To install the necessary dependencies, run the following command:

# Install ffmpeg for Audio Processing
sudo apt install ffmpeg

# Install Python Package
pip install -r requirements.txt

 Running the Project

To run the project, use the following command:

uvicorn app.main:app --reload

 Get Your token

To get your token, use the following command:

curl -X 'POST' \
  'https://innovatorved-whisper-api.hf.space/api/v1/users/get_token' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "email": "example@domain.com",
  "password": "password"
}'

 Example to Transcribe a File

To upload a file and transcribe it, use the following command: Note:
The token is a dummy token and will not work. Please use the token
provided by the admin.

Here are the available models:

  * tiny.en
  * tiny.en.q5
  * base.en.q5

# Modify the token and audioFilePath
curl -X 'POST' \
  'http://localhost:8000/api/v1/transcribe/?model=tiny.en.q5' \
  -H 'accept: application/json' \
  -H 'Authentication: e9b7658aa93342c492fa64153849c68b8md9uBmaqCwKq4VcgkuBD0G54FmsE8JT' \
  -H 'Content-Type: multipart/form-data' \
  -F 'file=@audioFilePath.wav;type=audio/wav'

 License

MIT

 Reference & Credits

  * https://github.com/openai/whisper
  * https://openai.com/blog/whisper/
  * https://github.com/ggerganov/whisper.cpp

 Authors

  * Ved Gupta

  About Me

I'm a Developer i will feel the code then write.

 Support

For support, email vedgupta@protonmail.com

About

This project provides an API with user level access support to
transcribe speech to text using a finetuned and processed Whisper ASR
model.

innovatorved-whisper-api.hf.space/

Topics

whisper asr transcribe innovatorved

Resources

Readme
Activity

Stars

171 stars

Watchers

3 watching

Forks

3 forks
Report repository

Languages

  * Python 96.7%
  * Dockerfile 3.3%

Footer

 (c) 2023 GitHub, Inc.

Footer navigation

  * Terms
  * Privacy
  * Security
  * Status
  * Docs
  * Contact GitHub
  * Pricing
  * API
  * Training
  * Blog
  * About

You can't perform that action at this time.