Project Title

Subtitle Generation with Faster Whisper

Description

This project utilizes HDemucs and WhisperX to generate subtitles from audio files. The pipeline is divided into three steps:

Audio extraction using ffmpeg.
Audio separation using HDemucs
Subtitle generation using WhisperX (Speech-to-text + Alignment). The main goal of this project is to provide a simple and efficient way to generate subtitles from audio files, also allowing future users to extend or replace the current steps of the pipeline.

Installation

Follow the instructions on WhisperX installation's page for installing the required dependencies.

Usage

The library is still WIP, so the usage will be updated continuously. The current way to run the pipeline is to run the main.py file, which will run the pipeline.py module for handling all the steps.

python main.py

Planned Improvements:

Complete the lib API and config:
- A YAML config file is WIP: there will be stored all the default parameters.
- The API will be in the main.py file, allowing to run the pipeline thorugh:
  - CLI.
  - Python import (PIP install).
  - Docker.
Update WhisperX to match new CTranslate and Torch, for easier setup.
Make the library more modular, configurable and extensible, allowing new users to replace or extend the current steps of the pipeline.
Develop and tune a new model for direct subtitle generation, capable of handling complex and long audios even in presence of background effects or silences, without relying on external tools such as VADs or vocals separation.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
configs		configs
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
pipeline.py		pipeline.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project Title

Description

Table of Contents

Installation

Usage

Planned Improvements:

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

jgurakuqi/auto-subtitles-generator

Folders and files

Latest commit

History

Repository files navigation

Project Title

Description

Table of Contents

Installation

Usage

Planned Improvements:

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages