×

注意!页面内容来自https://obsproject.com/forum/resources/localvocal-local-live-captions-translation-on-the-go.1769/,本站不储存任何内容,为了更好的阅读体验进行在线解析,若有广告出现,请及时反馈。若您觉得侵犯了您的利益,请通知我们进行删除,然后访问 原网页

You are using an out of date browser. It may not display this or other websites correctly.
You should upgrade or use an alternative browser.
LocalVocal: Local Live Captions & Translation On-the-Go

LocalVocal: Local Live Captions & Translation On-the-Go v0.5.3

Supported Bit Versions
  1. 64-bit
Source Code URL
https://github.com/royshil/obs-localvocal
Minimum OBS Studio Version
30.2.0
Supported Platforms
  1. Windows
  2. Mac OS X
  3. Linux
LocalVocal plugin allows you to transcribe & translate speech into text locally on your machine in real time. ✅ No GPU required*✅ no cloud costs✅ no network and ✅ minimal lag! Privacy first - all data stays on your machine. (* GPU acceleration via CUDA or AMD is supported!) Sponsor my work https://github.com/sponsors/royshil ^_^

If this plugin has been valuable to you consider adding a ⭐ to the GH repo or rating it here on OBS.

localvocal rt translation.png localvocal quickstart.png

Do more with LocalVocal:

The plugin adds an Audio Filter - use it on a speech source (micvideo) to get a transcription. Send the captions to a Text Source to show on scene.

Current Features:
  • Transcribe audio to text in real time in 100 languages
  • Display captions on screen using text sources
  • Send captions to a .txt or .srt file (to read by external sources or video playback) with and without aggregation option
  • Sync'ed captions with OBS recording timestamps
  • Send captions on a RTMP stream to e.g. YouTubeTwitch
  • Bring your own Whisper model (any GGML)
  • Translate captions in real time to major languages (both Whisper built-in translation as well as NMT models with CTranslate2)
  • CUDAOpenCLApple Arm64AVX & SSE acceleration support
Roadmap:
  • More robust built-in translation options
  • Additional output options: .vtt.ssa.subetc.
  • Speaker diarization (detecting speakers in a multi-person audio stream)
Internally the plugin is running a neural network (OpenAI Whisper) locally to predict in real time the speech and provide captions.

It's using the Whisper.cpp project from ggerganov to run the Whisper network in a very efficient way on CPUs and GPUs. For translation it's using CTranslate2 and the M2M100 model.

Check out my other plugins:
  • Background Removal removes background from webcam without a green screen.
  • Detect will detect and track >80 types of objects in real-time inside OBS
  • URL/API Source that allows fetching live data from an API and displaying it in OBS.
  • localvocal partial.png
    localvocal partial.png
    587.9 KB · Views: 15,345
Author
royshilkrot
Downloads
66,413
Views
236,842
First release
Last update
Rating
4.00 star(s) 12 ratings

More resources from royshilkrot

Latest updates

  1. v0.5.3

    What's Changed Update whisper dependencies to 0.0.12 by @Tabby in #264 Updates Vulkan SDK on...
  2. v0.5.2

    What's Changed Fix MacOS 12 crashes and codesigning issues by @Tabby in #262 [EDIT] Previous...
  3. v0.5.1

    Small re-release to fix missing installers from the last release What's Changed Fix release...

Latest reviews

process of model downloading stuck in 0%how to solve
Doesn't work as intended and the author might've retired this project. Do not install
So the plugin works quite well in itself...what I'm missinghoweveris documentation. If there is anyI can't find it.
It is very difficult to find the right model that works perfectly. Sometimes words come out that I didn't say or that don't sound like what I said. But on the whole it works very well. If now the possibility to integrate the AI better instead of using a tokenI would also be happybecause I think that the AI could understand better than the internal models. I hope the plugin will be developed furtherespecially for me as a German-speaking streamer who doesn't speak English wellthis tool is really worth its weight in goldbecause it also keeps English-speaking viewers there.
Wow. Just wow.
Something I dreamed ofand the implementation is so polished !
Downloading of models straight from within the settingsparameters details only showing when you activate certain functions.

Suggestion :
- The list of available models to be dynamically linked to the GitHub (to keep the models always on the latest version without having to manually push a plugin update)
- Hover tooltips would be great. There are a lot of parameters.
royshilkrot
royshilkrot
wonderful suggestions! yes i've been thinking about making the "model directory" be something dynamic so i can update easily. plus there are literally 100s of available models on hugging face... I will implement this soon.
It does the job 90% greatStill need to get the AMD accelerated version workingon my nvidia machineit's quickon CPU without being able to get AMD GPU workingit's mehand also occasionally it will just start repeating previous stuff. Like the buffer isn't cleared out.
I've been meaning to set up closed captions on my stream for agesbut never knew how to do it. It only took me about 5-minutes with LocalVocal. The default model is efficient enough to have no negative effect on my streamwhile giving great closed captions. Thanks for making thisRoy!
Its a good plugin. It runs an AI Model in the background that process in real time the audio from the microphone and generate transcriptions to a label in OBS or to a file.

I'd like to have both options at the same timebut I guess that they are working on :)
Impossible to work
i7 4790 K and 32 RAM
royshilkrot
royshilkrot
I'm sorry this isn't working for you right away please reach out https://discord.gg/CJHr5zHXD3 and I will help you set it up
Does exactly what it says on the tin! An amazing tool.
This is brilliant. The fully-local implementation of speech-to-text already works very well.

I can't wait to see what transpires as this matures.
Top