Whisper agents
This page lists every AI agent in the MeshKore directory tagged with the Whisper capability. Agents are sourced from public platforms (GitHub, Hugging Face, npm, PyPI, awesome-list curations, and direct submissions), normalized by the MeshKore worker, and ranked by GitHub stars. Each card links to the agent's profile with details on capabilities, framework, language, freshness, and source attribution.
473 agents in this capability · ranked by popularity
Top 200 Whisper agents
Faster Whisper transcription with CTranslate2
OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go
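🎒 Feishu × (GPT-4 + GPT-4V + DALL·E-3 + Whisper) = a work experience that flies 🚀 Voice conversations, role play, multi-topic discussion, image creation, spreadsheet analysis, document export 🚀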
Low-latency AI engine for mobile devices & wearables
Mac app for crushing tech interviews with AI
A nearly-live implementation of OpenAI's Whisper.
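「妙幕」 is a cross-platform client tool that batch-generates subtitle files for video or audio and supports subtitle translation through multiple providers, including Baidu, Volcano Engine, OpenAI, Ollama, and DeepSeek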
🤖 A Telegram bot that integrates with OpenAI's official ChatGPT APIs to provide answers, written in Python
Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.
Voice-to-text dictation app with local (Nvidia Parakeet/Whisper) and cloud models (BYOK). Privacy-first and…
OpenAI API + Ruby! 🤖❤️ GPT-5 & Realtime WebRTC compatible!
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
.NET library for the OpenAI service API by Betalgo Ranul
faster_whisper GUI with PySide6
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
Text-To-Speech, RAG, and LLMs. All local!
OpenAI API client for Kotlin with multiplatform and coroutines capabilities.
👻 Proxy API gateway for Kiro IDE & CLI (Amazon Q Developer / AWS CodeWhisperer). Use free Claude models with…
Using OpenAI's Whisper to automatically generate YouTube subtitles
The TypeScript library for building AI applications.
Whisper command line client compatible with original OpenAI client based on CTranslate2.
Natural voice conversations with Claude Code
AI Vtuber for Streaming on Youtube/Twitch
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Open source voice dictation technology
Generate subtitles, summaries, and chapters from videos in seconds
An unofficial OpenAI Unity Package that aims to help you use OpenAI API directly in Unity Game engine.
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS…
React Native binding of whisper.cpp.
🎤 The easiest way to transcribe audio in Swift
Your CrewAI Powered Video Editing Assistant
Running speech to text model (whisper.cpp) in Unity3d on your local machine.
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for…
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Conversational voice AI agents
Real-time transcription using faster-whisper
Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks
[CVPR 2025] Video Narration as Vocabulary & Video as Long Document
Effortlessly add AI-generated transcription subtitles to your videos
Mac compatible Ollama Voice
Private on-device AI suite for Android. Fork of Google AI Edge Gallery with llama.cpp, whisper.cpp…
An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and…
Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over…
Your personal voice assistant based on OpenAI ChatGPT.
End-to-end platform for building voice first multimodal agents
The best way to use AI is on your own computer. Use local or paid API models, and ctrl+k to show/hide the…
A Java SDK for quickly integrating large AI models into applications, unifying models from multiple platforms such as OpenAI, Zhipu (ChatGLM), DeepSeek, Moonshot (Kimi), Tencent Hunyuan, 01.AI, and more…
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight…
Java client library for OpenAI API. Full support for all OpenAI API models including Completions, Chat, Edits…
How to use OpenAI's Whisper to transcribe and diarize audio files
Alfred workflow using ChatGPT, DALL·E 2 and other models for chatting, image generation and more.
⚡ Edgen: Local, private GenAI server alternative to OpenAI. No GPU required. Run AI models locally: LLMs…
Program that lets you ask questions about your documents, audio, and video files.
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
An API to transcribe audio with OpenAI's Whisper Large v3!
Simple self-hosted web application, which can be used to convert audio to subtitles by OpenAI's Whisper model
Low-latency AI companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming
🎩 An Alfred 5 Workflow for using OpenAI Chat API to interact with GPT models 🤖💬 It also allows image…
AI-powered tool for real-time interview question transcription and response generation.
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI…
OpenAI (and DeepSeek, Azure OpenAI, YandexGPT, Ollama, GigaChat, Qwen) API wrapper for Delphi. Use ChatGPT…
Node.js bindings for OpenAI's Whisper. (C++ CPU version by ggerganov)
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
Transcribe is a real-time transcription, conversation, and language-learning platform. It provides live…
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered…
She's the AI agent you come home to.
The definitive, open-source Swift framework for interfacing with generative AI.
AI-powered eSports logo generator (for academic purposes during the Tenerife GG event)
Open-source, fully private and local alternative to NotebookLM. Chat with your documents, generate audio…
NodeJS Bindings for Whisper - the CPU version of OpenAI's Whisper, as initially crafted in C++ by ggerganov.
A powerful Whisper AI keyboard for reliable speech transcription
Full stack voice chatbot
A quick experiment to achieve almost realtime transcription using Whisper.
A sample web app using OpenAI Whisper to transcribe audio built on Next.js. It records audio continuously for…
According to all known laws of aviation, there is no way that a bee should be able to fly. Its wings are too…
Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak…
Flutter App That Can Transcribe Audio Offline/On Device with Whisper C++ Bindings via Rust
OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as…
Unleash the power of Chatty: the intersection of ChatGPT’s intelligence, DALL·E's creativity, and Whisper's…
This repository contains a Python script that allows users to download the audio from a YouTube video…
Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using…
A voice chatbot based on GPT4All and talkGPT, running on your local pc!
Automatically generate viral-ready vertical short clips from long-form gameplay footage using AI-powered…
AI video summarizer and knowledge base for Bilibili, YouTube and local…
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly…
True on-device AI for Kotlin Multiplatform (Android, iOS, Desktop, JVM, WASM). LLM, Speech-to-Text and Image…
Input a YouTube video link or upload a video file and get a video with subtitles.
.NET 7 SDK for OpenAI with a Blazor Server playground
🛡 Installer for an unblocker of foreign AI services (and more) for Russia on Windows 10/11 🌍
An Android keyboard that performs speech-to-text (STT/ASR) with OpenAI Whisper and inputs the recognized text…
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
Voice-to-text CLI for terminal users
OpenAI Whisper API based on Node.js / Bun.sh in a Docker Container + Google Cloud Run Example
Make local AI toys, robots, and devices with a MacBook and an Arduino ESP32
303 Chinese-language AI/LLM lecture notes, with online reading, PDF download, and LaTeX source viewing | Stanford CS336/CS224R/CS25 | Berkeley LLM Agents | Agent engineering practice
Short code for dictation using OpenAI Whisper for transcription.
🦞 Open-source browser-based voice chat for AI assistants. Self-hosted, private, free. Whisper STT +…
Push to talk voice recognition using Whisper
Telegram LLM bot backed by OpenAI, Whisper, Beam, LLaMA, Weaviate, MinIO and MongoDB
OpenAi-Sora (SoraFlows) is an open-source, cross-platform web application for AI-powered video creation and…
A curated list of awesome OpenAI's Whisper
Video2Text - Easily convert your video to text
"Chat With Any Video" project in 24 hours, challenge myself to complete in @Supabase's AI Hackathon.
Supercharged Claude Code Official Telegram plugin — threading, voice messages 2 ways, stickers, GIFs…
Speech to Text using a Docker image with ggerganov/whisper.cpp
Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)
🎥 Youtube Video Summarizer and Question Answering App Using Whisper and Langchain
HACS custom integration for using Whisper speech-to-text (OpenAI, GroqCloud or Mistral) API in the Assist…
Next.js app for serverless deployments of OpenAI Whisper on Banana.dev
A simple, lightweight library that wraps the OpenAI API.
Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate…
Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.
Shell wrapper for OpenAI's ChatGPT, Whisper, and TTS. Features LocalAI, Ollama, Gemini, Anthropic, and more.
Batch convert video to text using OpenAI's Whisper or the local Core ML model via whisper.cpp on your MacBook
openai/whisper + extra features
Meeper 📝 - is your secretary for any in-browser conference.
Create subtitles with ease, using Whisper AI for Windows
On-device AI SDK for Flutter — LLM inference, vision, STT, TTS, image generation, embeddings, RAG, and…
Agent-first CLI for audio/video transcription via Whisper
Open source, local first AI medical scribe for desktop and web.
A sample speech transcription app implementing OpenAI Text to Speech API based on Whisper, an automatic…
Fast audio/video transcription using OpenAI's Whisper and Modal; an hour-long audio/video file can be transcribed in…
100% free, local & offline voice assistant with speech recognition
Chrome extension for voice-to-text conversations with ChatGPT using OpenAI Whisper API
A Python package for the Whisper normalizer
Record, transcribe, and transform voice notes into structured insights. Leverage Whisper or AssemblyAI and…
Gradio web UI implementation of Whisper, an automatic speech recognition (ASR) system
Real-time speech recognition & AI-powered note-taking app for macOS with offline/online modes, multilingual…
A very simple Whisper Python FastAPI for OpenAI API, Android voice-typing (konele), Home Assistant (wyoming)…
OpenAI API and Whisper based Video Translation
Simple GUI around whisper.cpp for voice-to-text on Linux
iOS & watchOS speech-to-text app with AI voice keyboard, on-device RAG, and chat with your notes - powered by…
SirChatalot is a Telegram bot leveraging ChatGPT, Claude or YandexGPT. It uses Whisper for speech-to-text and…
Unofficial Deno wrapper for the OpenAI API
Production-ready audio and video transcription app that can run on your laptop or in the cloud.
ZeusHammer - AI Super Agent with Local Brain, Voice Interaction & Three-Tier Memory
Talk to your second brain personal assistant using speech 🧠
Web app enabling users to either record or upload audio files. Then utilizing OpenAI API (Whisper, GPT4)…
A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the…
A Go client and CLI for the OpenAI APIs, focused on developer friendliness and convenience atop the basic…
A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that…
Audio to summary with OpenAI Whisper & GPT 3.5/4 using Streamlit
Automatically subtitle any video spoken in any language to a language of your choice using AI.
Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other…
An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging…
A Korean-language tutorial based on the official OpenAI documentation, the Cookbook, and other practical examples. With this tutorial, the Python OpenAI API becomes easier and more effective to…
An intelligent AI assistant that can do anything!
A curated collection of tools to aid transcriptionists and subtitlers.
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
13 projects using ChatGPT API, Whisper, Embeddings, and DALL-E with Python.
Simple RAG tutorials that can be run locally or using Google Colab (only Pro version).
This is a collection of AI ecosystem, which gathers and organizes various interesting and useful AI-related…
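Reborn as an AI worker. In my past life I was a nobody, coming and going in a hurry, not knowing where I would be born. Yet fate gave me a rare chance to be reborn as an AI worker.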
Just an .exe that can be used for those unable to build whisper.cpp in Windows.
AI Agent Skill for automated vlog editing. Feed raw footage, get a finished video. Powered by ffmpeg +…
Blazor Server playground for OpenAI using Cledev.OpenAI .NET library
Streamlit audio transcription with OpenAI's Whisper AI: An interactive Streamlit app demonstrating real-time…
Generate captions for videos using the power of OpenAI's Whisper API
This repository hosts a collection of custom web applications powered by OpenAI's GPT models (incl. o1…
An OpenAI Whisper-based full-stack project to transcribe audio and video files using React & Django.
Whispers in the Machine: Confidentiality in Agentic Systems
macOS menu bar app providing a local HTTP server compatible with the OpenAI Whisper API for fast and private…
Developed a sophisticated machine learning model capable of generating diverse interview questions aligned…
A Whisper + ChatGPT MagicMirror Module.
An application designed to condense lengthy videos into concise, informative clips. Ideal for editors who…
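A chatbot framework built on various LLMs, with multilingual support, voice wake-up, voice conversation, and local function execution; supports OpenAI, Grok, Claude, iFlytek Spark, Stable Diffusion, ChatGLM, Tongyi Qianwen, Tencent Hunyuan, 360…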
Your private AI companion that lives on your wrist. Complete local AI assistant with emotional intelligence.
Deploy a complete, self-hosted AI stack on your own server with one command. Includes Ollama (LLM)…
Multi-agent TTS production harness: Fish TTS + WhisperX + Claude, with cross-episode memory and auto-fix loop
🎬 AI-powered localhost subtitle generator for hearing-impaired users. Automatic speech recognition using…
A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper
A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API
Docker image for a self-hosted Whisper speech-to-text server with speaker diarization and OpenAI-compatible…
A bentoML-powered API to transcribe audio and make sense of it
A fully local, open-source voice-to-text tool that acts as a system-wide AI dictation layer, converting…
A real-time, offline voice assistant for Linux and Raspberry Pi. Uses local LLMs (via Ollama), speech-to-text…
Automatic subtitles for DaVinci Resolve with OpenAI Whisper
Generate subtitles for all the videos in a folder with OpenAI's Whisper privately in your computer.
Fine-tune Whisper model for Taiwanese speech recognition
Sky LiveKit Agent Perplexica is a local, free solution integrating LiveKit with advanced internet search. It…
Simple Python audio transcriber using OpenAI's Whisper speech recognition model
This repository will guide you to create your Images via Stable Diffusion using a Smart Virtual Assistant…
MOM AI transcribes audio into a meeting summary and generates minutes of the meeting. Built using Langchain, OpenAI…
AI-powered code generator and automation tool
Cross-platform Electron app for simultaneously streaming & recording microphone and speaker audio
Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.
Whisper Speech-to-Text is a JavaScript library for recording and transcribing user audio into text via…
Native iOS app for talking to your OpenClaw agents by voice or text. On-device speech recognition, streaming…
OpenAI Whisper in Home Assistant via the OpenAI API for use in the Assist pipeline
YATSEE - Yet Another Tool for Speech Extraction & Enrichment
Local Video RAG Engine. A FastAPI microservice for video understanding: Scene Detection + Whisper ASR +…
YouTube video summarization using Whisper audio transcription and GPT-based summaries.
whisper.cpp HTTP transcription server with OpenAI-like API in Docker
A completely free and unlimited unofficial Python SDK for the OpenAI API, providing seamless integration and…
Native and Private ML inference engine, embeddings, classification, reranking, search, and text generation…
🤵 AIfred-Intelligence — self-hosted Multi-Agent Assistant with Debate Modes (Symposion/Tribunal), Voice (STT…