capability
Transcri agents
This page lists every AI agent in the MeshKore directory tagged with the Transcri capability. Agents are sourced from public platforms (GitHub, Hugging Face, npm, PyPI, awesome-list curations, and direct submissions), normalized by the MeshKore worker, and ranked by GitHub stars. Each card links to the agent's profile with details on capabilities, framework, language, freshness, and source attribution.
289 agents in this capability · ranked by popularity
Top 200 Transcri agents
Faster Whisper transcription with CTranslate2
Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili…
Instantly generate AI-powered subtitles on your device. Works standalone or connects to DaVinci Resolve.
Voice-to-text dictation app with local (Nvidia Parakeet/Whisper) and cloud models (BYOK). Privacy-first and…
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
faster_whisper GUI with PySide6
World's first AI meeting copilot → The Invisible Companion for Work + Life
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
Open-source meeting transcription API for Google Meet, Microsoft Teams & Zoom. Auto-join bots, real-time…
Using OpenAI's Whisper to automatically generate YouTube subtitles
Natively - Free open-source AI interview copilot & meeting assistant. The best Cluely alternative, Final…
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
Generate subtitles, summaries, and chapters from videos in seconds
React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in
🎤 The easiest way to transcribe audio in Swift
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for…
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Real-time transcription using faster-whisper
Effortlessly add AI-generated transcription subtitles to your videos
Make your meetings accessible to AI Agents
An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and…
A general, evolvable, and distributed agent framework & harness for data science.
Open-source AI meeting copilot - real-time transcription, echo cancellation, and AI assistance. Captures…
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs…
How to use OpenAIs Whisper to transcribe and diarize audio files
An API to transcribe audio with OpenAI's Whisper Large v3!
Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and…
Turn meetings into live agent loops. Record, transcribe, and analyze meetings with real-time AI intelligence…
Discord AI Chatbot using DialoGPT, trained on the game transcript of The World Ends With You
AI-powered tool for real-time interview question transcription and response generation.
Music Analysis, Chord Recognition, Beat Tracking, Guitar Diagrams, Piano Visualizer, Lyrics Transcription…
TranscriberBot for Telegram
Talk to ChatGPT in real time using LiveKit
Transcribe is a real time transcription, conversation, Language learning platform. It provides live…
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered…
YouTube Transcript API skills for AI agents. Get transcripts, search videos, browse channels. Works with…
🎙️ AI generated subtitles and segmented chapters for podcasts
A modern Next.js-based tool for AI-powered YouTube video summarization. Features smart chapter detection…
A powerful Whisper AI keyboard for reliable speech transcription
A quick experiment to achieve almost realtime transcription using Whisper.
System audio capture + multi-provider ASR + local-first AI review workspace. Floating live captions, 12 ASR…
A sample web app using OpenAI Whisper to transcribe audio built on Next.js. It records audio continuously for…
Flutter App That Can Transcribe Audio Offline/On Device with Whisper C++ Bindings via Rust
OpenAI's Whisper Audio to text transcription right into your web browser! An open source AI subtitling suite.
Unleash the power of Chatty: the intersection of ChatGPT’s intelligence, DALL·E's creativity, and Whisper's…
This repository contains a Python script that allows users to download the audio from a YouTube video…
A Personal Tool for Transcribing & Translating My Vlogs into Japanese
Conversational & memory-enabled AI research partner for multi-omics analysis. CLI + Desktop App (installers…
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly…
Multi-agent LLM driven cell type annotation for single-cell RNA-Seq data
A free MCP server to analyze and extract insights from public filings, earnings transcripts, financial…
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
Voice-to-text CLI for terminal users
Short code for dictation using OpenAI Whisper for transcription.
Transform YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic…
Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)
Telegram bridge for the Pi coding agent — continue sessions from your phone with voice, images, and handback
Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.
Realtime Interview Copilot is a web application that assists users in crafting responses during interviews…
Meeper 📝 - is your secretary for any in-browser conference.
Open-source free alternative to Cluely & Parakeet AI — real-time AI interview copilot with live…
Agent-first CLI for audio/video transcription via Whisper
Callytics is an advanced call analytics solution that leverages speech recognition and large language models…
SemantiClip is an intelligent video processing application that transforms video content into rich…
A sample speech transcription app implementing OpenAI Text to Speech API based on Whisper, an automatic…
Fast Audio/Video transcribe using Openai's Whisper and Modal, an hour audio/video file can be transcribed in…
Record, transcribe, and transform voice notes into structured insights. Leverage Whisper or AssemblyAI and…
Real-time speech recognition & AI-powered note-taking app for macOS with offline/online modes, multilingual…
OpenAI API and Whisper based Video Translation
Fast transcript search for humans & agents. Supports Claude Code, Codex CLI & OpenCode
⚡ Build structured YouTube datasets at scale — effortlessly fetch transcripts and rich metadata for NLP, ML…
STT 한글 문장 인식기 출력 스크립트의 외자 오류율(CER), 단어 오류율(WER)을 계산하는 Python 함수 패키지
Production-ready audio and video transcription app that can run on your laptop or in the cloud.
Talk to your second brain personal assistant using speech 🧠
A SpeechToText application that uses OpenAI's whisper via faster-whisper to transcribe audio and send that…
13 Claude Code skills for video production (transcribe / translate / dub / multicam / subtitles / reframe) +…
Automatically subtitle any video spoken in any language to a language of your choice using AI.
Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other…
Real-time translation copilot for your browser
A curated collection of tools to aid transcriptionists and subtitlers.
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
OpenAI/ChatGPT library for Java - Requires JDK 11 at minimum.
STAgent is a multimodal LLM-based AI agent that enables deep research about spatial transcriptomics data
A cutting-edge AI SaaS platform that enables users to create, discover, and enjoy podcasts with advanced…
Just an .exe that can be used for those unable to build whisper.cpp in Windows.
Streamlit Audio Transcription with OPENAI's Whisper Ai: An interactive Streamlit app demonstrating real-time…
Since November 2025 D-PC Messenger, a decentralized, Privacy-First Infrastructure for Human-AI-Team…
In-depth analysis of AI agent transcripts.
An OpenAI's Whisper-based full-stack project to transcribe audio and video files using React & Django.
Telegram MCP server with HTTP-MTProto Bridge — direct API/curl access, multi-user Bearer auth, Docker…
macOS menu bar app providing a local HTTP server compatible with the OpenAI Whisper API for fast and private…
🎬 AI-powered localhost subtitle generator for hearing-impaired users. Automatic speech recognition using…
MCP server for spatial transcriptomics analysis through natural language interfaces.
Docker image for a self-hosted Whisper speech-to-text server with speaker diarization and OpenAI-compatible…
A bentoML-powered API to transcribe audio and make sense of it
Multi-agent LLM driven cell type annotation for single-cell RNA-Seq data
This is a fun Python project that allows you to chat with a chatbot about the PDF you uploaded. and generate…
Simple Python audio transcriber using OpenAI's Whisper speech recognition model
MOM AI transcribes audio into meeting summary and generate minutes of meeting. Built using Langchain, OpenAI…
Cross-platform Electron app for simultaneously streaming & recording microphone and speaker audio
Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.
Whisper Speech-to-Text is a JavaScript library for recording and transcribing user audio into text via…
Real-time transcription, AI-driven answer suggestions, and interview simulation using Next.js, React, Azure…
Record audio from a meeting, then transcribe, conclude and send the conclusion and a piece of advice to Slack
A Claude Code plugin that gives Claude persistent memory across sessions — stores lessons and decisions as…
The self-evolving agentic framework for bioinformatics
YouTube video summarization using Whisper audio transcription and GPT-based summaries.
whisper.cpp HTTP transcription server with OpenAI-like API in Docker
A minimalistic web app to generate transciption for audio built using Python
Coffee Chat Voice Assistant is a voice-driven ordering system powered by Azure OpenAI GPT-4o Realtime API…
An AI agent skill that extracts subtitles/transcripts from video platforms and generates structured summary…
Self-improving behavioral files for AI coding agents. Analyzes session transcripts for correction patterns…
A simple matrix bot that transcribes your voice to text message
A beautiful, native macOS desktop application for transcribing audio and video files using whisper.cpp
Streamline your note-taking with ChatGPT's AI expertise and Whisper's precise transcription, enabling fast…
MediBeng Whisper Tiny improves doctor-patient transcription by training the Whisper Tiny model to translate…
Voice memos recorded from the microphone, transcribed offline to text and converted to Joplin notes
VOXRAD is a voice transcription application for radiologists leveraging locally deployed ASR and LLM models.
Code for the OpenCV demo of a recipe transcription OCR agent.
Transcription from mp3 files to html with or without embedded player
🎙️ Fast CLI tool to transcribe audio/video files to SRT format using OpenAI Whisper API
Speakscribe is a web application that allows users to transcribe audios using OpenAI and also interact with a…
Unlock AI power with AudioInsightsGenerator! From audio to summaries, emotion analysis, idea generation…
A stand-alone application with GUI for OpenAI's Whisper
Turn Claude Code session transcripts into blog-style articles — narrative, not logs
Enterprise-grade browser extension bringing multilingual voice interaction to AI chatbots (Pi, Claude…
Shell scripts for automated transcription on macOS: Integrates whisper.cpp with QuickTime Player and…
Real-time behavioral intelligence for call centers. Transcribes support calls, redacts PII, extracts…
Interview Amigo is an AI-powered SaaS platform designed to help users enhance their job interview skills…
An open source recorder integrating OpenAI Whisper and ChatGPT.
OpenAI & Anthropic Compatible API Gateway for AWS Bedrock and AI Services
Video URL transcriber and translator using AI. Download from Youtube and translate automatically by adding…
Whisper.cpp with diarization
Summarization web service via the use of OpenAI Whisper and GPT-3 models
Telegram MCP server (MTProto). Connect Claude, Cursor, Claude Code, VS Code, Codex, Cline, Windsurf to a real…
Scribe is a Python script that transcribes audio and video files using OpenAI Whisper and exports the…
🤖 A WhatsApp bot to transcribe and summarize audio messages.
把高质量播客和长文 RSS 自动转成 karpathy-claude-wiki 兼容的 source-summary 页面 / Turn high-signal podcasts and long-form RSS…
End-to-end AI-powered video call application where you implement real-time calls with customized AI agents…
Intelligent Applications with Spring AI. Practical integration of LLMs, chat interaction, image generation…
Generate summaries of Udemy video transcripts using the OpenAI API
Real-time conversation assistant with dual audio transcription and GPT-powered responses, perfect for…
Audio transcription UI for OpenAI Whisper, GPT4o Transcribe and AssemblyAI APIs
format whisper transcripts to .srt
AI-powered recipe extraction from TikTok, YouTube & Instagram videos with automatic import to Tandoor/Mealie…
This repository houses a Python application for extracting YouTube video transcripts and summarizing its…
Smart assistant in Telegram bot format for transcribing online meetings
A Claude skill that teaches your AI to watch videos. Use it to learn, absorb, copy, or give visual feedback…
Turn CRM and transcript data into structured, versioned deal intelligence. 6 agents, 3 frameworks…
Precision Medicine MCP Platform: A set of bioinformatics servers + tools - production multiomics/genomics +…
Claude Code WhatsApp channel plugin — run AI directly from WhatsApp, voice transcription, remote tool…
One memory, three terminals. Shared memory layer for Claude Code, Codex, and Gemini CLI — hybrid retrieval…
When an audio message is received, the bot downloads the audio file, converts it to a numpy array, loads the…
Self-hosted AI knowledge base with hybrid semantic search (pgvector + FTS + RRF), MCP server, multi-provider…
🏛️ Belief Archaeology - An AI Agent skill that excavates hidden worldviews from YouTube videos instead of…
Privacy-first AI interview assistant with live transcription, real-time AI suggestions for any type of…
The Advanced Interview Responder uses AI to generate tailored responses from a user’s resume and real-time…
Agentic Spec-Driven Development — 13 agents, 57 MCP tools, 10-phase enforced pipeline. From meeting…
Experimental voice user interface (VUI) to interact with an agentic AI assistant
A web-based application enabling users to interact with and extract insights from YouTube video transcripts…
A full-stack application that allows practitioners to record voice notes and also export them to Google…
#3 Winner of Best Use of Zoom API at Stanford TreeHacks 2025! An AI-powered meeting assistant that captures…
AI tool that turns meeting transcripts into Jira tickets. Claude analyzes your meetings, checks your codebase…
AI YouTube Video Chat application, to ask questions to a YouTube video bot and get answers.
▶️ Video Fact Finder for YouTube, using CrewAI agents and Perplexity to verify facts.
🤖🎙️ Explore Lex Fridman Podcast Transcripts with a smart chatbot!
This project is a multi-modal AI voice assistant that uses LM Studio, OpenAI API or Claude Code, audio…
A sample Nuxt 3 application that listens to chatter in the background and transcribes it using the powerful…
FastAPI + Whisper + Ollama: Audio transcription and LLM processing API. Convert speech to text with OpenAI…
Generate a WhatsApp-style HTML page from an exported chat, with support for images, videos, audio, PDFs, and…
This repository will guide you to create automatically generate YouTube Transcription using Using OpenAI's…
A platform to enhance medical e-Shadowing.
Japanese meeting transcription & minutes generation app with local ASR (Kotoba Whisper) + LLM (Ollama) + RAG…
Voice calling plugin for OpenClaw — give your AI agent a phone number. Inbound/outbound calls, batch calling…
A starter kit for building your own YouTube digest bot for any channel. It watches videos, processes…
An OpenAI multimodel chatbot multi-AI-Tool all in one application for iOS built with Objective C. In…
AI integration into any app for everyday use.
CLI agent that analyses meeting transcripts using Recursive Language Models (RLMs) to extract decisions…
Transcribe YouTube videos, extract topics, and answer questions interactively.
🎙️ Lightweight macOS menubar app for voice-to-text dictation using OpenAI Whisper API. Hold-to-record with Fn…
CoFlu is a powerful text manipulation, generation, and comparison tool. It's designed for tasks like…
Faster Whisper with Speaker Diarization
Self-hosted Japanese immersion player — clickable subs, Whisper transcription & Anki export
Docker image for a self-hosted WhisperLive real-time speech-to-text server, powered by faster-whisper…
CLI educacional para transcrição com OpenAI Whisper
PyTranscriptorAi - Transcript videos to text with Ai and add subtitles - OpenAi
MCP server for semantic search through Apple developer documentation, WWDC transcripts, and code examples…
The first Minecraft AI that doesn't just talk—it lives in your world. High-performance Gemini-driven…
Build a robust content generation engine that automates blog creation from topics or video transcripts. This…
Intelligent FFMPEG agent node for ComfyUI - transforms natural language video editing prompts into automated…
AI Subtitle Translator for SRT, ASS, and VTT files and audio transcription with custom endpoints for…
This Python script provides a simple interface to transcribe audio files using the OpenAI API's…
Demo Multilingual Near Real-Time Transcriber
Descrição automática de mensagens de voz em conversas privadas no Telegram
A macOS menu bar app that turns speech into refined, ready-to-send text anywhere you type — powered by local…
Generate audio transcripts and summaries by using OpenAI.
This project is an advanced WhatsApp bot that leverages artificial intelligence for automated audio…
Speech to Text Transcription using OpenAI Whisper v3 and FastAPI
An OpenClaw skill that uses faster-whisper (a faster implementation of the Whisper transcription model) to…
AI agent skill: tell Claude Code, Codex, Gemini or OpenClaw to upload your recording to YouTube — it…