TTS agents

This page lists every AI agent in the MeshKore directory tagged with the TTS (text-to-speech) capability. Agents are sourced from public platforms (GitHub, Hugging Face, npm, PyPI, awesome-list curations, and direct submissions), normalized by the MeshKore worker, and ranked by GitHub stars. Each card links to the agent's profile, which details its capabilities, framework, language, freshness, and source attribution.

349 agents in this capability · ranked by popularity

Top 200 TTS agents

ChatTTS39,328 ★

A generative speech model for daily dialogue.

CosyVoice21,264 ★

Multilingual large voice generation model with full-stack support for inference, training, and deployment.

moonshine8,268 ★

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and…

Vision-Agents7,849 ★

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses…

wukong-robot7,119 ★

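🤖 wukong-robot is a simple, flexible, and elegant Chinese voice-dialogue robot / smart speaker project. It supports multi-turn ChatGPT conversations and may be the first open-source smart speaker project to support brain-computer interaction.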

Streamer-Sales3,695 ★

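Streamer-Sales (Sales Champion): a livestream-seller LLM 🛒🎁 that presents products based on their given features, with the aim of sparking viewers' desire to buy. 🚀⭐ Includes a detailed data-generation pipeline❗ 📦 Also integrates LMDeploy…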

polyglot2,590 ★

🤖️ Cross-platform AI language practice app

comfyui_LLM_party2,258 ★

LLM Agent Framework in ComfyUI includes MCP server, Omost, GPT-sovits, ChatTTS, GOT-OCR2.0, and FLUX prompt…

epub_to_audiobook1,984 ★

EPUB to audiobook converter, optimized for Audiobookshelf, WebUI included

scaphandre1,941 ★

⚡ Energy consumption metrology agent. Let "scaph" dive and bring back the metrics that will help you make…

Dot1,911 ★

Text-To-Speech, RAG, and LLMs. All local!

openai-edge-tts1,899 ★

Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs

ElatoAI1,756 ★

Realtime Voice AI with 100+ Models on Arduino ESP32 with Secure Websockets and Edge Functions for AI Toys…

bailing1,701 ★

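Bailing (百聆) is a GPT-4o-like voice dialogue robot built from ASR + LLM + TTS. It integrates strong large models such as DeepSeek R1, connects to openClaw, and serves as a true personal voice assistant, with latency as low as 800 ms. It runs on low-spec machines such as a Mac and supports interruption.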

ava-whatsapp-agent-course1,661 ★

Meet Ava, the WhatsApp Agent

RCLI1,514 ★

Talk to your Mac, query your docs, no cloud required. On-device voice AI + RAG

langchain4j-aideepin1,288 ★

AI-based productivity tools (chat, drawing, knowledge base, workflows, MCP service marketplace, voice input and output, long-term memory)

handcrafted-persona-engine1,276 ★

An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming…

vllm-mlx1,256 ★

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama…

voicemode1,195 ★

Natural voice conversations with Claude Code

AI-Waifu-Vtuber1,078 ★

AI Vtuber for Streaming on Youtube/Twitch

lobe-vidol954 ★

🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne

esp-ai831 ★

The simplest and lowest-cost AI integration solution. If you like this project, please give it a Star~ |…

gitpodcast807 ★

Convert any git repository into an engaging podcast

ttsfm727 ★

TTSFM mirrors OpenAI's TTS service, providing a compatible interface for text-to-speech conversion with…

LocalAIVoiceChat721 ★

Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for…

mlx-omni-server718 ★

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple…

whisper_android660 ★

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

voice-ai646 ★

Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational…

chatterbox-tts-api600 ★

Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned…

UnrealGenAISupport592 ★

Unreal Engine plugin for LLM/GenAI models & MCP UE5 server. OpenAI GPT-5, Deepseek R1, Claude Opus/Sonnet…

alan-sdk-reactnative579 ★

The Self-Coding System for Your App — Alan AI SDK for React Native

echokit_server565 ★

Open Source Voice Agent Platform

JARVIS523 ★

Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface

Facemoji454 ★

😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek…

JARVIS-ChatGPT450 ★

A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM…

OpenArc445 ★

Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over…

-Prototype-AIVTuber440 ★

An open-source Artificial Intelligence Virtual YouTuber (AI VTuber). This project is deprecated.

bolna431 ★

End-to-end platform for building voice first multimodal agents

alan-sdk-pcf428 ★

The Self-Coding System for Your App — Alan AI SDK for Power Apps

AGI-Papers407 ★

A curated archive of breakthroughs in Agents, Architecture, Training, RAG, and On-Device AI.

GPTPortal397 ★

A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight…

Stream-Omni386 ★

Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across…

VectorDB-Plugin365 ★

Program that lets you ask questions about your documents, audio, and video files.

LiveWhisper360 ★

A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

jarvis318 ★

Jarvis is a voice-activated, conversational AI assistant powered by a local LLM (Qwen via Ollama). It listens…

ChatGPT-OpenAI-Smart-Speaker312 ★

This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice…

gpt-voice-conversation-chatbot302 ★

Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while…

ai-devices295 ★

AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more

tetos279 ★

A unified interface for multiple Text-to-Speech (TTS) providers.

safestclaw276 ★

Safestclaw is the alternative to openclaw. You can naturally chat with it via text and voice, and you can…

gemini-youtube-automation275 ★

A fully autonomous AI Agent/Python pipeline that utilizes Large Language Models (LLMs) like Gemini to…

voiceai274 ★

Set of 📝 with 🔗 to help those building Voice AI agents 🎙️🤖

MITSUHA274 ★

World's First Multilingual Inexpensive Therapeutic Sophisticated Ultra-responsive Holographic Agent. In…

kitt268 ★

Talk to ChatGPT in real time using LiveKit

skills266 ★

Collections of skills for building with ElevenLabs

twelvet257 ★

(Spring Boot 3.X microservices framework) Based on Spring Boot 3.X with Spring Cloud Alibaba / Spring Cloud Tencent +…

gpt_server253 ★

gpt_server is an open-source framework for production-grade deployment of LLMs, Embedding, Reranker, ASR, TTS, text-to-image, image editing, and text-to-video models.

sapphire252 ★

She's the AI agent you come home to.

M.I.L.E.S232 ★

M.I.L.E.S, a GPT-4-Turbo voice assistant, self-adapts its prompts and AI model, can play any Spotify song…

bella-openapi232 ★

Bella…

insights-lm-local-package214 ★

Open-source, fully private and local alternative to NotebookLM. Chat with your documents, generate audio…

openai_tts193 ★

Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible endpoint to…

uxie189 ★

PDF reader app with note-taking, annotations, collaboration, AI features (chat, flashcards generation w…

zai-tts189 ★

🗣️ ZAI/GLM TTS to OpenAI Speech API: a free speech synthesis API with voice cloning support, based on Zhipu TTS

wyoming_openai186 ★

OpenAI-Compatible Proxy Middleware for the Wyoming Protocol

BlahST174 ★

Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp OFFLINE. Speak…

Pilipili-AutoVideo173 ★

🎬 Fully Automated AI Video Agent · Generates a finished, subtitled video from a single sentence · Local Deployment

kkclaw165 ★

🦞 A cute desktop lobster AI assistant - Desktop lobster pet with OpenClaw AI, Edge TTS voice, and emotion animations

aidialer160 ★

A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching…

Unitale158 ★

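An AI audiobook production tool based on Indextts and Qwen3TTS. Uses an LLM to automatically break down scripts and recognize emotions, and integrates multi-character TTS…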

OpenGuider156 ★

An AI companion that lives on your desktop - OpenGuider watches your screen, listens to your voice, and…

autoshorts149 ★

Automatically generate viral-ready vertical short clips from long-form gameplay footage using AI-powered…

podcast-llm146 ★

Automatically generate engaging AI podcasts from nothing but an episode title.

AgentVibes144 ★

🎭 TTS for Claude Code

AutoTTS144 ★

The official repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling"

portable-hermes-agent141 ★

Hermes Agent made portable desktop for Windows — 100 tools, GUI, local models via LM Studio, TTS, Music…

NachoBot133 ★

A multifunctional goofy bot built by modifying the Maibot core

PersonalAssistantChatbot133 ★

A personal assistant chatbot capable of performing many of the same tasks as Google Assistant, plus more extra…

axiom-voice-agent131 ★

Run a <400ms latency Voice Agent on just 4GB VRAM. Fully offline, no API keys required. Optimized for GTX…

skills129 ★

AAHL's Agent Skills. A collection of practical agent skills, covering Home Assistant smart-home control, Microsoft Edge…

chatgpt-web128 ★

ChatGPT web application with support for multiple conversations, a large prompt library, PWA, ASR, and TTS

vibeframe121 ★

AI-Native Video Editor — CLI-first, MCP-ready. Generate, edit, and ship videos from your terminal.

OpenToys118 ★

Make Local AI Toys, Robots, and Devices with a MacBook and an Arduino ESP32

openclaw-voice116 ★

🦞 Open-source browser-based voice chat for AI assistants. Self-hosted, private, free. Whisper STT +…

workersai113 ★

Full-stack AI chat platform built on Cloudflare using Workers, Durable Objects, KV, and AI Gateway. Features…

JARVIS-AI-ASSISTANT103 ★

A true Artificial Intelligent Assistant with ALICE as backend and offline speech recognition with vosk engine…

gptspeaker100 ★

The ChatGPT/DeepSeek Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with…

speech-rest-api99 ★

Transcription and TTS Rest API (OpenAI Whisper, Speechbrain)

XnneHangLab96 ★

Hoping to paint a heart for waifus with code.

unspeech95 ★

🗣️🔊 Your Text-to-Speech Services, All-in-One.

open-audio95 ★

Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate…

shellChatGPT92 ★

Shell wrapper for OpenAI's ChatGPT, Whisper, and TTS. Features LocalAI, Ollama, Gemini, Anthropic, and more.

avr-infra92 ★

The AVR Infrastructure project is designed to launch the Agent Voice Response application, which will start…

agentcall90 ★

AgentCall lets AI Agents join meetings with voice, video & screen-share to build together. Supports Google…

achatbot89 ★

An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and…

Talk2GPT89 ★

GPT-3 client for Windows and Unix with memory management that supports both text and speech in any…

OpenAI-Text-To-Speech-for-Unity85 ★

Implementation of OpenAI's Text-To-Speech in Unity. Synthesize any text and play it via any AudioSource.

MAS-TTS82 ★

Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning (NeurIPS2025-SEA)

langchain-telegram-gpt-chatbot81 ★

An AI-powered chatbot integrated with Telegram, using OpenAI GPT-3.5 Turbo, language embeddings, and FAISS…

Chatbot80 ★

Hybrid Conversational Bot based on both neural retrieval and neural generative mechanism with TTS.

google-trends80 ★

Trending-keyword tracking agent 💡, integrating multiple tools, TTS, ASR, and the HeyGem API

alts79 ★

100% free, local & offline voice assistant with speech recognition

leopard-chat-ui-teneo72 ★

Leopard Chat UI - A Teneo Chat Client based on Vue and Vuetify

omnigram71 ★

Omnigram is a Flutter-based file reader and audiobook. It accommodates EPUB and PDF and offers audiobook…

keepyourmouthshut71 ★

Acid Reflux for your Ears!

art-voice-agent-accelerator70 ★

Build, test, and ship omnichannel voice agents on Azure—ACS telephony, custom STT→LLM→TTS pipeline, Voice…

voiceblender69 ★

A programmable voice platform: SIP and WebRTC call control, multi-party mixing, recording, TTS/STT, and…

quivr-whisper69 ★

Talk to your second brain personal assistant using speech 🧠

echook65 ★

🔊 echook — AI-operated audio notifications for Claude Code, Cursor IDE & Codex CLI — 26 hooks, voice + chime…

lobe-chat61 ★

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3…

HanaVerse60 ★

HanaVerse is an interactive web UI for chatting with ollama with a lively 2D anime character Hana. Star it on…

OpenAI-TTS-Gradio59 ★

Use OpenAI TTS(Text to Speech) API with Gradio

AgentOS2-Live58 ★

AgentOS2-Live by OrionStar — an end-to-end real-time voice interaction solution based on the Realtime API. No…

become-ceo57 ★

Your AI executive team on Discord. 7 specialized agents — Engineering, Finance, Marketing, DevOps, Legal…

ClawCode56 ★

Persistent agents for Claude Code as a plugin, not a harness. Memory, personality, messaging across WhatsApp…

gpt_chatbot55 ★

This chatbot lets you use your microphone to communicate with GPT-4. It uses the OpenAI text to speech to…

MeuxCompanion55 ★

A self-hosted AI companion web app with anime-style Live2D and VRM characters. Talk with your companion via…

narrator-ai-cli54 ★

AI Narration Master: CLI client for calling the video narration generation API from the terminal

Twitch-Streamer-GPT54 ★

Twitch Streamer GPT is a NodeJS-based Twitch enhancement tool, offering interactive stream experiences with…

Voice-Chat-Bot52 ★

Real-time AI ChatBot and voice-enabled AI VoiceBot using Deepgram (STT ↔ TTS) and Groq LLM for natural…

OpenVoiceUI51 ★

Voice-powered AI assistant platform — connect any LLM, any TTS, with a live web canvas, music generation, and…

nabu50 ★

A multi engine TTS & LLM edge computing playground with audio book features and more!

voice-assistant50 ★

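Reborn as an AI worker. In my previous life I was a nobody, coming and going in a hurry, not knowing where I would be born. Yet fate gave me a rare opportunity and let me be reborn as an AI worker.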

DigitalLife49 ★

A "digital life" with long-term memory and a Live2D avatar

local-livekit-plugins49 ★

Local STT/TTS plugins for LiveKit Agents - no cloud APIs required

cadis48 ★

Rust-first, local-first multi-agent runtime with a desktop HUD, policy-gated tools, voice, and…

kuon47 ★

KUON (久远): an in-development LLM voice assistant, currently focused on ease of use and a simple start, with support for selective conversation memory and Model Context Protocol (MCP) services.

stimm46 ★

The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI pipelines for real-time conversations…

live-interview44 ★

Chatbot with a 3D avatar that can answer interview questions on your behalf. It can speak and understand…

Python-Voice-Assistant43 ★

A Python based Voice Assistant like Siri

MMM-WhisperGPT43 ★

A Whisper + ChatGPT MagicMirror Module.

Nova2 42 ★

An AI assistant building SDK in python

openrouter-mcp-multimodal41 ★

MCP server for OpenRouter — chat with 300+ LLMs (Claude, Gemini, GPT), analyze images / audio / video…

styletts2-ukrainian-openai-tts-api41 ★

OpenAI TTS Compatible Ukrainian TTS StyleTTS2 Pipeline

docker-ai-stack41 ★

Deploy a complete, self-hosted AI stack on your own server with one command. Includes Ollama (LLM)…

tts-agent-harness41 ★

Multi-agent TTS production harness: Fish TTS + WhisperX + Claude, with cross-episode memory and auto-fix loop

SalesGPT---Twilio40 ★

A modified version of SalesGPT with the addition of TTS, STT, and Twilio to make calls. A Context-aware AI…

Live2D-LLM-Chat39 ★

Live2D + ASR + LLM + TTS → Real-time communication + Offline Deployment/Cloud Inference

DeepCo39 ★

A Chat Client for LLMs, written in Compose Multiplatform.

pdf-to-audiobook38 ★

Uses OpenAI API to clean pdf then converts it to professional grade audiobook with text to speech.

JavaAI37 ★

Lightweight Java library to interact with the OpenAI API (GPT, DALL-E, TTS, etc.)

Free-Unoffical-OpenAI-API37 ★

A powerful, unofficial OpenAI-compatible API service offering free access to GPT-4o, GPT-4-turbo, and audio…

Daisy-openAI-chat37 ★

Python platform for working with LLMs

sky-livekit-agent-perplexica37 ★

Sky LiveKit Agent Perplexica is a local, free solution integrating LiveKit with advanced internet search. It…

langchain-voice-agent-node35 ★

Langchain Voice Agent with Inworld TTS

ProductVideoCreator35 ★

AI Agent Skills toolkit for automated product introduction video generation with Remotion, Playwright, and…

azure-avatar-demo34 ★

Text To Speech Demo in ReactJS Application using Azure Avatar AI Service.

modelship34 ★

Self-hosted, multi-model AI inference server. Run LLMs, TTS, STT, embeddings, and image generation with an…

svelte-vrm-live33 ★

Threlte Live – A SvelteKit + Three.js platform for live-streaming 3D VRM avatars. Features real-time chat…

Youtube-Shorts-Generator33 ★

Harness OpenAI's power to effortlessly create YouTube Shorts with this project. Includes tools for generating…

OpenAI-GPT-4o-Mini-TTS-Home-Assistant-Integration32 ★

OpenAI GPT-4o Mini TTS – Home Assistant Integration

openai-unofficial32 ★

A completely Free & Unlimited unofficial Python SDK for the OpenAI API, providing seamless integration and…

mmx-mcp-server32 ★

A unified Model Context Protocol server for MiniMax CLI (mmx)

awesome-voice-agents32 ★

A curated list of voice AI agent frameworks, tools, resources, and best practices

voicedoc-agent31 ★

🎙️ Voice-native document intelligence using Gemini, ElevenLabs STT/TTS, and Datadog observability — turning…

OpenAI-Realtime-API-for-Unity31 ★

Implementation of OpenAI's Realtime API in Unity. Easily integrate low-latency, multi-modal conversations via…

WhisCall31 ★

A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo, Virtual Audio Cable, and the…

p8hub29 ★

Private AI Hub (P8Hub) - Host and use your own AI Services. Keep everything simple and private.

opensource-voice-tools28 ★

A repo listing known open source voice tools, ordered by where they sit in the voice stack

xiaoclaw28 ★

Local AI Agent firmware running on ESP32-S3, integrating offline voice wake-up with cloud TTS, supporting…

PyWaifu27 ★

PyWaifu is an Offline, all-in-one pipeline designed to facilitate seamless interactions with virtual anime…

livekit-voice-ai-agent-setup27 ★

A guide to building your own AI-powered voice agent with LiveKit and Twilio

multivoice27 ★

Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our…

salmalm27 ★

🧠 Personal AI Gateway — Single-file Python AI agent with multi-LLM, tools, vision, TTS, encrypted vault. Your…

monika26 ★

Monika is an AI assistant that combines speech-to-text, natural language processing, and text-to-speech…

ai-avatar26 ★

Zippy Talking Avatar uses Azure Cognitive Services and OpenAI API to generate text and speech. It is built…

cyrano26 ★

Your AI assistant, stt -> llm agent -> tts, full api, can run on raspberry pi

dria-livekit-agent-deep-research25 ★

DRIA (Deep Research and Intelligence Agent) is a fully local voice assistant that can hold real-time…

live-translator25 ★

Real-time system audio translation for macOS — translate any audio (YouTube, podcasts, meetings) live on…

avr-app25 ★

Design, train, and orchestrate AI voice agents in a single dashboard, then connect them to your preferred…

clopinette-ai24 ★

Cloudflare-native AI agent — 13 tools, codemode, 5-layer memory, self-learning, multimodal I/O. Telegram…

xiaolong-openclaw24 ★

Chinese voice assistant | Wake word + ASR + OpenClaw Agent + TTS | Offline wake-up, streaming voice interaction, tool calling, Skills extensions

ai-agent24 ★

Build realtime AI interviewer voice agent that joins meetings. It demonstrates integrating Deepgram (STT)…

claude-tts24 ★

Text-to-speech plugin for Claude Code — multi-provider support (ElevenLabs, OpenAI, Google, Amazon Polly…

localspeech-AI24 ★

A one command Voice AI deployment script for MacOS. Supports Sesame, Kokoro, Spark, Zonos and Whisper…

RasaChatbot-with-ASR-and-TTS23 ★

This repository contains an attempt to incorporate Rasa Chatbot with state-of-the-art ASR (Automatic Speech…

Virtual-Voice-Assistant23 ★

Model uses Whisper, ChatGPT, and gTTS

tts-studio23 ★

Text to Speech Studio to convert text into natural-sounding speech using advanced AI models from leading…

Eolian23 ★

Eolian is a Discord music bot that provides a very powerful API for queuing songs from a variety of sources…

chatgpt-auto-talk23 ★

📣 Auto-plays ChatGPT responses

LLM-VoIP-Caller23 ★

This project is the backend engine for a fully autonomous AI-powered call center. It integrates a large…

EmaAgent-Python-Prototype23 ★

Continuously updated

a-Realtime-Voice-to-Voice-Agentic-RAG-Application-using-LiveKit-and-Redis22 ★

A Voice-to-Voice AI Agent that lets you naturally talk to documents in real time. Powered by LiveKit's…

AIPE21 ★

AIPE (AI Pipeline Engine) is a flexible and powerful tool for creating and executing complex AI workflows

chatGPTVoiceDiscBot21 ★

Discord bot that uses OpenAI ChatGPT under the hood. Prompts and answers by voice (with gTTS)

desktop4mistral20 ★

A desktop client with MCP support for Mistral LLMs

AmaiGirl20 ★

A native AI desktop assistant with a cross-platform vision, supporting Live2D character interaction and OpenAI-compatible chat/TTS APIs.

LiveKit-Outbound-Caller-Voice-Agent19 ★

Outbound PSTN calling agent using LiveKit SIP trunks with a voice pipeline (Silero VAD, Deepgram STT, OpenAI…

raguelike19 ★

A project combining roguelike with LLMs, RAG, Text2Speech, and Speech2Text

XyvaClaw18 ★

Self-evolving AI assistant platform with 42+ skills, 5-level model fallback, lossless context, deep…

Speak-Turbo18 ★

Ultra-fast local TTS for AI agents. ~90ms to first sound.

GLadOS-Voice-Assistant18 ★

GLaDOS Terminal-based AI Assistant

Voice-to-text-and-voice-chatbot18 ★

Voice-to-Voice Chatbot using Whisper, LLaMA, and Groq API

ai-conversation18 ★

🤖 AI Conversation Agent for Home Assistant. Compatible with any OpenAI format LLM providers, supports STT/TTS

GPTAssistant-ElevenLabs18 ★

OpenAI-Assistant API integration with Speech Recognition and Eleven Labs TTS. User can choose name…

simple-openai-tts-playground18 ★

Try out the OpenAI Text to Speech API in your browser.

SmartCall-Agent17 ★

🐙 AI Agent System with RAG and outbound calling through LiveKit, OpenAI Realtime for voice chats; integrates…

IntroventsEnglishCorner17 ★

A spoken English education chatbot based on ChatGPT/Whisper and gTTS. An English corner for the socially anxious.

voice-chat-ai-configurable-agent17 ★

Voice AI agent that can make tool calls with Composio integrations
