AI Skills for Whisper

Discover 112+ speech-to-text skills

Browse AI Skills for Whisper

sickn33 / audio-transcriber

39.2K

Automates audio-to-text transcription, generating professional Markdown documentation and summaries for meetings and lectures.

claude, copilot
83
69

sickn33 / daily

39.2K

Provides a comprehensive reference for building real-time voice and multimodal AI applications using Daily, enabling seamless integration of AI services.

openclaw
75
100

aiskillstore / video-processor

345

Processes video files with audio extraction, format conversion, and transcription using FFmpeg and OpenAI's Whisper model.

openclaw
83
100

InternLM / eachlabs-voice-audio

388

Facilitates text-to-speech, speech-to-text, and voice conversion using EachLabs AI models for enhanced audio processing.

openclaw
75
53

Dokhacgiakhoa / voice-ai-engine-development

444

Architects real-time Voice AI agents with low-latency communication, utilizing advanced speech processing and AI technologies.

openclaw
67
100

nicepkg / transcribe-and-analyze

193

Transcribes audio and video from URLs using WhisperKit and analyzes transcripts with AI upon request.

openclaw
92
94

alsk1992 / voice

323

Enables voice recognition and control for trading applications, enhancing user interaction through wake words and speech commands.

openclaw
75
70

pedronauck / promo-video

394

Creates professional promotional videos with AI voiceover and music using Remotion, enhancing visual marketing efforts.

openclaw
67
94

Microck / Video Processor

224

Processes video files with audio extraction, format conversion, and transcription using FFmpeg and OpenAI's Whisper model.

openclaw
83
100

aiskillstore / audio-transcriber

345

Transforms audio recordings into structured Markdown documentation with intelligent summaries and speaker identification.

github-copilot, claude-code
67
69

majiayu000 / audio-transcribe

106

Transcribes audio and video to text using Whisper, supporting word-level timestamps for accurate subtitle generation.

openclaw
83
100

majiayu000 / gastrohem-media-processor

106

Automates the processing of audio and image files from WhatsApp, providing transcription and OCR capabilities for efficient media management.

openclaw
83
100

damionrashford / media-whisper

7

Generates accurate speech-to-text transcriptions and subtitles from audio/video, supporting multilingual translation and timestamps.

openclaw
100
95

damionrashford / workflow-audio-production

7

Facilitates comprehensive audio routing and processing across multiple platforms, integrating advanced AI features for enhanced production quality.

openclaw
100
100

damionrashford / workflow-podcast-pipeline

7

Transforms raw podcast recordings into polished, compliant episodes with features like AI denoise, chapter tagging, and multi-language subtitles.

openclaw
100
100

GeorgeDoors888 / bilibili-transcript

2

Transcribes Bilibili videos to text with high accuracy, providing detailed summaries and formatted transcripts in multiple languages.

100
97

kbarbel640-del / loom-workflow

1

Analyzes Loom recordings to create structured, automatable workflows, enhancing business process understanding and efficiency.

openclaw
100
98

meghal86 / loom-workflow

Analyzes Loom recordings to create structured, automatable workflows, enhancing business process understanding and efficiency.

openclaw
100
98

majiayu000 / create-movie

106

Facilitates comprehensive movie creation through a structured workflow, utilizing AI tools for research, scripting, and assembly.

openclaw
75
100

ComeOnOliver / automate-this

47

Analyzes screen recordings to create automation scripts, extracting frames and audio to reconstruct workflows and suggest automation solutions.

openclaw
83
80