AI Skills for Whisper
Discover 112+ Speech-to-text
Browse AI Skills for Whisper
sickn33 / audio-transcriber
Automates audio-to-text transcription, generating professional Markdown documentation and summaries for meetings and lectures.
sickn33 / daily
Provides a comprehensive reference for building real-time voice and multimodal AI applications using Daily, enabling seamless integration of AI services.
aiskillstore / video-processor
Processes video files with audio extraction, format conversion, and transcription using FFmpeg and OpenAI's Whisper model.
InternLM / eachlabs-voice-audio
Facilitates text-to-speech, speech-to-text, and voice conversion using EachLabs AI models for enhanced audio processing.
Dokhacgiakhoa / voice-ai-engine-development
Architects real-time Voice AI agents with low-latency communication, utilizing advanced speech processing and AI technologies.
nicepkg / transcribe-and-analyze
Transcribes audio and video from URLs using WhisperKit and analyzes transcripts with AI upon request.
alsk1992 / voice
Enables voice recognition and control for trading applications, enhancing user interaction through wake words and speech commands.
pedronauck / promo-video
Creates professional promotional videos with AI voiceover and music using Remotion, enhancing visual marketing efforts.
Microck / Video Processor
Processes video files with audio extraction, format conversion, and transcription using FFmpeg and OpenAI's Whisper model.
aiskillstore / audio-transcriber
Transforms audio recordings into structured Markdown documentation with intelligent summaries and speaker identification.
majiayu000 / audio-transcribe
Transcribes audio and video to text using Whisper, supporting word-level timestamps for accurate subtitle generation.
majiayu000 / gastrohem-media-processor
Automates the processing of audio and image files from WhatsApp, providing transcription and OCR capabilities for efficient media management.
damionrashford / media-whisper
Generates accurate speech-to-text transcriptions and subtitles from audio/video, supporting multilingual translation and timestamps.
damionrashford / workflow-audio-production
Facilitates comprehensive audio routing and processing across multiple platforms, integrating advanced AI features for enhanced production quality.
damionrashford / workflow-podcast-pipeline
Transforms raw podcast recordings into polished, compliant episodes with features like AI denoise, chapter tagging, and multi-language subtitles.
GeorgeDoors888 / bilibili-transcript
Transcribes Bilibili videos to text with high accuracy, providing detailed summaries and formatted transcripts in multiple languages.
kbarbel640-del / loom-workflow
Analyzes Loom recordings to create structured, automatable workflows, enhancing business process understanding and efficiency.
meghal86 / loom-workflow
Analyzes Loom recordings to create structured, automatable workflows, enhancing business process understanding and efficiency.
majiayu000 / create-movie
Facilitates comprehensive movie creation through a structured workflow, utilizing AI tools for research, scripting, and assembly.
ComeOnOliver / automate-this
Analyzes screen recordings to create automation scripts, extracting frames and audio to reconstruct workflows and suggest automation solutions.