Build an AI Podcast Clipping SaaS: Python, Next.js, AWS, Stripe, Tailwind, TS, Modal, Inngest
DevBlog
Feb 28, 2026 · 1 min read · 32 views
You'll build a SaaS application that converts full podcasts into viral short-form clips ready for YouTube Shorts or TikTok. The tool uses several AI models to transcribe the video, automatically detect the most engaging moments in the podcast, and create clips cropped to the active speaker's face. You'll learn how to build a complete, production-ready SaaS with user authentication, a credit-based payment system using Stripe, and background processing queues to handle user load. All services used in this project are free, so you won't have to pay anything to follow along. We'll use technologies such as Next.js 15, React, TypeScript, Tailwind CSS, Shadcn UI, Auth.js, Python, FastAPI, Stripe, Modal, Inngest, S3 on AWS, and more.
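To make the credit-based system a bit more concrete, here is a minimal in-memory sketch of how credits could be added and spent. The names `CreditAccount`, `add_pack`, `spend`, and `InsufficientCredits` are hypothetical and not taken from the project's codebase. In a real app the balance would presumably live in a database and be topped up only after a confirmed Stripe payment.

```python
from dataclasses import dataclass


class InsufficientCredits(Exception):
    """Raised when a user tries to spend more credits than they hold."""


@dataclass
class CreditAccount:
    # Hypothetical model: one credit balance per user.
    user_id: str
    credits: int = 0

    def add_pack(self, pack_credits: int) -> None:
        """Add a purchased credit pack to the balance."""
        if pack_credits <= 0:
            raise ValueError("pack_credits must be positive")
        self.credits += pack_credits

    def spend(self, cost: int) -> None:
        """Deduct credits for a processing job, refusing to go negative."""
        if cost <= 0:
            raise ValueError("cost must be positive")
        if cost > self.credits:
            raise InsufficientCredits(
                f"user {self.user_id} has {self.credits} credits, needs {cost}"
            )
        self.credits -= cost


account = CreditAccount(user_id="demo-user")
account.add_pack(50)   # e.g. after a credit pack purchase
account.spend(2)       # e.g. before queuing a podcast for clipping
```

Checking the balance before a job is queued means a user without credits is rejected before any GPU time is spent on their upload.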
Features:
- Auto-detection of viral moments in podcasts (stories, questions, etc.)
- Automatically added subtitles on clips
- Transcription with m-bain/whisperX
- Active speaker detection for video cropping with Junhua-Liao/LR-ASD
- Clips optimized for vertical platforms (TikTok, YouTube Shorts)
- GPU-accelerated video rendering with FFMPEGCV
- LLM-powered viral moment identification with Gemini 2.5 Pro
- Queue system with Inngest for handling user load
- Credit-based system
- Stripe integration for credit pack purchases
- User authentication system
- Responsive Next.js web interface
- Dashboard to upload podcasts and see clips
- Inngest for handling long-running processes
- Serverless GPU processing with Modal
- FastAPI endpoint for podcast processing
- Modern UI with Tailwind CSS & Shadcn UI
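As an illustration of cropping to the active speaker for vertical platforms, the sketch below computes a 9:16 crop window centered on a face. It assumes the horizontal face center (`face_cx`) has already been found by a speaker-detection step such as LR-ASD. The function name `vertical_crop_window` and its parameters are hypothetical, and the project's actual cropping logic may differ.

```python
def vertical_crop_window(frame_w, frame_h, face_cx, aspect_w=9, aspect_h=16):
    """Return (x, y, w, h) of a crop with the given aspect ratio.

    The crop uses the full frame height when possible, is centered
    horizontally on face_cx, and is clamped so it stays inside the frame.
    """
    crop_h = frame_h
    crop_w = int(frame_h * aspect_w / aspect_h)
    if crop_w > frame_w:
        # Frame is already narrower than the target: use the full width instead.
        crop_w = frame_w
        crop_h = int(frame_w * aspect_h / aspect_w)

    # Many video encoders expect even dimensions, so round down to even.
    crop_w -= crop_w % 2
    crop_h -= crop_h % 2

    # Center on the face, then clamp to the frame edges.
    x = int(round(face_cx - crop_w / 2))
    x = max(0, min(x, frame_w - crop_w))
    y = (frame_h - crop_h) // 2
    return x, y, crop_w, crop_h


# A 1920x1080 frame with the speaker's face in the horizontal center.
window = vertical_crop_window(1920, 1080, face_cx=960)
```

Clamping keeps the window inside the frame when the speaker sits near an edge, at the cost of the face being slightly off-center in those clips.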
Codebase: https://www.devblog.blog/store/ai-podcast-clipping-saas-codebase-mm5o3m7e