Build an AI Podcast Clipping SaaS: Python, Next.js, AWS, Stripe, Tailwind, TS, Modal, Inngest

DevBlog

Feb 28, 2026 ยท 1 min read ยท 32 views

 Build an AI Podcast Clipping SaaS: Python, Next.js, AWS, Stripe, Tailwind, TS, Modal, Inngest

you'll build a SaaS application that converts full podcasts into viral short-form clips ready for YouTube Shorts or TikTok. The tool uses different AI models to transcribe the video, automatically detect the most engaging moments in podcasts and create clips cropped to the active speaker's face. You'll learn how to build a complete production-ready SaaS with user authentication, a credit-based payment system using Stripe, and background processing queues to handle user load. All services used in this project are free, so you won't have to pay anything to follow along. We'll use technologies such as Next.js 15, React, Typescript, Tailwind CSS, ShadCN, Auth.js, Python, FastAPI, Stripe, Modal, Inngest, S3 on AWS, and more

Features:

  • ๐ŸŽฌ Auto-detection of viral moments in podcasts (stories, questions, etc.)

  • ๐Ÿ”Š Automatically added subtitles on clips

  • ๐Ÿ“ Transcription with m-bain/whisperX

  • ๐ŸŽฏ Active speaker detection for video cropping with Junhua-Liao/LR-ASD

  • ๐Ÿ“ฑ Clips optimized for vertical platforms (TikTok, YouTube Shorts)

  • ๐ŸŽž๏ธ GPU-accelerated video rendering with FFMPEGCV

  • ๐Ÿง  LLM-powered viral moment identification with Gemini 2.5 Pro

  • ๐Ÿ“Š Queue system with Inngest for handling user load

  • ๐Ÿ’ณ Credit-based system

  • ๐Ÿ’ฐ Stripe integration for credit pack purchases

  • ๐Ÿ‘ค User authentication system

  • ๐Ÿ“ฑ Responsive Next.js web interface

  • ๐ŸŽ›๏ธ Dashboard to upload podcasts and see clips

  • โฑ๏ธ Inngest for handling long-running processes

  • โšก Serverless GPU processing with Modal

  • ๐ŸŒ FastAPI endpoint for podcast processing

  • ๐ŸŽจ Modern UI with Tailwind CSS & Shadcn UI

  • codebase: https://www.devblog.blog/store/ai-podcast-clipping-saas-codebase-mm5o3m7e