
Technology Stack
Project Overview
AgentTube is an AI-powered content creation assistant made with Next.JS that analyzes YouTube videos and enhances content generation using advanced AI models like Claude and Gemini. By providing a video link, users can generate transcripts, receive AI-suggested video titles, create improved or alternative scripts, and generate AI-powered thumbnails using OpenAI image generation model.
Objectives
The project aimed to achieve the following:
- Develop a Full-Stack AI-powered content Agent – Enable users to analyze YouTube videos and instead of only providing summary to user use AI tools to generate video Transcripts, Titles, Thumbnails and Scripts with the help of multiple models.
- Googl API – Use Google API services to get video details like transcription, thumbnail and metadata and store it to Convex database to caches.
- Vercel AI SDK – Use Vercel AI SDK with Gemini Model to create tools that Chat with user, generate Scripts, and Titles and with OpenAI Model to generate AI Thumbnails for video.
- Implement Subscription Plans – Use Schematic and Stripe to support Free, Starter, Creator and Enterprise with monthly and yearly subscription plans that user can manage through schematic elements.
- Features Tokens – Use Schematic to create features and tokens that then we can attach with user plans to provide certain functionality or limited access to a feature based on user plan.
- Implement Secure Authentication – Use Clerk to support email/password and social logins (Google) with profile management.
Features
Landing Page
Responsive landing page to Hero section that contains an input box to paste youtube link and start analyzing video along with Features sections showcase the features of the application and Pricing sections to showcase pricing cards for Free, Starter, Creator and Enterprise plans with both monthly and yearly billing options.
-
Hero Section

-
Features Section

-
Pricing Section

SignIn Page
A simple yest customized SignIn page using Clerk SignIn components.

SignUp Page
A simple yest customized SignUp page using Clerk SignUp components.

Dark Mode
A theme button to switch between Light and Dark Mode.

Dashboard Page
A simple Dashboard with an Input Box to paste youtube link for analyzing, a Collapsible Sider for navigating between analyzed videos and User Profile.

Analyze Page
This is the screen use lands on after the youtube video has been analyzed. It is divided into 6 sections.
- Chat Box Section – An AI chat agent where user can ask questions related to youtube video or ask agent to performs certain tasks like generate a script or a title for video. It has also quick button for performing such tasks.
- Video Token Count Section – A token counts showing user that how many tokens he has left for the month and a status bar showing wether the video analyzation happened by spending a token or by caches for user database.
- Video details Section – This sections showcase youtube video metadata like Thumbnail, Title, User name, Channel name and etc.
- Thumbnail Generation Section – A thumbnail generation section contains the list of all the ai generated thumbnails related to this video and the total count of thumbnails tokens spend by the user.
- Title Generation Section – Just like Thumbnail Generation Section, this contains the list of all the ai generated titles related to this video and the total count of titles tokens spend by the user in a month.
- Transcription Section – A transcription section for video that is used by AI agent tools as context for creating script and the total count of scripts fetched by the application.

Manage Plan Page
A simple screen that used Schematic Elements to showcase your current plan along with features access, total tokens and next billing cycle.

Plan Selector Modal
Clicking Change plan will open a multi step schematic element modal to allow user to Upgrade or Downgrade the plan and a checkout step for adding payment method or discount codes.
-
Plan Selection Step

-
Checkout Step

-
Save Payment Method

Conclusion
Building AgentTube was a really rewarding experience. From product point of view the goal was to create something useful for content creators — a tool that could turn YouTube videos into better titles, cleaner scripts, engaging thumbnails, and full transcripts, all powered by AI but from developer or learning point of view the goal was to understand AI agents (Vercel AI SDK) and AI tools along with proper AI prompting. Seeing that idea come to life was pretty awesome.
One of the best parts for me was working with the Vercel AI SDK and integrating models like Claude, Gemini, and OpenAI. It gave me hands-on experience with building real, interactive tools powered by AI — not just answering questions, but actually helping users create. It pushed me to think differently about user interaction and how AI can fit naturally into creative workflows.
I also got to dive deeper into user authentication, subscription plans, and feature control using tools like Clerk, Stripe, and Schematic — which made me more confident in building complete, production-ready apps.
Overall, AgentTube taught me a lot — not just about tech, but about designing things that feel intuitive, helpful, and fun to use. It really leveled up my skills with AI integration, and I’m excited to build even more tools like this in the future.