Search Themes
Search for themes and categories
Shush
Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app
Overview:
Shush is an application that showcases the deployment of a WhisperV3 model with Flash Attention v2 on Modal and interacts with it through a NextJS app. The main purpose of this app is to offer a comprehensive demonstration for individuals interested in utilizing high-performance models and reliable APIs on demand with auto-scaling capabilities. It is a demo application constructed with Next.js for the frontend and Modal for the backend.
Features:
- WhisperV3 Model Deployment: Demonstrates the deployment of a WhisperV3 model with Flash Attention v2 on Modal.
- NextJS Integration: Utilizes a NextJS app for interacting with the deployed model.
- Auto-Scaling Capabilities: Showcases the ability to run high-performance models and APIs on demand with auto-scaling.
Installation:
Set Up:
- Visit modal.com and create a free account.
- Install the Modal python package and authenticate in your CLI by following the instructions on the website.
Deploy Backend:
We will be using Modal to deploy and serve the WhisperV3 model. Execute the following commands in your terminal:
# Commands to deploy the WhisperV3 model using Modal
Deploy Frontend:
- Run the NextJS app by executing the following commands after navigating to the root of the repository:
# Commands to run the NextJS app
- Create a .env file and add the URL obtained from Modal (refer to .env.example for guidance).
- Once the setup is complete, access http://localhost:3000/ in your browser to test the application and model functionality.
Summary:
Shush is a demonstration app that exhibits the deployment of a WhisperV3 model utilizing Modal for backend services and a NextJS app for frontend interactions. By following the installation steps provided, users can experience running high-performance models with auto-scaling capabilities.