Create Talking Avatar Videos from Text or Audio
Create AI lip-sync videos for avatars using text-to-speech or audio input. Supports photorealistic and cartoon styles.
Why it matters
Generate lip-sync videos for digital avatars using text-to-speech or audio input. Supports photorealistic and cartoon styles, enabling dynamic avatar communication without pre-training.
Outcomes
What it gets done
Animate digital avatars to lip-sync with provided audio.
Generate talking avatar videos from text input using TTS.
Create digital avatars from images or videos.
Integrate with MCP for seamless workflow automation.
Install
Add it to your toolbox
Run in your project directory:
curl -fsSL https://spark.entire.vc/get/vb-flyworks-avatar | bash Capabilities
Tools your agent gets
Create lip-sync video with audio input, animating a digital avatar to match speech in sync.
Create lip-sync video with text input, generating audio from text and animating avatar.
Overview
Flyworks Avatar MCP Server
What it does
As a digital content creator, I want to generate engaging talking avatar videos quickly and efficiently, so that I can enhance my presentations and social media content.
The big job is to create professional-looking AI-generated video content featuring digital avatars that can speak. The small job is to animate a digital avatar to lip-sync with provided audio or text-to-speech.
Here's how you can install it:
npx -y @smithery/cli install @Flyworks-AI/flyworks-mcp --client claude
Or from source:
git clone https://github.com/yourusername/flyworks-mcp.git
cd flyworks-mcp
pip install httpx "mcp[cli]>=1.6.0"
This MCP server provides fast lip-sync video creation, supporting both photorealistic and cartoon styles. You can use it to:
- Create a lip-sync video using an avatar and audio file
- Generate a talking avatar video from text using text-to-speech
- Create a digital avatar from an image or video
- Convert text to speech and animate an avatar to speak it
Source README
An MCP server that provides free and fast lip-sync video creation for digital avatars without pre-training, supporting both photorealistic and cartoon styles with text-to-speech and audio input capabilities.
Installation
Smithery
npx -y @smithery/cli install @Flyworks-AI/flyworks-mcp --client claude
From Source
git clone https://github.com/yourusername/flyworks-mcp.git
cd flyworks-mcp
pip install httpx "mcp[cli]>=1.6.0"
Using uv
uv pip install httpx "mcp[cli]>=1.6.0"
With All Dependencies
pip install pygments pydantic-core httpx "mcp[cli]>=1.6.0"
Configuration
Claude Desktop
{
"mcpServers": {
"flyworks": {
"command": "uvx",
"args": [
"flyworks-mcp",
"-y"
],
"env": {
"FLYWORKS_API_TOKEN": "your_api_token_here",
"FLYWORKS_API_BASE_URL": "https://hfw-api.hifly.cc/api/v2/hifly",
"FLYWORKS_MCP_BASE_PATH": "/path/to/your/output/directory"
}
}
}
}
Available Tools
| Tool | Description |
|---|---|
create_lipsync_video_by_audio |
Create lip-sync video with audio input, animating a digital avatar to match speech in sync... |
create_lipsync_video_by_text |
Create lip-sync video with text input, generating audio from text and animating... |
Features
- Lip-sync video creation using digital avatar video and audio as inputs
- Lip-sync video generation from text (with text-to-speech)
- Digital avatar creation from images or videos
- Support for both asynchronous and synchronous modes
- Support for photorealistic and cartoon avatar styles
- Free trial access with limited quota
Environment Variables
Required
FLYWORKS_API_TOKEN- API token for authentication with the Flyworks service
Optional
FLYWORKS_API_BASE_URL- Base URL for the Flyworks APIFLYWORKS_MCP_BASE_PATH- Path to the directory where output files will be saved
Usage Examples
Create a lip-sync video using an avatar and audio file
Generate a talking avatar video from text using text-to-speech
Create a digital avatar from an image or video
Convert text to speech and animate an avatar to speak it
Resources
Notes
A free trial token '2aeda3bcefac46a3' is available with limited daily quota, videos include watermarks, and there's a 45-second duration limit. Creating avatars from video provides better quality but takes longer than image-based creation. Maximum wait time for synchronous mode is 10 minutes. Video URLs are temporary and should be downloaded immediately.
Discussion
Questions & comments · 0
Sign In Sign in to leave a comment.