MCP

Create Talking Avatar Videos from Text or Audio

Name: Create Talking Avatar Videos from Text or Audio
Availability: OnlineOnly
Author: VibeBaza

Create AI lip-sync videos for avatars using text-to-speech or audio input. Supports photorealistic and cartoon styles.

Connect

Works with github

VibeBaza

Maintainer?

Spark score

out of 100

Updated 4 months ago

Version 1.0.0

Models

claude

Add to Favorites

Why it matters

Generate lip-sync videos for digital avatars using text-to-speech or audio input. Supports photorealistic and cartoon styles, enabling dynamic avatar communication without pre-training.

Outcomes

What it gets done

Animate digital avatars to lip-sync with provided audio.

Generate talking avatar videos from text input using TTS.

Create digital avatars from images or videos.

Integrate with MCP for seamless workflow automation.

Install

Add it to your toolbox

Run in your project directory:

curl -fsSL https://spark.entire.vc/get/vb-flyworks-avatar | bash

Capabilities

Tools your agent gets

create_lipsync_video_by_audio

Create lip-sync video with audio input, animating a digital avatar to match speech in sync.

create_lipsync_video_by_text

Create lip-sync video with text input, generating audio from text and animating avatar.

Overview

Flyworks Avatar MCP Server

What it does

As a digital content creator, I want to generate engaging talking avatar videos quickly and efficiently, so that I can enhance my presentations and social media content.

The big job is to create professional-looking AI-generated video content featuring digital avatars that can speak. The small job is to animate a digital avatar to lip-sync with provided audio or text-to-speech.

Here's how you can install it:

npx -y @smithery/cli install @Flyworks-AI/flyworks-mcp --client claude

Or from source:

git clone https://github.com/yourusername/flyworks-mcp.git
cd flyworks-mcp
pip install httpx "mcp[cli]>=1.6.0"

This MCP server provides fast lip-sync video creation, supporting both photorealistic and cartoon styles. You can use it to:

Create a lip-sync video using an avatar and audio file
Generate a talking avatar video from text using text-to-speech
Create a digital avatar from an image or video
Convert text to speech and animate an avatar to speak it

Source README

An MCP server that provides free and fast lip-sync video creation for digital avatars without pre-training, supporting both photorealistic and cartoon styles with text-to-speech and audio input capabilities.

Installation

Smithery

npx -y @smithery/cli install @Flyworks-AI/flyworks-mcp --client claude

From Source

git clone https://github.com/yourusername/flyworks-mcp.git
cd flyworks-mcp
pip install httpx "mcp[cli]>=1.6.0"

Using uv

uv pip install httpx "mcp[cli]>=1.6.0"

With All Dependencies

pip install pygments pydantic-core httpx "mcp[cli]>=1.6.0"

Configuration

Claude Desktop

{
  "mcpServers": {
    "flyworks": {
      "command": "uvx",
      "args": [
        "flyworks-mcp",
        "-y"
      ],
      "env": {
        "FLYWORKS_API_TOKEN": "your_api_token_here",
        "FLYWORKS_API_BASE_URL": "https://hfw-api.hifly.cc/api/v2/hifly",
        "FLYWORKS_MCP_BASE_PATH": "/path/to/your/output/directory"
      }
    }
  }
}

Available Tools

Tool	Description
`create_lipsync_video_by_audio`	Create lip-sync video with audio input, animating a digital avatar to match speech in sync...
`create_lipsync_video_by_text`	Create lip-sync video with text input, generating audio from text and animating...

Features

Lip-sync video creation using digital avatar video and audio as inputs
Lip-sync video generation from text (with text-to-speech)
Digital avatar creation from images or videos
Support for both asynchronous and synchronous modes
Support for photorealistic and cartoon avatar styles
Free trial access with limited quota

Environment Variables

Required

FLYWORKS_API_TOKEN - API token for authentication with the Flyworks service

Optional

FLYWORKS_API_BASE_URL - Base URL for the Flyworks API
FLYWORKS_MCP_BASE_PATH - Path to the directory where output files will be saved

Usage Examples

Create a lip-sync video using an avatar and audio file

Generate a talking avatar video from text using text-to-speech

Create a digital avatar from an image or video

Convert text to speech and animate an avatar to speak it

Resources

GitHub Repository

Notes

A free trial token '2aeda3bcefac46a3' is available with limited daily quota, videos include watermarks, and there's a 45-second duration limit. Creating avatars from video provides better quality but takes longer than image-based creation. Maximum wait time for synchronous mode is 10 minutes. Video URLs are temporary and should be downloaded immediately.

Discussion