MCP

Create Talking Avatar Videos from Text or Audio

Create AI lip-sync videos for avatars using text-to-speech or audio input. Supports photorealistic and cartoon styles.

Works with github

90
Spark score
out of 100
Updated 4 months ago
Version 1.0.0
Models

Add to Favorites

Why it matters

Generate lip-sync videos for digital avatars using text-to-speech or audio input. Supports photorealistic and cartoon styles, enabling dynamic avatar communication without pre-training.

Outcomes

What it gets done

01

Animate digital avatars to lip-sync with provided audio.

02

Generate talking avatar videos from text input using TTS.

03

Create digital avatars from images or videos.

04

Integrate with MCP for seamless workflow automation.

Install

Add it to your toolbox

Run in your project directory:

curl -fsSL https://spark.entire.vc/get/vb-flyworks-avatar | bash

Capabilities

Tools your agent gets

create_lipsync_video_by_audio

Create lip-sync video with audio input, animating a digital avatar to match speech in sync.

create_lipsync_video_by_text

Create lip-sync video with text input, generating audio from text and animating avatar.

Overview

Flyworks Avatar MCP Server

What it does

As a digital content creator, I want to generate engaging talking avatar videos quickly and efficiently, so that I can enhance my presentations and social media content.

The big job is to create professional-looking AI-generated video content featuring digital avatars that can speak. The small job is to animate a digital avatar to lip-sync with provided audio or text-to-speech.

Here's how you can install it:

npx -y @smithery/cli install @Flyworks-AI/flyworks-mcp --client claude

Or from source:

git clone https://github.com/yourusername/flyworks-mcp.git
cd flyworks-mcp
pip install httpx "mcp[cli]>=1.6.0"

This MCP server provides fast lip-sync video creation, supporting both photorealistic and cartoon styles. You can use it to:

  • Create a lip-sync video using an avatar and audio file
  • Generate a talking avatar video from text using text-to-speech
  • Create a digital avatar from an image or video
  • Convert text to speech and animate an avatar to speak it
Source README

An MCP server that provides free and fast lip-sync video creation for digital avatars without pre-training, supporting both photorealistic and cartoon styles with text-to-speech and audio input capabilities.

Installation

Smithery

npx -y @smithery/cli install @Flyworks-AI/flyworks-mcp --client claude

From Source

git clone https://github.com/yourusername/flyworks-mcp.git
cd flyworks-mcp
pip install httpx "mcp[cli]>=1.6.0"

Using uv

uv pip install httpx "mcp[cli]>=1.6.0"

With All Dependencies

pip install pygments pydantic-core httpx "mcp[cli]>=1.6.0"

Configuration

Claude Desktop

{
  "mcpServers": {
    "flyworks": {
      "command": "uvx",
      "args": [
        "flyworks-mcp",
        "-y"
      ],
      "env": {
        "FLYWORKS_API_TOKEN": "your_api_token_here",
        "FLYWORKS_API_BASE_URL": "https://hfw-api.hifly.cc/api/v2/hifly",
        "FLYWORKS_MCP_BASE_PATH": "/path/to/your/output/directory"
      }
    }
  }
}

Available Tools

Tool Description
create_lipsync_video_by_audio Create lip-sync video with audio input, animating a digital avatar to match speech in sync...
create_lipsync_video_by_text Create lip-sync video with text input, generating audio from text and animating...

Features

  • Lip-sync video creation using digital avatar video and audio as inputs
  • Lip-sync video generation from text (with text-to-speech)
  • Digital avatar creation from images or videos
  • Support for both asynchronous and synchronous modes
  • Support for photorealistic and cartoon avatar styles
  • Free trial access with limited quota

Environment Variables

Required

  • FLYWORKS_API_TOKEN - API token for authentication with the Flyworks service

Optional

  • FLYWORKS_API_BASE_URL - Base URL for the Flyworks API
  • FLYWORKS_MCP_BASE_PATH - Path to the directory where output files will be saved

Usage Examples

Create a lip-sync video using an avatar and audio file
Generate a talking avatar video from text using text-to-speech
Create a digital avatar from an image or video
Convert text to speech and animate an avatar to speak it

Resources

Notes

A free trial token '2aeda3bcefac46a3' is available with limited daily quota, videos include watermarks, and there's a 45-second duration limit. Creating avatars from video provides better quality but takes longer than image-based creation. Maximum wait time for synchronous mode is 10 minutes. Video URLs are temporary and should be downloaded immediately.

Discussion

Questions & comments · 0

Sign In Sign in to leave a comment.