Skip to content

volcengine/mediakit-cli

Repository files navigation

AI MediaKit CLI

简体中文

AI MediaKit CLI — interface rebuilt for LLMs · agent-native · cloud + local

The agent-native command-line toolkit for audio & video. Run Volcengine's cloud AI and local editing through one unified command — built to be driven by AI agents (Claude Code, Trae, Cursor …) or by you in the terminal.

mediakit-cli packs video enhancement, subtitle removal, and a full editing toolbox into a single tool. Heavy AI runs in the cloud; lightweight editing runs locally — switch with one flag, same command surface.


✨ What it can do

Capability Runs on Status
AI — video enhancement · subtitle removal cloud ✅ Available
Editing (11) — trim · concat · watermark · subtitle · speed · flip · extract / mux audio · image-to-video cloud or local ✅ Available
More AI workflows — video understanding · translation · highlight clipping · script extraction · commentary · anime restyling … cloud 🚧 Rolling out

AI capabilities run in the cloud (elastic compute, async). Editing runs either in the cloud or locally (sync, zero cost) — pick per command with --cloud / --local.


🚀 Quick Start

npm install -g @volcengine/mediakit-cli
npx skills add volcengine/mediakit-cli -g -y   # optional — install agent Skills (Claude Code / Trae / Cursor …)
export MEDIAKIT_API_KEY=<your-api-key>         # from the AI MediaKit console

# Cloud AI (async): enhance to 1080p, then poll for the result
mediakit-cli --cloud video enhance-video --video-url <url> --resolution 1080p
mediakit-cli shared query-task --task-id <task_id> --poll-complete

# Local editing (sync, no key needed): runs on your machine
mediakit-cli --local editing trim-video --video-url ./in.mp4 --start-time 3 --end-time 8

📦 Install

# npm (recommended, cross-platform — pulls the right build for your OS / arch)
npm install -g @volcengine/mediakit-cli

# npx (no install)
npx @volcengine/mediakit-cli version

# curl (macOS / Linux)
curl -fsSL https://raw.githubusercontent.com/volcengine/mediakit-cli/main/scripts/install.sh | bash

Pin a version or path: VERSION=<version> INSTALL_DIR="$HOME/.local/bin" curl -fsSL …/install.sh | bash

Verify: mediakit-cli doctor (checks cloud readiness + local tool deps + install hints).


🤖 Use with AI Agents

mediakit-cli ships AI agent Skills that teach an agent how to call it — so a user can just say "enhance this video to 1080p and trim the best 5 seconds" and the agent orchestrates the commands.

# One command installs the Skills into every supported agent on your machine
npx skills add volcengine/mediakit-cli -g -y

This auto-detects and installs to 10+ runtimes — Claude Code, Trae (CN & Global), Cursor, Codex, Gemini CLI, GitHub Copilot, OpenCode, OpenClaw, Antigravity, and more.

Every capability is also MCP-compatiblemediakit-cli <domain> <tool> --schema emits a JSON Schema for MCP / Anthropic Tool Use / function-calling, no hand-written adapter needed.


🧩 How it works

  • Two modes, one command surface. --cloud runs heavy AI in Volcengine's cloud (elastic compute, async task_id); --local runs deterministic editing locally (sync, zero cloud cost). Default mode is cloud-first; per-command flags override it.
  • Command structure: mediakit-cli [--cloud|--local] <domain> <tool> [flags] — domains are editing · video · shared.
  • Outputs: cloud results are returned as URLs; local results write to ~/.mediakit/temp (override with --output-path or MEDIAKIT_OUTPUT_PATH).

📖 Documentation


🛠 Development

make build          # local build → .mediakit/build/dev/mediakit-cli
make build-all      # all platforms
make snapshot       # snapshot release

Releases are produced via .goreleaser.yml; npm distribution via package.json + scripts/install.js; curl install via scripts/install.sh.

Local Tool Admission (FFmpeg policy)
  • ffmpeg / ffprobe: required, 5.1.x, LGPL v2.1 or later, commercial use allowed
  • Optional FFmpeg features: openh264, libmp3lame, libass, libfreetype, libfontconfig, libfribidi, libharfbuzz, zlib, libpng, libjpeg-turbo
  • Boundary: external process execution only (no static/dynamic linking of local tools into the Go binary); FFmpeg stays in LGPL mode by default; no non-free components; no local intermediate artifacts retained (only final outputs + fetch-file downloads).

License

This project is open-sourced under the MIT License.

This software calls MediaKit APIs at runtime. Use of these APIs is subject to the following terms and privacy policies:

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors