video-podcast-maker (Claude Skill) — 설치 & 라이브 데모

왜 쓰나요

핵심 기능

2인 호스트 대화 아크를 갖춘 스크립트 작성
6개 TTS 엔진: Edge / Azure / OpenAI / ElevenLabs 등 — 품질/비용으로 선택
다국어: zh-CN과 en-US 1등급 지원
자동 조합: 비주얼과 음성 트랙 정렬 + 인트로/아웃트로
Bilibili / YouTube 직접 업로드 헬퍼

라이브 데모

실제 사용 모습

준비됨

설치

클라이언트 선택

~/Library/Application Support/Claude/claude_desktop_config.json · Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "video-podcast-maker-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/Agents365-ai/video-podcast-maker",
        "~/.claude/skills/video-podcast-maker"
      ],
      "_inferred": true
    }
  }
}

Claude Desktop → Settings → Developer → Edit Config 열기. 저장 후 앱 재시작.

~/.cursor/mcp.json · .cursor/mcp.json

{
  "mcpServers": {
    "video-podcast-maker-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/Agents365-ai/video-podcast-maker",
        "~/.claude/skills/video-podcast-maker"
      ],
      "_inferred": true
    }
  }
}

Cursor는 Claude Desktop과 동일한 mcpServers 스키마 사용. 프로젝트 설정이 전역보다 우선.

VS Code → Cline → MCP Servers → Edit

{
  "mcpServers": {
    "video-podcast-maker-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/Agents365-ai/video-podcast-maker",
        "~/.claude/skills/video-podcast-maker"
      ],
      "_inferred": true
    }
  }
}

Cline 사이드바의 MCP Servers 아이콘 클릭 후 "Edit Configuration" 선택.

~/.codeium/windsurf/mcp_config.json

{
  "mcpServers": {
    "video-podcast-maker-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/Agents365-ai/video-podcast-maker",
        "~/.claude/skills/video-podcast-maker"
      ],
      "_inferred": true
    }
  }
}

Claude Desktop과 같은 형식. Windsurf 재시작 후 적용.

~/.continue/config.json

{
  "mcpServers": [
    {
      "name": "video-podcast-maker-skill",
      "command": "git",
      "args": [
        "clone",
        "https://github.com/Agents365-ai/video-podcast-maker",
        "~/.claude/skills/video-podcast-maker"
      ]
    }
  ]
}

Continue는 맵이 아닌 서버 오브젝트 배열 사용.

~/.config/zed/settings.json

{
  "context_servers": {
    "video-podcast-maker-skill": {
      "command": {
        "path": "git",
        "args": [
          "clone",
          "https://github.com/Agents365-ai/video-podcast-maker",
          "~/.claude/skills/video-podcast-maker"
        ]
      }
    }
  }
}

context_servers에 추가. 저장 시 Zed가 핫 리로드.

claude mcp add video-podcast-maker-skill -- git clone https://github.com/Agents365-ai/video-podcast-maker ~/.claude/skills/video-podcast-maker

한 줄 명령. claude mcp list로 확인, claude mcp remove로 제거.

사용 사례

실전 활용법: video-podcast-maker

블로그 포스트를 팟캐스트 스타일 영상으로 변환

👤 글 콘텐츠를 재사용하는 크리에이터 ⏱ ~60 min intermediate

언제 쓸까: 긴 형식의 포스트가 있고 YouTube/B站에 올릴 10분짜리 영상을 만들고 싶을 때.

사전 조건

Skill 설치 — git clone https://github.com/Agents365-ai/video-podcast-maker ~/.claude/skills/video-podcast-maker
TTS 엔진 credential — 선택한 엔진의 환경 변수 설정 (예: AZURE_TTS_KEY)

흐름

스크립트 생성

Use video-podcast-maker. From post.md, write a two-host script (10 min) in en-US. Hosts: Alice (analytical), Bob (curious).✓ 복사됨

→ Alice/Bob 대사가 있는 턴별 스크립트
음성 렌더링

Render with Azure TTS — Alice: en-US-JennyNeural, Bob: en-US-GuyNeural.✓ 복사됨

→ 두 개의 MP3 트랙; 정렬 메타데이터
조합

Assemble video: title card, b-roll keywords from script, host avatars, captions.✓ 복사됨

→ MP4 준비됨, 1920×1080
업로드

Push to YouTube as unlisted with description + tags from the script.✓ 복사됨

→ YouTube URL

결과: 한 편의 긴 포스트에서 한 시간 이내에 완성된 팟캐스트 영상.

함정

TTS가 기술 용어를 잘못 발음 — 스크립트에서 까다로운 단어를 발음 힌트로 사전 표시; 대부분의 엔진 지원

함께 쓰기: humanizer-skill

일관된 호스트로 특정 주제 주간 팟캐스트 운영

👤 발행 주기가 있는 틈새 주제 크리에이터 ⏱ ~45 min advanced

언제 쓸까: AI/Web3/관심 분야에 대한 금요일 주간 팟캐스트를 자동화하고 싶을 때.

흐름

페르소나 정의

Set host personas: Alice (skeptic), Bob (enthusiast). Save as default.✓ 복사됨

→ 페르소나 파일 저장됨
주간 뉴스 조회

Use video-podcast-maker. Pull this week's top 5 stories on <topic> from RSS feeds. Generate the script.✓ 복사됨

→ 5개 세그먼트가 있는 스크립트
렌더링 + 발행

Render and publish to YouTube + Bilibili at Friday 9am.✓ 복사됨

→ 두 플랫폼 모두 에피소드 업로드됨

결과: 일관된 주간 콘텐츠; 에피소드당 거의 0의 작업.

함정

몇 에피소드 후에 AI 음성이 비슷하게 들림 — 페르소나 교체; 엔진 변경; 실제 인트로 스팅 추가

함께 쓰기: duckduckgo-mcp

영어 팟캐스트를 중국어로 현지화 (또는 반대)

👤 크로스 언어 시청자를 대상으로 하는 크리에이터 ⏱ ~50 min intermediate

언제 쓸까: 영어 팟캐스트가 있고 네이티브 음성으로 B站용 중국어 버전을 원할 때.

흐름

스크립트 번역

Use video-podcast-maker. Translate script from en-US to zh-CN preserving the conversational tone.✓ 복사됨

→ 직역이 아닌 문화 적응이 된 zh-CN 스크립트
네이티브 음성으로 렌더링

Render with zh-CN voices (e.g. Azure XiaoxiaoNeural + YunxiNeural).✓ 복사됨

→ 네이티브 품질 오디오
재조합 + Bilibili에 업로드

Use the same b-roll; new audio; new captions in zh-CN. Upload to B站.✓ 복사됨

→ B站 URL

결과: 번역 테이프가 아닌 진정한 크로스 언어 버전.

함정

직역으로 관용구가 손실됨 — Skill이 문화적으로 적응하도록 설정됨; 농담/레퍼런스는 수동으로 검토

조합

다른 MCP와 조합해 10배 효율

video-podcast-maker-skill + humanizer-skill

생성된 스크립트에서 AI 흔적 제거

Run humanizer on the script before TTS — sound less generated, more conversational.✓ 복사됨

video-podcast-maker-skill + duckduckgo-mcp

스크립트를 위한 최신 뉴스 수집

Search latest <topic> stories; feed top 5 into make_script.✓ 복사됨

도구

이 MCP가 노출하는 것

도구	입력	언제 호출	비용
make_script	source_text, hosts, length_min, language	1단계 — 스크립트 작성	LLM tokens
render_tts	script, engine, voices	스크립트 승인 후	TTS engine quota / $
assemble_video	audio_tracks, b_roll_keywords, theme	최종 조합	Local CPU/GPU
publish	platform, mp4_path, metadata	YouTube / Bilibili에 업로드	0
translate_script	script, target_language	현지화 단계	LLM tokens

비용 및 제한

운영 비용

API 쿼터: TTS 엔진에 문자당 제한이 있음; Azure 무료 티어 약 50만 자/월
호출당 토큰: 스크립트 약 3k–6k token; 조합은 로컬
금액: skill 무료; TTS 엔진 + LLM 비용 지불
팁: 초안에는 Edge TTS (무료) 사용; 프로덕션에만 Azure/ElevenLabs

보안

권한, 시크릿, 파급범위

최소 스코프: filesystem-write (output)

자격 증명 저장: 엔진 API 키는 환경 변수로; YouTube/Bilibili token은 비밀 파일에

데이터 외부 송신: TTS 엔진, LLM 제공자, 대상 플랫폼

절대 부여 금지: 확인 없이 채널에 게시할 수 있는 공개 OAuth token

음성 클로닝 기능(활성화된 경우)은 동의 문제를 야기함 — 명시적 허락이 있을 때만 클론

문제 해결

자주 발생하는 오류와 해결

TTS가 긴 줄을 잘라냄

대부분의 엔진은 줄당 약 250자 제한; skill이 자동 분할하지만 긴 문장 확인

긴 렌더링에서 오디오 드리프트

청크로 렌더링하고 크로스페이드로 연결 — skill이 5분 이상에서 기본적으로 처리

B站 업로드 실패

비밀 파일의 쿠키 확인; B站은 재로그인이 필요할 수 있음

자막이 동기화되지 않음

정렬 재실행; 일부 TTS 엔진이 타이밍을 과소 보고함 — skill에 재동기화 모드가 있음

대안

video-podcast-maker 다른 것과 비교

대안	언제 쓰나	단점/장점
ElevenLabs Studio	세련된 SaaS UI를 원할 때	더 비쌈; 채팅에서 자동화 적음
NotebookLM Audio Overview	어떤 소스에서든 원샷 2인 호스트 오디오 요약을 원할 때	영상 없음; 제어 적음; 클라우드 전용

video-podcast-maker