Unstructured UNS MCP — インストール & ライブデモ

なぜ使うのか

主な機能

60+ source connectors (S3, Drive, OneDrive, SharePoint, GCS)
Document partitioning into elements (Title, Table, NarrativeText, ListItem)
Built-in chunkers (by_title, basic)
Embeddings via OpenAI, Voyage, or local models
Targets: vector DBs (Pinecone, Weaviate, Qdrant) or warehouses

ライブデモ

実際の動作

unstructured-uns-mcp.replay ▶ 準備完了

0/0

インストール

クライアントを選択

~/Library/Application Support/Claude/claude_desktop_config.json · Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "unstructured-uns-mcp": {
      "command": "uvx",
      "args": [
        "uns-mcp"
      ]
    }
  }
}

Claude Desktop → Settings → Developer → Edit Config を開く。保存後、アプリを再起動。

~/.cursor/mcp.json · .cursor/mcp.json

{
  "mcpServers": {
    "unstructured-uns-mcp": {
      "command": "uvx",
      "args": [
        "uns-mcp"
      ]
    }
  }
}

Cursor は Claude Desktop と同じ mcpServers スキーマを使用。プロジェクト設定はグローバルより優先。

VS Code → Cline → MCP Servers → Edit

{
  "mcpServers": {
    "unstructured-uns-mcp": {
      "command": "uvx",
      "args": [
        "uns-mcp"
      ]
    }
  }
}

Cline サイドバーの MCP Servers アイコンをクリックし、"Edit Configuration" を選択。

~/.codeium/windsurf/mcp_config.json

{
  "mcpServers": {
    "unstructured-uns-mcp": {
      "command": "uvx",
      "args": [
        "uns-mcp"
      ]
    }
  }
}

Claude Desktop と同じ形式。Windsurf を再起動して反映。

~/.continue/config.json

{
  "mcpServers": [
    {
      "name": "unstructured-uns-mcp",
      "command": "uvx",
      "args": [
        "uns-mcp"
      ]
    }
  ]
}

Continue はマップではなくサーバーオブジェクトの配列を使用。

~/.config/zed/settings.json

{
  "context_servers": {
    "unstructured-uns-mcp": {
      "command": {
        "path": "uvx",
        "args": [
          "uns-mcp"
        ]
      }
    }
  }
}

context_servers に追加。保存時に Zed がホットリロード。

claude mcp add unstructured-uns-mcp -- uvx uns-mcp

ワンライナー。claude mcp list で確認、claude mcp remove で削除。

ユースケース

実用的な使い方： Unstructured UNS MCP

Stand up a RAG pipeline from a SharePoint folder to Pinecone

👤 Data engineers / RAG builders ⏱ ~15 min intermediate

使うタイミング： You have a corporate doc dump and want a Claude-queryable index.

前提条件

Server/skill installed and authenticated — See repo README

フロー

Define the pipeline

Create an Unstructured pipeline: source SharePoint folder X, partition by_title with 1024 token max, embed with text-embedding-3-small, target Pinecone index 'corp-docs'.✓ コピーしました

→ Pipeline id
Run and monitor

Run it and tell me when it's done. Report any failed documents.✓ コピーしました

→ Status updates + final summary

結果： Production-grade ingest with proper chunking — not naive PDF text dumps.

注意点

Default chunkers can split tables across chunks. For dense tabular docs, use 'by_title' with combine_text_under_n_chars. — Default chunkers can split tables across chunks. For dense tabular docs, use 'by_title' with combine_text_under_n_chars.

組み合わせ： filesystem · qdrant-mcp-server

組み合わせ

他のMCPと組み合わせて10倍の力を

unstructured-uns-mcp + filesystem

Pair with filesystem for complementary capabilities

Use this server together with filesystem to complete a multi-step task.✓ コピーしました

unstructured-uns-mcp + qdrant-mcp-server

Pair with qdrant-mcp-server for complementary capabilities

Use this server together with qdrant-mcp-server to complete a multi-step task.✓ コピーしました

ツール

このMCPが提供する機能

ツール	入力	呼び出すタイミング	コスト
create_pipeline	source, dest, partition_args, chunk_args	Define a new ingest job	1 API call
run_pipeline	pipeline_id	Execute the pipeline	Per Unstructured plan
list_workflows	(none)	See all configured workflows	1 API call

コストと制限

運用コスト

APIクォータ: See provider docs for rate limits
呼び出しあたりのトークン: Varies by tool
金額: See repo README for pricing details
ヒント: Cache tool results and avoid repeated identical calls.

セキュリティ

権限、シークレット、影響範囲

認証情報の保管： Use environment variables; never commit secrets

データ送信先： Tool calls go to the provider's API as documented

トラブルシューティング

よくあるエラーと対処法

401 Unauthorized

Get an API key at unstructured.io → Settings → API keys; set UNSTRUCTURED_API_KEY and UNSTRUCTURED_API_URL.

確認： list_workflows returns at least one

source connector auth failure

Connector creds (e.g. SharePoint app token) are configured per-source. Recreate the source via Unstructured UI to refresh.

確認： Run a 1-file test pipeline first

代替案

Unstructured UNS MCP 他との比較

代替案	代わりに使う場面	トレードオフ
LlamaIndex / llamacloud	You want a managed RAG-as-a-service product	Higher-level but less control over chunking

その他

リソース

📖 GitHub の公式 README を読む

🐙 オープンな issue を見る

🔍 400以上のMCPサーバーとSkillsを見る