U

UI-TARS Desktop

@sickn33
Visit Skill
Share:

Open-source multimodal AI agent stack connecting models and agent infrastructure.

What is UI-TARS Desktop?

UI-TARS Desktop is a multimodal agent stack for GUI, browser, and computer-use workflows built around modern agent infrastructure.

How to use UI-TARS Desktop?

Use it when you need desktop or browser-operating agents that combine multimodal models with agent orchestration patterns.

Key features of UI-TARS Desktop

  • Multimodal agent stack
  • GUI and browser-use workflows
  • Computer-use positioning
  • MCP-compatible ecosystem relevance

Use cases of UI-TARS Desktop

  1. 1Desktop agent automation
  2. 2Computer-use experimentation
  3. 3Multimodal agent prototyping

Installation

RecommendedOne-line install
git clone --depth 1 https://github.com/sickn33/antigravity-awesome-skills.git /tmp/ag-skills && mkdir -p ~/.agent/skills && cp -r /tmp/ag-skills/skills/ui-tars-desktop ~/.agent/skills/ && rm -rf /tmp/ag-skills
Step-by-step install
Step 1: Create skills directory
mkdir -p ~/.agent/skills
Step 2: Clone the skills repository
git clone --depth 1 https://github.com/sickn33/antigravity-awesome-skills.git /tmp/ag-skills
Step 3: Copy this skill
cp -r /tmp/ag-skills/skills/ui-tars-desktop ~/.agent/skills/
Step 4: Clean up temporary files
rm -rf /tmp/ag-skills

Supported Platforms

claudecursorantigravitygemini

Server Config

config.json
{
  "mcpServers": {
    "ui-tars-desktop": {
      "command": "npx",
      "args": [
        "-y",
        "ui-tars-desktop"
      ]
    }
  }
}
View on GitHub