aliyun-emo

Name: aliyun-emo
Author: cinience/alicloud-skills

$ npx mdskill add cinience/alicloud-skills/aliyun-emo

Generates expressive portrait videos from a person image and speech audio using Alibaba Cloud EMO.

- Creates expressive talking-head videos with strong expression control.
- Relies on Alibaba Cloud Model Studio EMO (`emo-v1`) and image detection models.
- Uses the detected `face_bbox` and `ext_bbox` to guide expression style and positioning.
- Saves generated videos and metadata in the `output/aliyun-emo/` directory for review.

SKILL.md

`.github/skills/aliyun-emo`

---
name: aliyun-emo
description: Use when generating expressive portrait videos from a person image and speech audio with Alibaba Cloud Model Studio EMO (`emo-v1`). Use when creating non-Wan avatar clips with stronger expression style control from a detected portrait image.
version: 1.0.0
---

Category: provider

# Model Studio EMO

## Validation

```bash
mkdir -p output/aliyun-emo
python -m py_compile skills/ai/video/aliyun-emo/scripts/prepare_emo_request.py && echo "py_compile_ok" > output/aliyun-emo/validate.txt
```

Pass criteria: command exits 0 and `output/aliyun-emo/validate.txt` is generated.

## Output And Evidence

- Save normalized request payloads, detection boxes, and task polling snapshots under `output/aliyun-emo/`.
- Record the chosen `style_level` and the exact `face_bbox` / `ext_bbox`.

Use EMO when the input is a portrait image and speech audio, and you need a non-Wan expressive talking-head result.

## Critical model names

Use these exact model strings:
- `emo-v1-detect`
- `emo-v1`

Selection guidance:
- Run image detection first to obtain `face_bbox` and `ext_bbox`.
- Use `emo-v1` only after detection succeeds.

## Prerequisites

- Available only in the China mainland (Beijing) region.
- Set `DASHSCOPE_API_KEY` in your environment, or add `dashscope_api_key` to `~/.alibabacloud/credentials`.
- Input files must be public HTTP/HTTPS URLs.
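The credential lookup order above can be sketched as follows. This is a minimal illustration, not the skill's own loader: the `resolve_api_key` helper is hypothetical, and the INI layout of `~/.alibabacloud/credentials` (a section such as `[default]` holding `dashscope_api_key`) is an assumption.

```python
import configparser
import os
from pathlib import Path


def resolve_api_key(env=None, credentials_path=None):
    """Return the API key from the environment, else from the credentials file.

    Hypothetical helper; the INI file layout is an assumption.
    """
    env = os.environ if env is None else env
    key = (env.get("DASHSCOPE_API_KEY") or "").strip()
    if key:
        return key

    path = Path(credentials_path or Path.home() / ".alibabacloud" / "credentials")
    if path.is_file():
        # interpolation=None so a '%' inside the key is read literally.
        parser = configparser.ConfigParser(interpolation=None)
        parser.read(path, encoding="utf-8")
        # Assumption: the key may sit in any named section, e.g. [default].
        for section in parser.sections():
            value = parser.get(section, "dashscope_api_key", fallback="").strip()
            if value:
                return value

    raise RuntimeError(
        "Set DASHSCOPE_API_KEY or add dashscope_api_key to ~/.alibabacloud/credentials"
    )
```

Checking the environment variable first keeps per-shell overrides simple, with the file as a fallback for persistent setups.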

## Normalized interface (video.emo)

### Detect Request
- `model` (string, optional): default `emo-v1-detect`
- `image_url` (string, required)

### Generate Request
- `model` (string, optional): default `emo-v1`
- `image_url` (string, required)
- `audio_url` (string, required)
- `face_bbox` (array<int>, required)
- `ext_bbox` (array<int>, required)
- `style_level` (string, optional): `normal`, `calm`, or `active`
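To make the field list concrete, here is a small sketch that assembles and validates a normalized generate request. The helpers `parse_bbox` and `build_generate_request` are hypothetical (they are not taken from `prepare_emo_request.py`), and the `x1,y1,x2,y2` ordering of the boxes is an assumption.

```python
ALLOWED_STYLE_LEVELS = {"normal", "calm", "active"}


def parse_bbox(text):
    """Parse a comma-separated box such as '302,286,610,593' into four ints."""
    parts = [int(p.strip()) for p in text.split(",")]
    if len(parts) != 4:
        raise ValueError(f"expected 4 integers, got {len(parts)}: {text!r}")
    return parts


def build_generate_request(image_url, audio_url, face_bbox, ext_bbox,
                           style_level=None, model="emo-v1"):
    """Build a normalized video.emo generate payload (hypothetical helper)."""
    for name, url in (("image_url", image_url), ("audio_url", audio_url)):
        if not url.startswith(("http://", "https://")):
            raise ValueError(f"{name} must be a public HTTP/HTTPS URL")
    for name, box in (("face_bbox", face_bbox), ("ext_bbox", ext_bbox)):
        if len(box) != 4 or not all(isinstance(v, int) for v in box):
            raise ValueError(f"{name} must be an array of 4 integers")
    payload = {
        "model": model,
        "image_url": image_url,
        "audio_url": audio_url,
        "face_bbox": list(face_bbox),
        "ext_bbox": list(ext_bbox),
    }
    # style_level is optional; omit it rather than guess a default.
    if style_level is not None:
        if style_level not in ALLOWED_STYLE_LEVELS:
            raise ValueError(f"style_level must be one of {sorted(ALLOWED_STYLE_LEVELS)}")
        payload["style_level"] = style_level
    return payload
```

In practice the two boxes would come straight from the detect response rather than from hand-typed strings.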

### Response
- `task_id` (string)
- `task_status` (string)
- `video_url` (string, when finished)
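Because `video_url` only appears once the task finishes, callers typically poll. The sketch below shows one way to do that against the normalized response shape. `wait_for_video` and the injected `fetch_task` callable are hypothetical, and the status strings (`SUCCEEDED`, `FAILED`, `CANCELED`, `UNKNOWN`) are an assumption about the provider's task states that this document does not confirm.

```python
import time

# Assumption: these are the terminal task_status values.
TERMINAL_STATUSES = {"SUCCEEDED", "FAILED", "CANCELED", "UNKNOWN"}


def wait_for_video(fetch_task, task_id, interval=5.0, max_polls=120, sleep=time.sleep):
    """Poll fetch_task(task_id) until the task reaches a terminal status.

    fetch_task is a caller-supplied function returning a normalized response
    dict with task_id, task_status and, when finished, video_url.
    Returns (video_url, snapshots) so each polling snapshot can be saved.
    """
    snapshots = []
    for _ in range(max_polls):
        snapshot = fetch_task(task_id)
        snapshots.append(snapshot)
        status = snapshot.get("task_status")
        if status == "SUCCEEDED":
            return snapshot["video_url"], snapshots
        if status in TERMINAL_STATUSES:
            raise RuntimeError(f"task {task_id} ended with status {status}")
        sleep(interval)
    raise TimeoutError(f"task {task_id} did not finish after {max_polls} polls")
```

Returning the snapshots alongside the URL makes it straightforward to store the polling history under `output/aliyun-emo/`.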

## Quick start

```bash
python skills/ai/video/aliyun-emo/scripts/prepare_emo_request.py \
  --image-url "https://example.com/portrait.png" \
  --audio-url "https://example.com/speech.mp3" \
  --face-bbox 302,286,610,593 \
  --ext-bbox 71,9,840,778 \
  --style-level active
```

## Operational guidance

- Do not invent `face_bbox` or `ext_bbox`; use the detection API output.
- The aspect ratio of `ext_bbox` determines the output resolution: `1:1` yields `512x512`, and `3:4` yields `512x704`.
- Keep the input portrait clear and front-facing for better expression quality.
- EMO is portrait-focused; for full-scene human videos use other skills instead.
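The `ext_bbox` ratio rule above can be checked before submitting a task. This is a rough sketch under two assumptions: the box is ordered `x1,y1,x2,y2`, and a small tolerance is acceptable when matching the ratio. `expected_resolution` is a hypothetical helper.

```python
def expected_resolution(ext_bbox, tolerance=0.02):
    """Map an ext_bbox (assumed x1, y1, x2, y2) to the expected output size."""
    x1, y1, x2, y2 = ext_bbox
    width, height = x2 - x1, y2 - y1
    if width <= 0 or height <= 0:
        raise ValueError(f"degenerate ext_bbox: {ext_bbox}")
    ratio = width / height
    # Width:height ratios named in the guidance above.
    for target, resolution in ((1.0, "512x512"), (3 / 4, "512x704")):
        if abs(ratio - target) <= tolerance:
            return resolution
    raise ValueError(f"ext_bbox ratio {ratio:.3f} is neither 1:1 nor 3:4")


# The Quick start box is 769 px wide and 769 px tall, i.e. 1:1.
print(expected_resolution([71, 9, 840, 778]))  # prints 512x512
```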

## Output location

- Default output: `output/aliyun-emo/request.json`
- Override base dir with `OUTPUT_DIR`.
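One possible reading of the override rule is sketched below: `OUTPUT_DIR` replaces the top-level `output` directory and the `aliyun-emo/request.json` suffix is kept. That interpretation is an assumption, and `request_output_path` and `save_request` are hypothetical helpers rather than the script's actual functions.

```python
import json
import os
from pathlib import Path


def request_output_path(env=None):
    """Resolve where request.json is written (assumed OUTPUT_DIR semantics)."""
    env = os.environ if env is None else env
    # Assumption: OUTPUT_DIR swaps out the base "output" directory only.
    base = Path(env.get("OUTPUT_DIR") or "output")
    return base / "aliyun-emo" / "request.json"


def save_request(payload, env=None):
    """Write the normalized payload as JSON and return the file path."""
    path = request_output_path(env)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(
        json.dumps(payload, indent=2, ensure_ascii=False) + "\n",
        encoding="utf-8",
    )
    return path
```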

## References

- `references/sources.md`
