aliyun-modelstudio-entry-test
$
npx mdskill add cinience/alicloud-skills/aliyun-modelstudio-entry-testRuns minimal tests for Model Studio skills and records results
- Validates basic functionality of image, audio, video, and other AI skills
- Uses DashScope SDK and Alibaba Cloud Model Studio APIs
- Executes one small request per skill in the test matrix
- Logs outputs for verification and debugging purposes
SKILL.md
.github/skills/aliyun-modelstudio-entry-testView on GitHub ↗
--- name: aliyun-modelstudio-entry-test description: Use when running a minimal test matrix for the Model Studio skills that exist in this repo, including image/video/audio, realtime speech, omni, visual reasoning, embedding, rerank, and edit variants. Use to execute one small request per skill and record results. version: 1.0.0 --- Category: task # Model Studio Skills Minimal Test Run minimal validation for currently available Model Studio skills in this repo and record results. ## Prerequisites - Install SDK (virtual environment recommended to avoid PEP 668 restrictions): ```bash python3 -m venv .venv . .venv/bin/activate python -m pip install dashscope ``` - Configure `DASHSCOPE_API_KEY` (environment variable preferred; or `dashscope_api_key` in `~/.alibabacloud/credentials`). ## Test Matrix (currently supported) 1) Text-to-image → `skills/ai/image/aliyun-qwen-image/` 2) Image editing → `skills/ai/image/aliyun-qwen-image-edit/` 3) Text-to-video / Image-to-video (i2v) → `skills/ai/video/aliyun-wan-video/` 4) Reference-to-video (r2v) → `skills/ai/video/aliyun-wan-r2v/` 5) TTS → `skills/ai/audio/aliyun-qwen-tts/` 6) ASR transcription (non-realtime) → `skills/ai/audio/aliyun-qwen-asr/` 7) Realtime ASR → `skills/ai/audio/aliyun-qwen-asr-realtime/` 8) Realtime TTS → `skills/ai/audio/aliyun-qwen-tts-realtime/` 9) Live speech translation → `skills/ai/audio/aliyun-qwen-livetranslate/` 10) CosyVoice voice clone → `skills/ai/audio/aliyun-cosyvoice-voice-clone/` 11) CosyVoice voice design → `skills/ai/audio/aliyun-cosyvoice-voice-design/` 12) Voice clone → `skills/ai/audio/aliyun-qwen-tts-voice-clone/` 13) Voice design → `skills/ai/audio/aliyun-qwen-tts-voice-design/` 14) Omni multimodal → `skills/ai/multimodal/aliyun-qwen-omni/` 15) Visual reasoning → `skills/ai/multimodal/aliyun-qvq/` 16) Text embedding → `skills/ai/search/aliyun-qwen-text-embedding/` 17) Rerank → `skills/ai/search/aliyun-qwen-rerank/` 18) Video editing → `skills/ai/video/aliyun-wan-edit/` If new capability tests are needed, create corresponding skill first (use `skills/ai/misc/aliyun-modelstudio-crawl-and-skill/` to refresh model list). ## Minimal Flow Per Capability 1. Open target sub-skill directory and read `SKILL.md`. 2. Choose one minimal input example and recommended model. 3. Run SDK call or script. 4. Record model, request summary, response summary, duration, and status. ## Result Template Save as `output/aliyun-modelstudio-entry-test-results.md`: ``` # Model Studio Skill Test Results - Date: YYYY-MM-DD - Environment: region / API_BASE / auth method | Capability | Sub-skill | Model | Request summary | Result summary | Status | Notes | | --- | --- | --- | --- | --- | --- | --- | | Text-to-image | aliyun-qwen-image | <model-id> | ... | ... | pass/fail | ... | | Image editing | aliyun-qwen-image-edit | <model-id> | ... | ... | pass/fail | ... | | Image-to-video (i2v) | aliyun-wan-video | <model-id> | ... | ... | pass/fail | ... | | Reference-to-video (r2v) | aliyun-wan-r2v | <model-id> | ... | ... | pass/fail | ... | | TTS | aliyun-qwen-tts | <model-id> | ... | ... | pass/fail | ... | | ASR (non-realtime) | aliyun-qwen-asr | <model-id> | ... | ... | pass/fail | ... | | Realtime ASR | aliyun-qwen-asr-realtime | <model-id> | ... | ... | pass/fail | ... | | Realtime TTS | aliyun-qwen-tts-realtime | <model-id> | ... | ... | pass/fail | ... | | Live speech translation | aliyun-qwen-livetranslate | <model-id> | ... | ... | pass/fail | ... | | CosyVoice voice clone | aliyun-cosyvoice-voice-clone | <model-id> | ... | ... | pass/fail | ... | | CosyVoice voice design | aliyun-cosyvoice-voice-design | <model-id> | ... | ... | pass/fail | ... | | Voice clone | aliyun-qwen-tts-voice-clone | <model-id> | ... | ... | pass/fail | ... | | Voice design | aliyun-qwen-tts-voice-design | <model-id> | ... | ... | pass/fail | ... | | Omni multimodal | aliyun-qwen-omni | <model-id> | ... | ... | pass/fail | ... | | Visual reasoning | aliyun-qvq | <model-id> | ... | ... | pass/fail | ... | | Text embedding | aliyun-qwen-text-embedding | <model-id> | ... | ... | pass/fail | ... | | Rerank | aliyun-qwen-rerank | <model-id> | ... | ... | pass/fail | ... | | Video editing | aliyun-wan-edit | <model-id> | ... | ... | pass/fail | ... | ``` ## Failure Handling - If parameters are unclear, check target sub-skill `SKILL.md` or `references/*.md`. - If model is unavailable, refresh model list and retry. - For auth issues, verify `DASHSCOPE_API_KEY` (env var or `~/.alibabacloud/credentials`). ## Validation ```bash mkdir -p output/aliyun-modelstudio-entry-test echo "validation_placeholder" > output/aliyun-modelstudio-entry-test/validate.txt ``` Pass criteria: command exits 0 and `output/aliyun-modelstudio-entry-test/validate.txt` is generated. ## Output And Evidence - Save artifacts, command outputs, and API response summaries under `output/aliyun-modelstudio-entry-test/`. - Include key parameters (region/resource id/time range) in evidence files for reproducibility. ## Workflow 1) Confirm user intent, region, identifiers, and whether the operation is read-only or mutating. 2) Run one minimal read-only query first to verify connectivity and permissions. 3) Execute the target operation with explicit parameters and bounded scope. 4) Verify results and save output/evidence files. ## References - Source list: `references/sources.md`