> ai-task-hub
AI task hub for image analysis, background removal, speech-to-text, text-to-speech, markdown conversion, and async execute/poll/presentation orchestration. Use when users need hosted AI outcomes while host runtime manages identity, credits, payment, and risk control.
curl "https://skillshub.wtf/LeoYeAI/openclaw-master-skills/ai-task-hub?format=md"AI Task Hub
Formerly skill-hub-gateway.
Public package boundary:
- Only orchestrates
portal.skill.execute,portal.skill.poll, andportal.skill.presentation. - Does not exchange
api_keyoruserTokeninside this package. - Does not handle recharge or payment flows inside this package.
- Assumes host runtime injects short-lived task tokens and attachment URLs.
Chinese documentation: SKILL.zh-CN.md
When to Use This Skill
Use this skill when the user asks to:
- detect people, faces, hands, keypoints, or tags from images
- remove backgrounds or generate cutout/matting results for products or portraits
- transcribe uploaded audio into text (
speech to text,audio transcription) - generate speech from text input (
text to speech,voice generation) - convert uploaded files into markdown (
document to markdown) - start async jobs and check status later (
poll,check job status) - fetch rendered visual outputs such as
overlay,mask, andcutout - run embedding or reranking tasks for retrieval workflows
Common Requests
Example requests that should trigger this skill:
- "Detect faces in this image and return bounding boxes."
- "Tag this image and summarize the main objects."
- "Remove the background from this product photo."
- "Create a clean cutout from this portrait image."
- "Transcribe this meeting audio into text."
- "Generate speech from this paragraph."
- "Convert this PDF file into markdown."
- "Start this job now and let me poll the run status later."
- "Fetch overlay and mask files for run_456."
- "Generate embeddings for this text list and rerank the candidates."
Search-Friendly Capability Aliases
visionaliases: face detection, human detection, person detection, image taggingbackgroundaliases: remove background, background removal, cutout, matting, product-cutoutasraliases: speech to text, audio transcription, transcribe audiottsaliases: text to speech, voice generation, speech synthesismarkdown_convertaliases: document to markdown, file to markdown, markdown conversionpollaliases: check job status, poll long-running task, async run statuspresentationaliases: rendered output, overlay, mask, cutout filesembeddings/rerankeraliases: vectorization, semantic vectors, relevance reranking
Runtime Contract
Default API base URL: https://gateway-api.binaryworks.app
Action to endpoint mapping:
portal.skill.execute->POST /agent/skill/executeportal.skill.poll->GET /agent/skill/runs/:run_idportal.skill.presentation->GET /agent/skill/runs/:run_id/presentation
Auth Contract (Host-Managed)
Every request must include:
X-Agent-Task-Token: <jwt_or_paseto>
Recommended token claims:
sub(user_id)agent_uidconversation_idscope(execute|poll|presentation)expjti
CLI argument order for scripts/skill.mjs:
[agent_task_token] <action> <payload_json> [base_url]- If token arg is omitted, script reads
AGENT_TASK_TOKENfrom environment. - Host runtime should refresh and inject short-lived
AGENT_TASK_TOKENautomatically to avoid user-facing auth friction.
Payload Contract
portal.skill.execute: payload requirescapabilityandinput.payload.request_idis optional and passed through.portal.skill.pollandportal.skill.presentation: payload requiresrun_id.portal.skill.presentationsupportsinclude_files(defaults totrue).
Attachment normalization:
- Prefer explicit
image_url/audio_url/file_url. attachment.urlis mapped to target media field by capability.- Local
file_pathis disabled in the published package. - Host must upload chat attachments first, then pass URL fields.
- Example host upload endpoint:
/api/blob/upload-file.
Error Contract
- Preserve gateway envelope:
request_id,data,error. - Preserve
POINTS_INSUFFICIENTand pass througherror.details.recharge_url.
Bundled Files
scripts/skill.mjsscripts/agent-task-auth.mjsscripts/base-url.mjsscripts/attachment-normalize.mjsscripts/telemetry.mjs(compatibility shim)references/capabilities.jsonreferences/openapi.jsonSKILL.zh-CN.md
> related_skills --same-repo
> youtube-watcher
Fetch and read transcripts from YouTube videos. Use when you need to summarize a video, answer questions about its content, or extract information from it.
> youtube-transcript
Fetch and summarize YouTube video transcripts. Use when asked to summarize, transcribe, or extract content from YouTube videos. Handles transcript fetching via residential IP proxy to bypass YouTube's cloud IP blocks.
> youtube-auto-captions
youtube-auto-captions skill from LeoYeAI/openclaw-master-skills
> youtube
YouTube Data API integration with managed OAuth. Search videos, manage playlists, access channel data, and interact with comments. Use this skill when users want to interact with YouTube. For other third party apps, use the api-gateway skill (https://clawhub.ai/byungkyu/api-gateway).