Skip to main content
This skill is available as a machine-readable YAML playbook via the Framelane MCP server. Load it into any MCP-capable agent to get the complete workflow without writing integration code.

What this skill does

Submits a POST /v1/tasks/transcribe job via AssemblyAI, polls until complete, fetches the transcript JSON, and returns it. Word-level timestamps are always included so the output is ready to feed into word_animation in a render request.

Load via MCP

@framelane-skill:transcribe

Inputs

NameRequiredDescription
source_urlrequiredURL of the audio or video file
source_languageoptionalBCP-47 language code (e.g. en). Auto-detected if omitted.
speaker_labelsoptionalEnable speaker diarisation. Default: false.

Output

Returns transcript — the full AssemblyAI JSON including text, words[] (with text, start, end per word), and optionally utterances[] when speaker labels are enabled.