Generate High-Quality Audio with Voxtral Small 24B 2507

Go to Workflow
93 views
Built by Yaron Been Yaron Been
Created on June 09, 2026

Description

Generate High-Quality Audio with Voxtral Small 24B 2507

This workflow integrates the notdaniel/voxtral-small-24b-2507 model from Replicate to generate audio content from provided inputs. It handles API authentication, creates predictions, polls until completion, and outputs the final generated audio file.

⚑ Section 1: Trigger & Authentication

πŸ”˜ On clicking 'execute'** β†’ Manually starts the workflow.
πŸ”‘ Set API Key** β†’ Stores your Replicate API key to authenticate requests.

Benefit: Securely connects your workflow to Replicate’s API and ensures only authorized requests are made.

πŸŽ›οΈ Section 2: Create Prediction

🌐 Create Prediction** β†’ Sends a request to Replicate’s API with parameters like:

audio: Input audio file (e.g., a reference sample).
max\_new\_tokens: Maximum number of tokens to generate (controls audio length/complexity).

Benefit: Starts the audio generation process with configurable input and settings.

⏳ Section 3: Polling & Status Tracking

πŸ†” Extract Prediction ID** β†’ Captures the unique prediction ID and endpoint for polling.
⏱️ Wait** β†’ Pauses for 2 seconds before re-checking.
πŸ“‘ Check Prediction Status** β†’ Polls Replicate’s API to see if the audio generation is done.
βœ… Check If Complete** β†’

If finished: moves forward to process results.
If not: loops back to wait and check again.

Benefit: Efficiently manages asynchronous audio generation, ensuring the workflow only proceeds when results are ready.

🎧 Section 4: Process Result

πŸ“ Process Result** β†’ Extracts and structures final output data:

status (success or failure)
output (raw response)
metrics (generation statistics)
timestamps (created and completed times)
audio\_url (final generated audio link)

Benefit: Provides a clean, structured output that can be used in follow-up automations (e.g., sending audio to users, storing in a database, or sharing via email).

πŸ“Š Workflow Overview

| Section | Purpose | Key Nodes | Benefit |
| --------------------------- | ------------------------------- | ----------------------------------------------------------------------- | -------------------------------- |
| ⚑ Trigger & Authentication | Start workflow & authenticate | Manual Trigger, Set API Key | Secure execution |
| πŸŽ›οΈ Create Prediction | Submit audio generation request | Create Prediction | Start model processing |
| ⏳ Polling & Status Tracking | Monitor prediction progress | Extract Prediction ID, Wait, Check Prediction Status, Check If Complete | Ensures reliable completion |
| 🎧 Process Result | Format and deliver output | Process Result | Clean audio result ready for use |

βœ… Final Benefits

πŸ”’ Secure authentication with Replicate
πŸŽ›οΈ Flexible audio generation using voxtral-small-24b-2507
⏳ Reliable polling until results are ready
🎧 Clean and structured audio output

Nodes Used (2)

Code
n8n-nodes-base.code
HTTP Request
n8n-nodes-base.httpRequest