n8n-nodes-musicgpt
Official n8n community node for MusicGPT – AI-powered audio processing, music generation, and voice transformation.
Features
- 🎵 Audio Generation: Create music from text prompts or lyrics
- 🎙️ Text-to-Speech: Convert text to natural-sounding speech in 10+ languages
- 🎤 Voice Transformation: Change voices in audio files with 100+ voice options
- 🎸 Stem Separation: Extract vocals, instruments, and 20+ individual stems
- 🎛️ Audio Processing: Denoise, deecho, dereverb, master, trim, speed, convert
- 🎹 Advanced Generation: Remix, extend, inpaint, add vocals, generate sound effects
- 🔍 Voice Search: Find and list available voices dynamically
- ⚡ Triggers: Monitor conversions and automate workflows
- 🌐 Webhook Support: Async processing with callbacks
- 📦 Binary File Handling: Process files from previous nodes seamlessly
Quick Start
Installation
Via n8n Community Nodes (Recommended)
- Open your n8n instance
- Go to Settings → Community Nodes
- Click Install
- Enter: n8n-nodes-musicgpt
- Click Install
Via npm (Self-Hosted)
cd ~/.n8n/nodes
npm install n8n-nodes-musicgpt
Setup Credentials
- Get your API key from musicgpt.com
- In n8n, go to Credentials → New
- Search for "MusicGPT API"
- Enter your API key
Your First Workflow
Manual Trigger →
MusicGPT (Generate Audio):
- Prompt: "Upbeat electronic dance music"
→ Wait (45 seconds) →
MusicGPT (Get Conversion by ID) →
Process result
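A fixed 45-second Wait works when processing time is predictable. If you call the API from your own script instead, a polling loop is one alternative. This is a minimal sketch, not the node's implementation: `fetch_status` is a hypothetical callable standing in for Get Conversion by ID, the `"completed"` status comes from the callback payload shown in this README, and the `"failed"` status is an assumption.

```python
import time
from typing import Callable, Dict


def wait_for_conversion(
    fetch_status: Callable[[str], Dict],
    task_id: str,
    interval: float = 5.0,
    timeout: float = 120.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict:
    """Poll until the conversion is done, has failed, or the timeout passes.

    fetch_status is a hypothetical stand-in for "Get Conversion by ID".
    """
    waited = 0.0
    while True:
        result = fetch_status(task_id)
        status = result.get("status")
        if status == "completed":
            return result
        if status == "failed":  # assumed failure value, not confirmed by the docs
            raise RuntimeError(f"conversion {task_id} failed")
        if waited >= timeout:
            raise TimeoutError(f"conversion {task_id} not ready after {waited:.0f}s")
        sleep(interval)
        waited += interval
```

The `sleep` parameter is injectable so the loop can be exercised without real delays.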
Operations Overview
📦 What's Included
- 1 Action Node: MusicGPT (25 operations across 4 resources)
- 1 Trigger Node: MusicGPT Trigger (2 trigger types)
Audio Generation (6 operations)
| Operation | Description | Processing Time |
|---|---|---|
| Generate Audio | Create music from prompts or lyrics | 30-60s |
| Text to Speech | Convert text to speech (10+ languages) | 15-25s |
| Voice Changer | Transform voices in audio | 20-40s |
| Cover Song | Create AI covers with different voices | 40-70s |
| Extraction | Separate 20+ stems (vocals, instruments, etc.) | 40-90s |
| Denoise | Remove background noise | 15-25s |
Audio Processing (9 operations)
| Operation | Description | Processing Time |
|---|---|---|
| Deecho | Remove echo | 20-30s |
| Dereverb | Remove reverb | 20-30s |
| Audio Mastering | Master to professional quality | 30-50s |
| Trim Audio | Cut to time range | 5-15s |
| Change Speed | Adjust playback speed | 10-20s |
| Audio to MIDI | Convert to MIDI notation | 30-45s |
| Extract Key/BPM | Detect key and tempo | 10-20s |
| File Convert | Convert formats/sample rates | 15-30s |
| Transcribe | Speech to text (10+ languages) | 15-40s |
Advanced Generation (6 operations)
| Operation | Description | Processing Time |
|---|---|---|
| Remix | AI-powered remixing | 60-120s |
| Extend | Extend audio duration | 60-100s |
| Inpaint | Replace audio sections | 40-70s |
| Sing Over Instrumental | Add vocals to instrumentals | 60-100s |
| Sound Generator | Generate sound effects | 20-40s |
| Prompt to Lyrics | Generate lyrics | 5-15s |
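The processing times in the tables above can be used to size a Wait step instead of hard-coding one value for every operation. The sketch below copies the documented ranges into a lookup table; `suggested_wait` and its `margin` parameter are illustrative names, not part of the node.

```python
# Processing-time ranges in seconds, copied from the operation tables in this README.
PROCESSING_SECONDS = {
    "Generate Audio": (30, 60),
    "Text to Speech": (15, 25),
    "Voice Changer": (20, 40),
    "Cover Song": (40, 70),
    "Extraction": (40, 90),
    "Denoise": (15, 25),
    "Deecho": (20, 30),
    "Dereverb": (20, 30),
    "Audio Mastering": (30, 50),
    "Trim Audio": (5, 15),
    "Change Speed": (10, 20),
    "Audio to MIDI": (30, 45),
    "Extract Key/BPM": (10, 20),
    "File Convert": (15, 30),
    "Transcribe": (15, 40),
    "Remix": (60, 120),
    "Extend": (60, 100),
    "Inpaint": (40, 70),
    "Sing Over Instrumental": (60, 100),
    "Sound Generator": (20, 40),
    "Prompt to Lyrics": (5, 15),
}


def suggested_wait(operation: str, margin: float = 0.0) -> float:
    """Return the upper bound of the documented range, scaled by an optional margin."""
    _, high = PROCESSING_SECONDS[operation]
    return high * (1.0 + margin)
```

For example, `suggested_wait("Remix", margin=0.25)` adds 25% headroom on top of the documented 120-second upper bound.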
Searches (4 operations)
| Operation | Description |
|---|---|
| Get All Voices | List available voices with pagination |
| Search Voices | Search voices by name |
| Get Conversion by ID | Check status and retrieve results |
| Get Conversions by User | Get user history with filters |
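Get All Voices returns results in pages, so listing every voice means requesting pages until one comes back short. A minimal sketch of that loop, assuming a hypothetical `fetch_page(page, limit)` callable that wraps the operation and returns a list of voice records; the parameter names and zero-based page index are assumptions, not the documented API.

```python
from typing import Callable, Dict, Iterator, List


def iter_voices(
    fetch_page: Callable[[int, int], List[Dict]],
    limit: int = 20,
) -> Iterator[Dict]:
    """Yield voices page by page until a short or empty page signals the end."""
    page = 0  # assumed zero-based; adjust if the API counts from 1
    while True:
        voices = fetch_page(page, limit)
        yield from voices
        if len(voices) < limit:
            return
        page += 1
```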
Triggers (2 types)
| Trigger | Description |
|---|---|
| Get Conversion | Poll a specific conversion until it completes |
| New Conversions | Trigger on new user conversions |
Documentation
| Resource | Description |
|---|---|
| Full Documentation | Complete API reference with parameters, examples, and patterns |
| Quick Reference | Fast lookup for all operations and parameters |
Usage Examples
Example 1: Text-to-Speech Podcast Generator
Schedule Trigger (weekly) →
Fetch Script (HTTP Request) →
Text to Speech:
- Text: {{ $json.script }}
- Language: English
- Voice: Professional narrator
→ Wait (20s) →
Get Conversion →
Upload to Podcast Host →
Notify Team (Slack)
Example 2: AI Cover Song API
Webhook (receive song URL + artist) →
Search Voices (find artist voice) →
Cover Song (generate with voice) →
Respond (task_id)
(Separate Webhook)
Completion Callback →
Upload to Storage →
Notify User
Example 3: Audio Processing Pipeline
File Upload Trigger →
Parallel Processing:
├─ Denoise
├─ Extract Key/BPM
└─ Transcribe
→ Merge Results →
Audio Mastering →
Save to Database
Example 4: Music Production Workflow
Manual Trigger →
Generate Audio (from lyrics) →
Wait (45s) →
Get Conversion →
Extraction (separate stems) →
Wait (60s) →
Remix (transform style) →
Download & Archive
Example 5: Automated Content Creation
MusicGPT Trigger (New Conversions) →
Filter (music_generation, completed) →
Auto-Master Audio →
Convert to Multiple Formats →
Upload to CDN →
Update Database →
Email Notification
Supported Formats & Features
Audio Formats
- Input: MP3, WAV, FLAC, OGG, AAC, WEBM, M4A
- Output: MP3, WAV, FLAC, OGG, AAC, WEBM
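Note that M4A is accepted as input but is not listed as an output format. A client-side check before calling File Convert can catch this early. The sketch below only encodes the two lists above; `check_conversion` is an illustrative helper, not part of the node.

```python
from pathlib import Path

# Format lists copied from this README.
INPUT_FORMATS = {"mp3", "wav", "flac", "ogg", "aac", "webm", "m4a"}
OUTPUT_FORMATS = {"mp3", "wav", "flac", "ogg", "aac", "webm"}


def check_conversion(input_name: str, output_format: str) -> None:
    """Raise ValueError if the input extension or output format is not listed."""
    extension = Path(input_name).suffix.lstrip(".").lower()
    if extension not in INPUT_FORMATS:
        raise ValueError(f"unsupported input format: {extension or '(none)'}")
    if output_format.lower() not in OUTPUT_FORMATS:
        raise ValueError(f"unsupported output format: {output_format}")
```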
Languages (TTS & Transcription)
English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean, Auto-detect
Available Stems (Extraction)
- Vocals: vocals, male_vocal, female_vocal, lead_vocal, back_vocal
- Rhythm: drums, kick, snare, toms, hi-hat, ride, crash
- Strings: guitar, bass, rhythm_guitar, solo_guitar, acoustic_guitar, electric_guitar
- Keys: piano, keys, strings, winds
- Full: instrumental (all instruments)
Sample Rates
8kHz, 16kHz, 22.05kHz, 24kHz, 32kHz, 44.1kHz (CD), 48kHz (Pro), 96kHz (Hi-Res), 192kHz (Ultra)
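If a workflow receives an arbitrary sample rate, it may help to snap it to the nearest listed value before requesting a conversion. This README does not say how the API treats unlisted rates, so the helper below is a client-side sketch only; `nearest_supported_rate` is an illustrative name.

```python
# Sample rates in Hz, from the list above.
SUPPORTED_RATES_HZ = (8000, 16000, 22050, 24000, 32000, 44100, 48000, 96000, 192000)


def nearest_supported_rate(rate_hz: int) -> int:
    """Return the listed rate closest to rate_hz (the lower one on a tie)."""
    return min(SUPPORTED_RATES_HZ, key=lambda supported: abs(supported - rate_hz))
```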
Advanced Features
Webhook Callbacks
For long operations, pass a webhook URL so the request is processed asynchronously:
{
"webhook_url": "https://your-webhook.com/callback"
}
The webhook URL then receives a completion notification:
{
"event": "conversion.completed",
"task_id": "task_abc123",
"status": "completed",
"output_url": "https://cdn.musicgpt.com/result.mp3"
}
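A callback handler should check the payload before acting on it. The sketch below parses a body shaped like the example above into a typed record. It assumes `event`, `task_id`, and `status` are always present and treats `output_url` as optional, since a failed conversion may not include one; neither assumption is confirmed by this README.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass
class ConversionCallback:
    event: str
    task_id: str
    status: str
    output_url: Optional[str] = None


def parse_callback(body: str) -> ConversionCallback:
    """Parse a callback body shaped like the example payload."""
    data = json.loads(body)
    # Required fields are an assumption based on the single example payload.
    missing = [key for key in ("event", "task_id", "status") if key not in data]
    if missing:
        raise ValueError(f"callback is missing fields: {', '.join(missing)}")
    return ConversionCallback(
        event=data["event"],
        task_id=data["task_id"],
        status=data["status"],
        output_url=data.get("output_url"),
    )
```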
Binary File Processing
Process files from previous nodes:
HTTP Request (download) →
└─ Response: File
MusicGPT:
└─ Audio Source: File
└─ Binary Property: "data"
Dynamic Voice Loading
Voice selection fields dynamically load 100+ available voices from the API, with search and filter capabilities.
Troubleshooting
| Issue | Solution |
|---|---|
| "Authentication failed" | Verify API key in credentials |
| "Conversion not found" | Increase the wait time before fetching the result (operations take 5-120s) |
| "Invalid voice_id" | Use "Get All Voices" to see valid IDs |
| "File too large" | Compress or split audio files |
| "Rate limit exceeded" | Add delays between requests or upgrade plan |
| "Binary data error" | Ensure previous node outputs file data |
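For "Rate limit exceeded", one common way to add delays between requests is exponential backoff. The sketch below is generic rather than specific to MusicGPT: `RateLimitError` is a hypothetical exception that your own request function would raise on a rate-limit response, and the delay values are illustrative.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical error raised by the caller's request function when rate limited."""


def with_backoff(
    request: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
    jitter: Callable[[], float] = random.random,
) -> T:
    """Retry request() on RateLimitError, doubling the delay after each failure."""
    for attempt in range(max_attempts):
        try:
            return request()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # out of attempts, surface the last error
            # Delay grows as base, 2*base, 4*base, ... plus up to 1s of jitter.
            sleep(base_delay * (2 ** attempt) + jitter())
    raise RuntimeError("max_attempts must be at least 1")
```

The jitter spreads out retries when several workflows hit the limit at the same moment.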
Contributing
Contributions are welcome! See the GitHub repository for development setup and guidelines.
Support
- 📖 Documentation: DOCUMENTATION.md
- ⚡ Quick Reference: API-REFERENCE.md
- 💬 Community: n8n Forum
- 📧 Email: support@musicgpt.com
Resources
- MusicGPT API Docs: docs.musicgpt.com
- n8n Documentation: docs.n8n.io
- Creating n8n Nodes: docs.n8n.io/integrations/creating-nodes
- n8n Community: community.n8n.io
License
MIT License – see LICENSE file for details.
Credits
- MusicGPT API: musicgpt.com
- n8n Platform: n8n.io
Made with ❤️ for the n8n community