Tutorial · 7 min read

How to split stems from any song

What is stem splitting?

Stem splitting (also called source separation) is the process of isolating individual instruments from a mixed audio track. Upload a finished song and the AI extracts separate files for vocals, drums, bass, and other instruments. Each "stem" can be used independently — mute the vocals for a karaoke track, isolate the drums for sampling, or extract the bass line for a remix.

Separation of this quality was not practical until a few years ago. Traditional audio processing could only boost or cut frequency ranges, which left artifacts and bleed whenever instruments overlapped in frequency. Modern AI models (like Meta's Demucs and its hybrid transformer variant, HTDemucs) use deep neural networks trained on large collections of songs with known stems to separate overlapping sounds, even when instruments share the same frequency range.

How to split stems with Echo's browser tool

Step 1: Open voicechanger.live/tools/stem-splitter in your browser. No download, no account, no installation required.

Step 2: Upload your audio file. The tool supports MP3, WAV, FLAC, and OGG formats. The file size limit is 50MB, which is roughly 5 minutes of uncompressed CD-quality WAV and considerably more for compressed formats such as MP3.

Step 3: Select your separation mode. "4-stem" separates into vocals, drums, bass, and other instruments. "2-stem" separates into vocals and instrumental only (faster processing, useful when you just need a karaoke track).

Step 4: Wait for processing. The AI analyzes the entire track and generates separated stems. Processing time varies with track length and, because everything runs locally, with the speed of your device; a 3-minute song typically takes 30-60 seconds. All processing happens in your browser using WebAssembly, so the file you selected in Step 2 stays on your device and is never uploaded to any server.

Step 5: Download your stems. Each stem is available as a separate WAV file. Play them back individually to verify quality, then download the ones you need.
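
How much audio fits under the 50MB limit from Step 2 depends on the file's format and bitrate. The following is a rough back-of-the-envelope sketch, assuming a constant bitrate and 1 MB = 1,000,000 bytes; the function name is illustrative and not part of Echo's tool.

```python
def minutes_that_fit(limit_mb: float, kbps: float) -> float:
    """Minutes of audio that fit in limit_mb megabytes at a constant bitrate."""
    limit_bits = limit_mb * 1_000_000 * 8  # assumption: 1 MB = 1,000,000 bytes
    seconds = limit_bits / (kbps * 1000)   # kbps = thousands of bits per second
    return seconds / 60


# CD-quality WAV: 44,100 samples/s x 16 bits x 2 channels = 1411.2 kbps
WAV_CD_KBPS = 44_100 * 16 * 2 / 1000

for label, kbps in [
    ("MP3 at 128 kbps", 128),
    ("MP3 at 320 kbps", 320),
    ("WAV, 16-bit / 44.1 kHz stereo", WAV_CD_KBPS),
]:
    print(f"{label}: about {minutes_that_fit(50, kbps):.1f} min")
```

Under these assumptions, 50MB holds about 52 minutes of 128 kbps MP3, about 21 minutes of 320 kbps MP3, and just under 5 minutes of CD-quality WAV. FLAC lands somewhere in between, depending on how well the material compresses.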

Use cases for producers and DJs

Remixing: Extract the vocal from a song and place it over your own instrumental. This is the foundation of remix culture — and AI stem splitting makes it accessible to anyone, not just producers with access to the original multitrack recordings.

Sampling: Isolate a drum break, bass line, or melodic phrase from any recording. Clean stems mean cleaner samples — no bleed from other instruments contaminating your chops.

DJ mashups: Create acapellas and instrumentals from any track for live DJ sets. Blend vocals from one song over the instrumental of another. Previously this required finding official acapella releases — now you can create them from any song.

Practice and learning: Isolate the guitar part from a song to learn it by ear. Mute the drums and play along with the rest of the band. Music teachers use stem splitting to create custom practice tracks for students.

Karaoke: Remove the vocals from any song to create an instant karaoke track. The AI separation is clean enough for casual karaoke, with far less vocal bleed and fewer artifacts than older vocal removal tools.
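
The remix and mashup use cases above come down to layering one stem over another. Here is a minimal sketch of that step, assuming both tracks are already decoded to lists of float samples in the range -1.0 to 1.0 at the same sample rate. The function name and gain parameters are illustrative; a DAW or DJ software does this for you.

```python
def mix(vocal, instrumental, vocal_gain=1.0, inst_gain=1.0):
    """Layer a vocal stem over an instrumental, sample by sample.

    The shorter track is padded with silence. If the summed signal
    exceeds full scale, the whole mix is scaled down to avoid clipping.
    """
    n = max(len(vocal), len(instrumental))
    out = []
    for i in range(n):
        v = vocal[i] if i < len(vocal) else 0.0
        b = instrumental[i] if i < len(instrumental) else 0.0
        out.append(v * vocal_gain + b * inst_gain)

    peak = max((abs(s) for s in out), default=0.0)
    if peak > 1.0:
        out = [s / peak for s in out]  # normalize so the loudest sample is 1.0
    return out


print(mix([0.5, 0.5], [0.25]))  # [0.75, 0.5]
print(mix([1.0], [1.0]))        # [1.0] after peak normalization
```

This only covers the summing. For a mashup to sound right, the two tracks also need to be matched in tempo and key before they are layered.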

Tips for best separation quality

Source quality matters: Higher quality input produces better separation. Use lossless formats (WAV, FLAC) when available. Heavily compressed MP3 files (128kbps or lower) produce noticeably worse results because the compression has already destroyed audio detail that the AI needs for separation.

Song complexity: Simple arrangements (vocals + guitar, electronic music with clean synthesis) separate better than dense orchestral arrangements or heavily layered production. The AI handles most modern pop, rock, hip-hop, and electronic music very well.

Post-processing: The separated stems may have subtle artifacts — slight reverb tails, quiet bleed from other instruments. For professional use, apply a noise gate to clean up the quiet sections and use EQ to remove any remaining bleed frequencies.
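
To make the noise gate idea concrete, here is a minimal sketch of a hard gate: it measures the level of each short window and silences windows that fall below a threshold. It assumes float samples in the range -1.0 to 1.0, and the threshold and window size are illustrative defaults rather than recommended settings. Gate plugins in a DAW add attack and release smoothing so the cut-offs do not click.

```python
import math


def noise_gate(samples, threshold=0.02, window=512):
    """Silence every window whose RMS level falls below threshold."""
    out = []
    for start in range(0, len(samples), window):
        chunk = samples[start:start + window]
        rms = math.sqrt(sum(s * s for s in chunk) / len(chunk))
        if rms >= threshold:
            out.extend(chunk)                 # loud enough: keep as-is
        else:
            out.extend([0.0] * len(chunk))    # quiet bleed: mute the window
    return out


# Four samples of faint bleed followed by four samples of real signal.
stem = [0.001] * 4 + [0.5] * 4
print(noise_gate(stem, threshold=0.02, window=4))
# [0.0, 0.0, 0.0, 0.0, 0.5, 0.5, 0.5, 0.5]
```

A gate only helps in passages where the wanted instrument is silent. Bleed that sits underneath the instrument while it is playing needs EQ or manual editing instead.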

Stem splitting vs. vocal removal

Vocal removal (voicechanger.live/tools/vocal-remover) is a subset of stem splitting — it extracts just the vocals and instrumental. If you only need a karaoke track or an acapella, vocal removal is faster because it runs a simpler 2-stem model.

Full stem splitting gives you 4 or more individual stems, which is essential for production work. Use vocal removal for quick karaoke tracks and stem splitting for serious music production, remixing, and sampling.

Frequently asked questions

Is stem splitting free?
Yes. Echo's browser-based stem splitter at voicechanger.live/tools/stem-splitter is completely free. No account, no download, and no usage limits beyond the 50MB per-file cap. Processing happens in your browser, so your audio never leaves your device.

How good is AI stem separation?
Modern AI models separate most songs cleanly. Vocals, drums, and bass usually come out well. Complex instrumental arrangements may have minor artifacts, but the quality is generally sufficient for remixing and sampling, and with some cleanup for professional production work.

Is it legal to split stems from copyrighted songs?
Splitting stems from a file you legally possess is generally treated as ordinary audio processing. However, using the separated stems in commercial releases typically requires licensing from the original rights holders. Fair use may cover practice, education, and some transformative works, but it depends on jurisdiction and context. Consult a music attorney for commercial projects.

What is the difference between stems and multitracks?
Multitracks are the original individual recordings from a studio session. In this context, stems are AI-reconstructed approximations extracted from a final mix. Multitracks are always higher quality, but stems can be produced for any song, with no need for access to the original session files.
