
I ran the same Pixabay track through LALAL.AI, Moises, vocalremover.org, Voice.ai, Fadr, UVR, and my own AI Stem Splitter. Here is the honest, headphone-tested comparison plus a step-by-step guide for getting clean six-stem output.
I built a vocal remover. Before I shipped a single line of inference code, I spent a weekend running the same song through every tool I could find a credit card for — and a couple I didn't need one for.
The results were not what the SEO articles promised.
This is the comparison I wish had existed at the start. Seven tools, one track, headphones, and a notebook full of complaints.
One reference track: inaya_official's "Sunlight" on Pixabay — royalty-free, has a clean lead vocal over warm pads, drums, bass, and a piano fill on the bridge.
I picked it because the piano in the bridge is the hardest thing for any separator to handle. If a tool calls itself a "stem splitter" but smears the piano into "other," I want to hear it.
Every tool got the same input: a 3:14 WAV at 44.1 kHz / 16-bit. I listened on Sony MDR-7506s, mono summed where the tool only offered MP3.
I was not gentle.
Upload, pay, download. Their UI is the cleanest in this list, and their per-stem isolation tab is genuinely well thought out.
But two things bit me.
The free tier gives you 10 minutes once, ever — not 10 minutes per month, not 10 minutes per song. Once. The moment you hit it you're on a credit pack.
And their default 2-stem output (vocals vs instrumental) sounded oddly hollow on the vocal stem. Switching to their "Phoenix" model fixed most of it, but I had to know to do that.
Good for: people who want a clean dashboard and don't care about price. Bad for: anyone testing more than one song before committing.
Moises wants to be your practice studio, not just a separator. There's a chord detector, a pitch shifter, a tempo slider, lyric overlays.
For practicing along to a song, that's great. For extracting stems and getting out, it's friction.
The free plan caps you at 5 separations per month, 2 stems each. To get 4 stems you're on the Premium plan at $3.99/month billed yearly. Cancel mid-year and you keep paying.
The 5-stem output (vocals/drums/bass/piano/other) on Premium is clean. The catch is that you have to like Moises enough to subscribe before you can hear it on your own song.
The brand that ranks #1 for "vocal remover" on Google. So I had high hopes.
Upload was instant, processing was fast, output was — fine. Just fine.
Their Splitter AI sub-product gives you up to a 5-stem split, free, no signup for the first run. That's the killer feature. Quality-wise it's clearly a Spleeter-lineage model: bleed on the bass, drums that sound a little smushed, a vocal stem that's clean enough for a karaoke night but not for a remix you'd publish.
Honest take: if you're cutting a karaoke backing track for a family party tonight, this is the tool. Stop reading and go use it. It's free, it works, you don't need a login.
For anything you'd loop on stage or in a DAW, keep going.
Their landing page literally says "The Best AI Stem Splitter on the Market."
It is not.
The vocal isolation was acceptable. The drums were soft and pumping. The "other" stem ate the piano whole — exactly the failure mode I was watching for.
What surprised me more was the workflow. To download the result I had to install their desktop app. On a separator that is positioned as a web tool, that broke the deal for me.
I uninstalled it 8 minutes after install.
Fadr is the only tool in this list that markets itself to producers and DJs first, casual users second. 16 stem types, MIDI export for vocals/bass/drums, a DAW plugin, a /dj page.
The free tier is generous enough to actually evaluate. The 5-stem output on my test track was the second-best I heard, behind the htdemucs-class results below. The piano stayed in the piano stem. The bass stayed in the bass stem.
Paid tier is $50/year for "Plus." That's good value if you're using it weekly.
Caveat I didn't expect: the 16-stem mode is mostly creative slicing of the same underlying separation — guitar-electric vs guitar-acoustic vs guitar-lead, that kind of thing. Useful for sampling, not a true 16-way physical separation.
Free. Local. Open source. No upload, no privacy worry, no monthly fee.
This is the one most r/musicproduction threads recommend. And rightly so — if you can install Python, drop the right model files into the right folder, and survive a CUDA / MPS install fight on macOS, the quality you get out is on par with anything commercial.
But that's a real "if."
I got UVR running in about 50 minutes on a fresh M2 MacBook Air. A friend on Windows took two hours and a Stack Overflow detour. We both used the same Demucs htdemucs_ft model under the hood, and we both got beautiful stems.
Use UVR if: you're processing dozens of tracks, you care about privacy, you don't want to pay per minute, and you're OK reading installation docs.
Skip UVR if: you want stems in 60 seconds and you've never written pip install in anger.
Full disclosure: I built this. Skip this section if you want — but you came here for a comparison and I'm going to be honest about what it is and isn't.
It runs the same htdemucs model family as UVR uses. The model is from Meta AI, it won the Sony Music Demixing Challenge, and it's what every serious open-source tool reaches for. I did not build a "better model." I built a frictionless way to run that model.
Six stems out: vocals, drums, bass, guitar, piano, other. Browser only. WAV download.
On the test track it kept the piano in the piano stem. The vocal had no audible drum bleed on the chorus. Processing finished in under 60 seconds on a typical song length (the Pixabay test track came back in 41 seconds).
Where it's not the right pick:
You asked for the operation guide. Here's the full path from "I have a song" to "I have six stems."
Step 1. Go to aistemsplitter.org. Sign in with Google. You get 10 free minutes of processing — no credit card, no trial timer counting down in the background.
Step 2. Drag your audio file onto the uploader. Supported: MP3, WAV, FLAC, M4A. Up to ~100 MB or about 20 minutes per file.
Step 3. Pick a stem mode. 4-stem (vocals / drums / bass / other) is faster and matches what most other tools give you. 6-stem (adds guitar and piano) is the one to use if you care about the piano question I mentioned earlier.
Step 4. Press start. The progress bar moves in real time, not a fake 0-100 animation. A typical 3-4 minute song lands in under a minute on our infrastructure.
Step 5. When it's done, every stem gets its own player and its own download button. You can solo, mute, A/B against the original, or download all six as WAV in a zip.
At this point you should hear the vocal completely isolated when you solo it, and you should hear no vocal at all when you solo everything except vocal. If you don't, ping me on the support email — that's the failure mode we care most about.
Step 6 (optional). Out of free credits? The credit pack is $5 for 50 minutes ($0.10 per minute) and the credits never expire. There's no subscription. If you process one song this year, you've used 4 cents of that $5.
This is the table I wished I had at the start.
| Tool | Free output | Paid floor | Stems | Format | Speed | Honest verdict |
|---|---|---|---|---|---|---|
| LALAL.AI | 10 min lifetime | Credit packs from ~$15 | up to 10 | WAV / MP3 | ~1 min | Clean UI, painful free tier |
| Moises | 5 tracks/mo, 2-stem | $3.99/mo billed yearly | 5 on paid | MP3 (free), WAV (paid) | ~1-2 min | Great if you'll actually subscribe |
| vocalremover.org | Free, no signup, 5-stem | Membership tier | up to 5 | MP3 / WAV | <1 min | Best free option for karaoke |
| Voice.ai | Free with app install | n/a | 2-4 | App required | varies | Marketing > reality |
| Fadr | Generous free tier | $50/year | 16 (creative split) | WAV | ~1 min | Best for producers |
| UVR (local) | Free forever | Free | Model-dependent (often 6) | WAV | Your GPU | Best quality if you can install it |
| AI Stem Splitter | 10 min on signup | $5 / 50 min, never expires | 6 (htdemucs) | WAV | <1 min | Best fit for occasional users who want htdemucs without the install |
If you're cutting one karaoke track tonight and never doing this again: vocalremover.org. Stop reading.
If you can install Python and you're doing a lot of these: UVR. Nothing beats free and local.
If you're a producer who lives in a DAW: Fadr plus the plugin.
If you want six stems, you want them in under a minute, you want WAV, and you don't want to subscribe to anything: AI Stem Splitter is what I built for that exact case.
Three things, written down before I forget:
Most tools are racing on stem count. The marketing claim of "16 stems" sells. The actual quality difference between a good 4-stem split and a bad 16-stem split is enormous. Stem count is not quality.
The free tier is the product. Every paid plan I tried sold a quality I couldn't hear until I'd paid. The tools I trust are the ones that let me hear the output on my own song before any signup. That's a deliberate choice I made on my own site, and it's the single thing I'd change about every competitor in this list.
Speed without quality is worse than slow and clean. A 30-second separation that leaves drum bleed on the vocal stem is unusable. A 90-second separation that doesn't is gold. Watch what gets benchmarked.
That's the whole notebook.
The day you can drop a song into a web page and get six clean stems back, faster than it would have taken you to open your DAW, is here. It just isn't here equally for every tool.


A practical workflow for building 'everything except your instrument' backing tracks — covers model choice (4-stem vs 6-stem), per-instrument steps for voice, guitar, bass, drums, the songs that don't separate well, and how to slow them down.


Step-by-step guide to removing vocals from any song with AI. No software to install, no signup for your first try. Get a clean instrumental in under 90 seconds.


A practical comparison of three leading open-source audio separation models — covering SDR scores, inference cost, real-world latency, and when each one actually makes sense in production.
