For spoken voice in noisy recordings

Voice Isolator for Spoken Voice in Noisy Recordings

Have an interview, call, lecture, podcast clip, field recording, or voice note where speech is hard to hear? Upload the audio and isolate the spoken voice into a cleaner MP3.

A voice isolator separates human speech from background noise in an audio recording. For spoken voice, use it on interviews, calls, lectures, podcasts, field recordings, or voice notes. For songs and music vocals, use a vocal remover or stem splitter instead.

For songs and music vocals, use Vocal Remover / Stem Splitter

Audio files onlyUp to 10 minutes1 credit per source second

Audio-only upload. 10 free minutes for eligible accounts. Credits are refunded if provider processing fails.

Isolate spoken voice
Quality

Sign in to process audio

You can choose a file and preview it here, but the Voice Isolator job starts only after sign-in because it uses paid GPU processing.

Before

Original noisy recording

Original noisy recording

After

Isolated spoken voice

Isolated spoken voice
Sign in to isolate spoken voice
Real Replicate tests

Hear the noisy source and isolated voice

Four short noisy speech clips were processed with playmore/speech-enhancer on Replicate. Play each before and after pair to judge the cleanup.

Waveform comparison of a noisy bus speech clip before and after voice isolation.
SNR lift: +15.0 dB

Bus-noise speech cleanup

A low-SNR bus clip where traffic-like rumble sits under the spoken sentence.

Noisy source
Isolated voice
Edinburgh noisy speech · bus 2.5 dB
Waveform comparison of a cafe-noise speech clip before and after voice isolation.
SNR lift: +7.3 dB

Cafe chatter cleanup

A cafe-background sample that tests whether speech stays intelligible after broad ambient noise is reduced.

Noisy source
Isolated voice
Edinburgh noisy speech · cafe 7.5 dB
Waveform comparison of an office-noise speech clip before and after voice isolation.
SNR lift: +18.3 dB

Office-noise cleanup

A difficult office-noise recording where the model has to keep the sentence while removing room texture.

Noisy source
Isolated voice
Edinburgh noisy speech · office 2.5 dB
Waveform comparison of a public-square speech clip before and after voice isolation.
SNR lift: +14.0 dB

Public-square cleanup

A longer public-square clip with heavy background noise around a single spoken voice.

Noisy source
Isolated voice
Edinburgh noisy speech · public square 2.5 dB

Audio source: Cassia Valentini-Botinhao, Noisy speech database for training speech enhancement algorithms and TTS models, University of Edinburgh DataShare, CC BY 4.0. Enhanced outputs were generated with Replicate playmore/speech-enhancer.

01

Use this voice isolator for spoken voice, not songs

Voice isolator searches mix two jobs: speech cleanup and music vocal removal. This page is for spoken voice in noisy recordings. If your source is a song, karaoke track, acapella request, or music vocal, use Vocal Remover instead.

02

Upload noisy audio and keep the workflow simple

Start with an audio file: MP3, WAV, FLAC, M4A, AAC, OGG, or WEBM. Voice Isolator v1 accepts files up to 50 MB and 600 seconds. Direct MP4 upload, URL fetching, and live microphone cleanup are outside this workflow.

03

Compare before and after voice isolation

Speech cleanup has to be heard. Use the before player for your original noisy recording, then compare it with the isolated spoken voice after processing. The side-by-side check helps you judge intelligibility, artifacts, and download readiness.

04

Download one isolated spoken-voice MP3

The result is one MP3 for the spoken voice, not a stem package, mixer session, or ZIP file. Use it for review, editing, transcription prep, podcast cleanup, or sharing a clearer version of a speech recording.

05

Know the credits before GPU processing starts

You can choose and preview a file on the page, but the cost-incurring job starts after sign-in. Voice Isolator uses 1 credit per source second. Provider submission, provider failure, and output finalization failures refund credits.

06

Clear v1 limits prevent wrong-tool starts

Voice Isolator is not real-time denoise for calls, OBS, Discord, Zoom, or Teams. It is not diarization, target-speaker extraction, forensic restoration, or overlapping-speaker separation. For video, extract the audio first, then upload the supported audio file.

07

Powered by a speech enhancement model

This flow is separate from the music stem splitter. It sends the uploaded audio to Replicate playmore/speech-enhancer with the mossformer2_se_48k model, then finalizes the returned audio as an isolated-voice MP3 stored for download.

FAQ

Voice Isolator FAQ

What is Voice Isolator for?+

Voice Isolator extracts spoken voice from noisy recordings such as interviews, calls, lectures, podcasts, voice notes, and field audio. It is meant for speech cleanup, not music stem separation.

Can it remove vocals from songs?+

No. This page is for spoken voice in noisy recordings. For songs, music vocals, karaoke, acapella, remix, or stem workflows, use Vocal Remover or Stem Splitter instead.

Which files can I upload?+

V1 accepts audio files only: MP3, WAV, FLAC, M4A, AAC, OGG, and WEBM. Files must be 50 MB or smaller and no longer than 600 seconds.

Can I upload a video or paste a URL?+

Not in v1. Voice Isolator does not support direct MP4/video upload or URL fetching. If your source is video, extract the audio first and upload a supported audio file.

How are credits calculated?+

Voice Isolator uses the same audio rule as other processing flows: 1 credit equals 1 second of source audio. A 90-second recording uses 90 credits.

What happens if processing fails?+

Provider submission, provider failure, and output finalization failures mark the job failed and refund the credits used for that recording. You can retry with the same or a cleaner audio export.

Can it separate multiple overlapping speakers?+

No. V1 is for enhancing spoken voice in noisy audio, not diarization, target-speaker extraction, forensic restoration, or separating multiple people talking over each other in one recording.

Clean up the spoken voice in your noisy recording

Upload audio, compare before and after, then download the isolated MP3.

Isolate spoken voice
LogoAI Stem Splitter

Launch your next AI product faster with this template.

GitHubDiscordEmail
Product
  • Features
  • Pricing
  • FAQ
Free Tools
  • Key Finder
  • Nightcore Maker
  • Pitch Changer
  • Slowed Reverb Maker
  • TikTok Voice Generator
AI Tools
  • AI Vocal Removal
  • AI Acapella Extractor
  • YouTube & SoundCloud Vocal Remover
  • Karaoke Maker
  • AI Drum Remover
  • Voice Isolator
Alternatives
  • Lalal.ai alternative
  • Splitter.ai Alternative
Resources
  • Blog
  • API
Developers
  • API Reference
  • SDKs
  • Get API Key
Integrations
  • n8n integration
Trust
  • Stripe Climate
  • Product Hunt
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
BadgeBadge
BadgeBadge
BadgeBadge
BadgeBadge
© 2026 AI Stem Splitter All Rights Reserved.
LogoAI Stem Splitter
HomePricing
API Reference

REST endpoints, auth, callbacks, OpenAPI 3.1 spec.

SDKs

Seven first-party SDKs (Node, Python, Java, Go, PHP, Swift, Lua).

Get API Key

Mint a key in Settings → Developer.

Key Finder

Detect tempo and musical key — no signup

Nightcore Maker

Nightcore, daycore, or sped-up versions from a YouTube link or upload.

Pitch Changer

Shift pitch up or down without affecting tempo.

Slowed Reverb Maker

Slow + reverb edits for TikTok, Reels, and slowed playlists.

TikTok Voice Generator

Generate free AI voiceovers for short videos.

AI Vocal Removal

Remove vocals for karaoke tracks, quick acapellas, and six-stem previews from files or supported links

AI Acapella Extractor

Pull a clean acapella out of any song for a remix, mashup, or DJ edit.

YouTube & SoundCloud Vocal Remover

Paste a YouTube or SoundCloud link and split it into vocals, drums, bass, piano, guitar, and other stems

Karaoke Maker

Remove vocals from a song to make a clean instrumental backing track for sing-alongs, rehearsals, and karaoke nights

AI Drum Remover

Upload a song and download one drumless track — vocals, bass, and everything except the drums.

Voice Isolator

Extract spoken voice from noisy recordings, interviews, calls, and field audio.

Blog
Dashboard