Voice Isolator for Spoken Voice in Noisy Recordings

Have an interview, call, lecture, podcast clip, field recording, or voice note where speech is hard to hear? Upload the audio and isolate the spoken voice into a cleaner MP3.

A voice isolator separates human speech from background noise in an audio recording. For spoken voice, use it on interviews, calls, lectures, podcasts, field recordings, or voice notes. For songs and music vocals, use a vocal remover or stem splitter instead.

Audio files only · Up to 10 minutes · 1 credit per source second

Audio-only upload. 3 free minutes for eligible accounts. Credits are refunded if provider processing fails.

You can choose a file and preview it here, but the Voice Isolator job starts only after sign-in because it uses paid GPU processing.

Before: Original noisy recording

After: Isolated spoken voice

Real Replicate tests

Hear the noisy source and isolated voice

Four short noisy speech clips were processed with playmore/speech-enhancer on Replicate. Play each before and after pair to judge the cleanup.

Waveform comparison of a noisy bus speech clip before and after voice isolation.

SNR lift: +15.0 dB

Bus-noise speech cleanup

A low-SNR bus clip where traffic-like rumble sits under the spoken sentence.

Edinburgh noisy speech · bus 2.5 dB

Waveform comparison of a cafe-noise speech clip before and after voice isolation.

SNR lift: +7.3 dB

Cafe chatter cleanup

A cafe-background sample that tests whether speech stays intelligible after broad ambient noise is reduced.

Edinburgh noisy speech · cafe 7.5 dB

Waveform comparison of an office-noise speech clip before and after voice isolation.

SNR lift: +18.3 dB

Office-noise cleanup

A difficult office-noise recording where the model has to keep the sentence while removing room texture.

Edinburgh noisy speech · office 2.5 dB

Waveform comparison of a public-square speech clip before and after voice isolation.

SNR lift: +14.0 dB

Public-square cleanup

A longer public-square clip with heavy background noise around a single spoken voice.

Edinburgh noisy speech · public square 2.5 dB

Audio source: Cassia Valentini-Botinhao, Noisy speech database for training speech enhancement algorithms and TTS models, University of Edinburgh DataShare, CC BY 4.0. Enhanced outputs were generated with Replicate playmore/speech-enhancer.
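The page does not say how the SNR lift figures were measured. As a rough sketch of one common definition, assuming a time-aligned clean reference recording is available, the lift can be computed as the SNR of the enhanced output minus the SNR of the noisy source. The function names and the toy signals below are illustrative only.

```python
import math


def snr_db(clean, observed):
    """SNR in dB of `observed` measured against a time-aligned clean reference."""
    if len(clean) != len(observed):
        raise ValueError("signals must have the same length")
    signal_power = sum(c * c for c in clean)
    # Everything in `observed` that is not the clean signal counts as noise.
    noise_power = sum((o - c) ** 2 for c, o in zip(clean, observed))
    if signal_power == 0 or noise_power == 0:
        raise ValueError("SNR is undefined for silent or identical signals")
    return 10.0 * math.log10(signal_power / noise_power)


def snr_lift_db(clean, noisy, enhanced):
    """Improvement in SNR (dB) from the noisy source to the enhanced output."""
    return snr_db(clean, enhanced) - snr_db(clean, noisy)


# Toy signals (not real audio): the enhanced copy carries a tenth of the
# noise amplitude, which corresponds to a 20 dB lift.
clean = [1.0, -1.0, 1.0, -1.0]
noisy = [c + 0.5 for c in clean]
enhanced = [c + 0.05 for c in clean]
print(round(snr_lift_db(clean, noisy, enhanced), 1))
```

A number like this only summarizes noise energy. It does not capture artifacts or intelligibility, which is why listening to each before and after pair still matters.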

Use this voice isolator for spoken voice, not songs

Voice isolator searches mix two jobs: speech cleanup and music vocal removal. This page is for spoken voice in noisy recordings. If your source is a song, karaoke track, acapella request, or music vocal, use Vocal Remover instead.

Upload noisy audio and keep the workflow simple

Start with an audio file: MP3, WAV, FLAC, M4A, AAC, OGG, or WEBM. Voice Isolator v1 accepts files up to 50 MB and 600 seconds. Direct MP4 upload, URL fetching, and live microphone cleanup are outside this workflow.
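If you want to check a file before uploading, the limits above can be expressed as a small pre-flight check. This is a minimal sketch, not the service's own validation: the function name is hypothetical, the duration is assumed to be known already, and "50 MB" is read as 50 × 1024 × 1024 bytes.

```python
from pathlib import Path

ALLOWED_EXTENSIONS = {".mp3", ".wav", ".flac", ".m4a", ".aac", ".ogg", ".webm"}
MAX_SIZE_BYTES = 50 * 1024 * 1024  # assumption: "50 MB" interpreted as 50 MiB
MAX_DURATION_SECONDS = 600


def check_upload(filename, size_bytes, duration_seconds):
    """Return a list of problems; an empty list means the file fits the v1 limits."""
    problems = []
    extension = Path(filename).suffix.lower()
    if extension not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported file type: {extension or 'none'}")
    if size_bytes > MAX_SIZE_BYTES:
        problems.append("file is larger than 50 MB")
    if duration_seconds > MAX_DURATION_SECONDS:
        problems.append("audio is longer than 600 seconds")
    return problems


print(check_upload("interview.MP3", 10_000_000, 95.0))
print(check_upload("lecture.mp4", 10_000_000, 95.0))
```

The check looks only at the file extension, so it will not catch a mislabeled file. A real upload flow would likely inspect the container as well.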

Compare before and after voice isolation

Speech cleanup has to be heard. Use the before player for your original noisy recording, then compare it with the isolated spoken voice after processing. The side-by-side check helps you judge intelligibility, artifacts, and download readiness.

Download one isolated spoken-voice MP3

The result is one MP3 for the spoken voice, not a stem package, mixer session, or ZIP file. Use it for review, editing, transcription prep, podcast cleanup, or sharing a clearer version of a speech recording.

Know the credits before GPU processing starts

You can choose and preview a file on the page, but the paid job starts only after sign-in. Voice Isolator uses 1 credit per source second. Credits are refunded if provider submission, provider processing, or output finalization fails.

Clear v1 limits prevent wrong-tool starts

Voice Isolator is not real-time denoise for calls, OBS, Discord, Zoom, or Teams. It is not diarization, target-speaker extraction, forensic restoration, or overlapping-speaker separation. For video, extract the audio first, then upload the supported audio file.
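For the video case, one way to extract the audio locally is with ffmpeg. The sketch below assumes ffmpeg is installed, on your PATH, and built with the libmp3lame encoder. The helper names and the quality setting are illustrative choices, not requirements of Voice Isolator.

```python
import subprocess


def build_extract_command(video_path, audio_path):
    """Build an ffmpeg command that drops the video stream and writes MP3 audio."""
    return [
        "ffmpeg",
        "-i", video_path,      # input video file
        "-vn",                 # discard the video stream
        "-c:a", "libmp3lame",  # encode audio as MP3
        "-q:a", "2",           # high-quality variable bitrate
        audio_path,
    ]


def extract_audio(video_path, audio_path):
    """Run ffmpeg; raises CalledProcessError if the conversion fails."""
    subprocess.run(build_extract_command(video_path, audio_path), check=True)


print(" ".join(build_extract_command("talk.mp4", "talk.mp3")))
```

Calling extract_audio("talk.mp4", "talk.mp3") produces a file you can then upload, provided it stays within the 50 MB and 600 second limits.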

Powered by a speech enhancement model

This flow is separate from the music stem splitter. It sends the uploaded audio to Replicate playmore/speech-enhancer with the mossformer2_se_48k model, then finalizes the returned audio as an isolated-voice MP3 stored for download.

Voice Isolator FAQ

What is Voice Isolator for?

Voice Isolator extracts spoken voice from noisy recordings such as interviews, calls, lectures, podcasts, voice notes, and field audio. It is meant for speech cleanup, not music stem separation.

Can it remove vocals from songs?

No. This page is for spoken voice in noisy recordings. For songs, music vocals, karaoke, acapella, remix, or stem workflows, use Vocal Remover or Stem Splitter instead.

Which files can I upload?

V1 accepts audio files only: MP3, WAV, FLAC, M4A, AAC, OGG, and WEBM. Files must be 50 MB or smaller and no longer than 600 seconds.

Can I upload a video or paste a URL?

Not in v1. Voice Isolator does not support direct MP4/video upload or URL fetching. If your source is video, extract the audio first and upload a supported audio file.

How are credits calculated?

Voice Isolator uses the same audio rule as other processing flows: 1 credit equals 1 second of source audio. A 90-second recording uses 90 credits.
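The credit rule can be sketched as a one-line calculation. The page does not say how fractional seconds are billed, so rounding up to the next whole second is an assumption here, and the function name is hypothetical.

```python
import math

CREDITS_PER_SECOND = 1


def credits_for(duration_seconds):
    """Estimate credits for a recording: 1 credit per second of source audio."""
    if duration_seconds <= 0:
        raise ValueError("duration must be positive")
    # Assumption: a partial second is billed as a full second.
    return math.ceil(duration_seconds) * CREDITS_PER_SECOND


print(credits_for(90))   # the 90-second example from the answer above
print(credits_for(600))  # a recording at the 10-minute limit
```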

What happens if processing fails?

If provider submission, provider processing, or output finalization fails, the job is marked failed and the credits used for that recording are refunded. You can retry with the same file or a cleaner audio export.
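The refund behavior can be pictured as a small job lifecycle. This is an illustrative sketch only: it assumes credits are deducted up front and returned on failure, and every class, function, and status string below is hypothetical rather than the service's actual implementation.

```python
from dataclasses import dataclass
from enum import Enum


class Stage(Enum):
    SUBMISSION = "provider submission"
    PROCESSING = "provider processing"
    FINALIZATION = "output finalization"


@dataclass
class Account:
    credits: int


@dataclass
class Job:
    cost: int
    status: str = "pending"


def run_job(account, job, steps):
    """Charge the job, run each stage in order, and refund if any stage raises."""
    if account.credits < job.cost:
        raise ValueError("not enough credits")
    account.credits -= job.cost  # assumption: credits are charged up front
    for stage in Stage:  # Enum members iterate in definition order
        try:
            steps[stage]()
        except Exception:
            job.status = f"failed during {stage.value}"
            account.credits += job.cost  # full refund for this recording
            return job
    job.status = "succeeded"
    return job


def provider_error():
    raise RuntimeError("provider returned an error")


account = Account(credits=200)
steps = {Stage.SUBMISSION: lambda: None,
         Stage.PROCESSING: provider_error,
         Stage.FINALIZATION: lambda: None}
job = run_job(account, Job(cost=90), steps)
print(job.status, account.credits)
```

In this sketch a failed job leaves the balance where it started, so retrying the same recording costs no more than a single successful run.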

Can it separate multiple overlapping speakers?

No. V1 is for enhancing spoken voice in noisy audio, not diarization, target-speaker extraction, forensic restoration, or separating multiple people talking over each other in one recording.

Clean up the spoken voice in your noisy recording

Upload audio, compare before and after, then download the isolated MP3.

Simple pay-as-you-go pricing

First 3 minutes free · 50 minutes for $6.99 · 150 minutes for $15.00
