What is Audio to MIDI?
Audio to MIDI converts audio recordings into MIDI note data by detecting their pitch. Upload a vocal melody, guitar riff, or any monophonic audio and get a downloadable MIDI file you can import into any DAW.
Pitch detection runs on the YIN algorithm, which estimates fundamental frequency frame by frame and quantises each stable frame to the nearest semitone. Works best on monophonic sources (one note at a time): vocals, flute, solo guitar, whistled melodies. Polyphonic chords or drums give unpredictable results.
How to use
- Upload an audio file (WAV, MP3, or OGG) containing a monophonic melody — one note at a time works best.
- Adjust the minimum confidence threshold and note duration to filter out noise and short transients.
- Preview the detected notes on the piano roll, then download the result as a standard MIDI file.
When to use
- Capturing a vocal idea on your phone and turning it into a MIDI sketch in your DAW.
- Transcribing a flute or whistle melody into notation without picking it out by ear.
- Reusing an acoustic guitar lick as a synth line by re-triggering the MIDI with a different instrument.
Result
A singer records a vocal melody idea on their phone. They upload the MP3 here, the tool detects each sung note's pitch and timing, and exports a MIDI file they drag into Ableton Live to play back with a synth.
FAQ
- Why does the tool miss notes or detect random ones?
- Pitch detection struggles with breathy attacks, vibrato wider than a semitone, and overlapping notes. Raise the min-confidence slider to drop weak detections, and try a cleaner recording with less reverb. Sing or play one note at a time.
- Can it transcribe a full song with chords and drums?
- No. The algorithm assumes one pitch at a time. Multiple simultaneous notes confuse it and produce garbled output. Solo a single track (just vocals, just bass) before uploading. Polyphonic transcription needs a much heavier model.
- What audio formats can I upload?
- WAV, MP3, OGG, and FLAC all work. Decoding happens locally with the Web Audio API, so anything that plays on this device is accepted. Mono and stereo are both fine — stereo gets averaged to mono before analysis.
- Why are the detected notes shifted up or down by an octave?
- Pitch detectors sometimes lock onto a harmonic instead of the fundamental, producing octave errors. Use the Octave control to nudge the whole result up or down. Recordings with rich low frequencies (bass guitar) are most prone to this.
- Will the exported MIDI keep the original timing?
- Yes. Note start times and durations are written in milliseconds, so the MIDI matches the original tempo without snapping to a grid. If you want quantised notes, your DAW can apply that after import.
Related Tools
Spectrum Analyzer
Visualize audio frequency spectrum in real time
Mono to Stereo Converter
Convert mono audio files to stereo
Audio Bitrate Converter
Change audio file bitrate
Audio Crossfade
Crossfade between two audio tracks
Chorus Effect
Add chorus audio effect to sounds
Waveform Image Generator
Export audio waveform as an image