But... MP3 is (compressed) audio and MIDI isn't (it's notation, not actual sound). The very idea of converting one into the other is flawed: you would have to transcribe the recording back into notes, and pitch and transient detection are not advanced enough to do that properly. A frequency-domain/granular method would not yield the kind of result that would be useful in a music production environment either.
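
To make the "notation, not sound" point concrete, here is a minimal Python sketch of what a single MIDI note event looks like on the wire. The helper names `note_on` and `note_to_hz` are made up for illustration; the byte layout (status byte `0x90` plus channel, then note number, then velocity) and the A4 = note 69 = 440 Hz convention are standard MIDI, and equal temperament is assumed for the pitch.

```python
def note_on(note: int, velocity: int = 64, channel: int = 0) -> bytes:
    """Build a raw 3-byte MIDI note-on message (hypothetical helper)."""
    if not (0 <= note <= 127 and 0 <= velocity <= 127 and 0 <= channel <= 15):
        raise ValueError("note and velocity must be 0-127, channel 0-15")
    # Status byte 0x90 means "note on"; the low nibble carries the channel.
    return bytes([0x90 | channel, note, velocity])


def note_to_hz(note: int, a4: float = 440.0) -> float:
    """Nominal pitch of a MIDI note number, assuming equal temperament."""
    # Note 69 is A4; each semitone is a factor of 2**(1/12).
    return a4 * 2 ** ((note - 69) / 12)


msg = note_on(69, velocity=100)
print(msg.hex(), len(msg), note_to_hz(69))  # 904564 3 440.0
```

The whole event is three bytes saying "play A4 this hard on channel 0". There is no waveform anywhere in it; what it actually sounds like depends entirely on the synth or sampler that receives it, which is why going from recorded audio back to this form means transcription rather than file conversion.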
