hey! i just made a tool to do this!
github.com nectarboy dsi-sound-converter
the sound format actually is ima adpcm at 16384 hz. reverse engineering the binary reveals the same step size table is used, and further examination confirms the ima adpcm algorithm is used for encoding and decoding...