The Swedish U-sound is very close to making a V-sound, but not entirely there. That's probably what's going on here. It's even a common misspelling among small children to accidentally add a V to the long U-sound, so even native little learners make that mistake. :)
EDIT: Besides, the TTS overemphasises the R a little bit. Perhaps that's a contributing factor.
I have played the audio on the word for ''moose'' (mobile isn't accent friendly, so incase lack of accents changes it drastically, I'm simply refraining) more times than I can count... but my ears just aren't catching it. What is the proper pronunciation of the 'g' in this instance? Sometimes I think I hear a very soft 'g' , others more of a long 'e' (ee/y), and I don't even know if either of those is remotely close.