Meta has launched its own AI music generator, called ‘MusicGen,’ trained on 10,000 licensed music tracks.
The AI music generator works much like Google’s MusicLM, producing a snippet of about 12 seconds of audio based on a text prompt. I experimented with the MusicLM model upon its initial release and found it’s fairly good at producing electronic music and synthwave, but not much else. MusicGen aims to be better across a wide variety of genres.
MusicGen was trained on 20,000 hours of music that includes 10,000 “high-quality” licensed tracks and 390,000 instrument-only tracks from ShutterStock and Pond5. While the model itself is open source, Meta has not provided the code it used to train the model. Instead, pre-trained models are available for download. The results from both MusicGen and MusicLM aren’t going to be putting musicians out of a job anytime soon.
Prompting a suitable piece of audio from a text-to-audio AI means knowing how to describe what you want to hear. Simple prompts like ‘ambient chiptune music’ are so open-ended that simply re-feeding the prompt to the music generator will produce wildly different songs with each generation.
Meanwhile, a prompt like “Slow tempo, bass-and-drums-led reggae song. Sustained electric guitar. High-pitched bongos with ringing tones. Vocals are relaxed with a laid-back feel, very expressive,” will help the language model build something that sounds very similar after each successful generation. As generative AI progresses, these language models will become better at producing sound that’s pleasing to the human ear, if a bit soulless.
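To make the difference between a vague and a detailed prompt concrete, here is a minimal Python sketch that assembles a descriptive prompt from a handful of fields. The `MusicPrompt` class and its field names are hypothetical, invented purely for illustration; they are not part of MusicGen, MusicLM, or any Meta or Google tooling. The sketch only builds the text string and does not call a model.

```python
from dataclasses import dataclass, field


@dataclass
class MusicPrompt:
    """Hypothetical container for the details a descriptive prompt can spell out."""
    genre: str
    tempo: str = ""
    instruments: list = field(default_factory=list)
    vocals: str = ""

    def to_text(self) -> str:
        # Lead with tempo and genre, e.g. "Slow tempo, ... reggae song".
        head = ", ".join(part for part in (self.tempo, self.genre) if part)
        sentences = [head[:1].upper() + head[1:]]
        # Give each instrument description its own short sentence.
        sentences.extend(i[:1].upper() + i[1:] for i in self.instruments)
        if self.vocals:
            sentences.append(f"Vocals are {self.vocals}")
        return ". ".join(sentences) + "."


# An open-ended prompt: only the genre is pinned down.
vague = MusicPrompt(genre="ambient chiptune music")

# A detailed prompt: tempo, instrumentation, and vocal style are all specified.
detailed = MusicPrompt(
    genre="bass-and-drums-led reggae song",
    tempo="slow tempo",
    instruments=[
        "sustained electric guitar",
        "high-pitched bongos with ringing tones",
    ],
    vocals="relaxed with a laid-back feel, very expressive",
)

print(vague.to_text())
print(detailed.to_text())
```

The more fields you fill in, the less the generator has to guess, which is why the second prompt tends to produce more consistent results from one run to the next.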
It also means that in the era of deepfake music, fakes are about to get even harder to distinguish as these models get more use. We’ve already seen viral tracks like “Heart On My Sleeve” dominate social media platforms like TikTok and YouTube. The Big Three labels are discussing new AI provisions to help combat deepfaked music that borrows musical concepts to create a five-finger discount mash-up of popular artists’ established sound.