Google is working on generative AI soundtracks and dialogue for videos

Everyone knows sound is a critical component of most films and videos. After all, even when films were silent, there was still a musical accompanist letting the audience know how to feel.

This natural law still holds for the new crop of generative AI videos, which emerge eerily silent. That's part of why Google has been working on "video-to-audio" (V2A) technology, which "makes synchronized audiovisual generation possible." On Monday, Google's AI lab, DeepMind, shared its progress on generating such audio, including soundtracks and dialogue that automatically match up with AI-generated videos.

Google has been hard at work developing multimodal generative AI technology to compete with rivals. OpenAI has its AI video generator Sora (yet to be publicly released) and GPT-4o, which creates AI voice responses. Companies like Meta and Suno have been exploring AI-generated audio and music, but pairing audio with video is relatively new. ElevenLabs has a similar tool that matches audio to text prompts, but DeepMind says V2A is different because it doesn't require text prompts.

V2A can be paired with AI video tools like Google Veo, or applied to existing archival footage and silent films. It can be used for soundtracks, sound effects, and even dialogue. It works by using a diffusion model trained on visual inputs, natural language prompts, and video annotations to gradually refine random noise into audio that fits the tone and context of a video.

Google DeepMind says V2A can "understand raw pixels," so a text prompt isn't actually required to generate the audio, though providing one does improve accuracy. The model can also be prompted to make the tone of the audio sound positive or negative. Along with the announcement, DeepMind released some demo videos, including a dark, creepy hallway accompanied by horror music, a lone cowboy at sunset scored to a mellow harmonica tune, and an animated figure talking about its dinner.

V2A will include Google's SynthID watermarking as a safeguard against misuse, and DeepMind's blog post says the feature is still undergoing testing before it is released to the public.



from Mashable https://ift.tt/1S3WG8a
https://ift.tt/5w9f4Je
