Advertisement

Close this content

Why people are worried about Microsoft's AI that makes photos come to life

Microsoft's new generative AI system has highlighted how advanced deepfake technology is becoming.

·Contributor

Updated 29 April 2024 at 7:01 am·5-min read

Watch: Microsoft's new 'deepfake' video generator in action

A new Microsoft generative AI system has highlighted how advanced deepfake technology is becoming - generating convincing video from a single image and audio clip.

The tool takes an image and turns it into a realistic video, along with convincing emotions and movements such as eyebrows raising.

One demo shows off the Mona Lisa coming to life and singing Lady Gaga’s Papparazzi - Microsoft says the system was not specifically trained to handle singing audio, but does so. But the ability to generate video from a single image and audio file has alarmed some experts.

Microsoft has not yet revealed when the AI system will be released to the general public. Yahoo spoke to two AI and privacy experts about the risks of this sort of technology.

What's significant about this new technology?

The VASA system (which stands for 'visual affective skill') allows users to prompt where the fake person is looking, and what emotions they are displaying on screen. Microsoft says that the tech paves the way for ‘real time’ engagement with realistic talking avatars.

The demo can create realistic video from one image and an audio file (Microsoft)

Microsoft says, ‘Our premiere model, VASA-1, is capable of not only producing lip movements that are exquisitely synchronised with the audio, but also capturing a large spectrum of facial nuances and natural head motions that contribute to the perception of authenticity and liveliness.’

Why are some people worried?

Not everybody is enamoured with the new system, with one blog describing it as a ‘deepfake nightmare machine’. Microsoft has emphasised the system is a demonstration and says there are currently no plans to release it as a product.

But while VASA-1 represents a step forward in animating people, the technology is not unique: audio start-up Eleven Labs allows users to create incredibly realistic audio doppelgangers of people, based on just 10 minutes of audio.

Eleven Labs’ technology was used to create a ‘deepfake’ audio clip of Joe Biden by ‘training’ a fake version on publicly available audio clips of the President, and then sending a fake audio clip of Biden urging people not to vote. The incident, which saw a user banned from Eleven Labs, highlighted how such technology can easily be used to manipulate real events.

This illustration photo taken on January 30, 2023 shows a phone screen displaying a statement from the head of security policy at META with a fake video (R) of Ukrainian President Volodymyr Zelensky calling on his soldiers to lay down their weapons shown in the background, in Washington, DC. Chatbots spouting falsehoods, face-swapping apps generating fake porn and cloned voices defrauding companies of millions -- governments are scrambling to regulate AI-powered deepfakes widely feared to be a misinformation super spreader. (Photo by OLIVIER DOULIERY / AFP) (Photo by OLIVIER DOULIERY/AFP via Getty Images) — Meta shows off a deepfaked video of Ukrainian President Volodymyr Zelensky telling his soldiers to lay down their weapons (Photo by OLIVIER DOULIERY / AFP)

In another incident, a worker at a multinational firm paid out $25 million to fraudsters after a video call with multiple other members of staff where everyone was a deepfake. Deepfakes are becoming increasingly common online, with one survey by Prolific finding that 51% of adults said they had encountered deepfaked video on social media.

Simon Bain, CEO of OmniIndex, says, 'Deepfake technology is on a mission to produce content that contains no clues or ‘identifiable artifacts’ to show that it is fake. The recent VASA-1 demo is the latest such development offering a significant step towards this, and Microsoft’s accompanying ‘Risk and responsible AI considerations’ statement suggests this drive for perfection, saying:

“Currently, the videos generated by this method still contain identifiable artifacts, and the numerical analysis shows that there's still a gap to achieve the authenticity of real videos.”

'Personally, I find this deeply alarming, as we need these identifiable artifacts to prevent deepfakes from causing irreparable harm.

What are the telltale signs you are looking at a deepfake?

Tiny signs like inconsistencies in skin texture and flickers in facial movements can give away you are looking at a deepfake, Bain says. But soon, even those may go away, he explains.

Bain says, Only these possible inconsistencies in skin texture and minor flickers in facial movements can visually tell us about a video’s authenticity. That way, we know that when we’re watching politicians destroy their upcoming election chances, it’s actually them and not an AI deepfake.

'This begs the question: why is deepfake technology seemingly determined to eliminate these and other visual clues as opposed to ensuring they remain in? After all, what benefit can a truly lifelike and ‘real’ fake video have other than to trick people? In my opinion, a deepfake that is almost lifelike but identifiably not can have just as much social benefit as one that is impossible to identify as fake.'

What are tech companies doing about it?

Twenty of the world's biggest tech companies, including Meta, Google, Amazon, Microsoft and TikTok signed a voluntary accord earlier this year to work together to stop the spread of deepfakes around elections.

Nick Clegg, president of global affairs at Meta said, “With so many major elections taking place this year, it’s vital we do what we can to prevent people being deceived by AI-generated content.

“This work is bigger than any one company and will require a huge effort across industry, government and civil society.”

But the broader effect of deepfakes is that soon, no one will be able to trust anything online, and companies should use other methods to 'validate' videos, says Jamie Boote, associate principal consultant at the Synopsys Software Integrity Group:

Boote said, "The threat posed by Deepfakes is that they are a way of fooling people into believing what they see and hear transmitted via digital channels. Previously, it was hard for attackers to fake someone’s voice or likeness, and even harder to do so with live video and audio. Now AI makes that possible in real time and we can no longer believe what’s on screen.

"Deepfakes open up another avenue of attacks against human users of IT systems or other non-digital systems like the stock market. This means that video calls from the CEO or announcements from PR folks can be faked to manipulate stock prices in external attacks or be used by spearphishers to manipulate employees into divulging information, changing network settings or permissions, or downloading and opening files.

"In order to protect against this threat, we have to learn to validate that the face on the screen is actually the face in front of the sender’s camera and that can be done through extra channels like a phone call to the sender’s cell phone, a message from a trusted account, or for public announcements, a press release on a public site controlled by the company.

South China Morning Post
Huawei plans tri-fold smartphone as Apple weighs foldable iPhone, reports say
The foldable smartphone market is set for a big boost amid reports that Apple is preparing a clamshell-style model, while Huawei Technologies may soon launch a tri-folding handset. Apple's top-down folding iPhone could come as early as 2026, according to a report by The Information on Wednesday. The US tech giant has reached out to Asian suppliers in recent months to make components for its first foldable handset, according to the report, citing anonymous sources. This would mark Apple's entry i
Reuters
Apple's China smartphone shipments drop 6.7% as Huawei surges, data shows
BEIJING (Reuters) -Apple's smartphone shipments in China fell by 6.7% in the second quarter of 2024, as the tech giant faced intensifying competition from rivals like Huawei, according to data from market research firm Canalys. Apple's total shipments for the quarter ending in June stood at 9.7 million units, down from 10.4 million units in the same quarter last year, Canalys data shows. In contrast, Huawei's smartphone shipments surged 41% year-on-year to 10.6 milion in the quarter, bolstered by the launch of its new Pura 70 series in April.
Digital Spy
My long-term iPhone 15 Pro Max review
I’ve been using the iPhone 15 Pro Max for over six months to come to my long-term verdict on the big-screen flagship with excellent cameras and battery life.
Reuters
Amazon racing to develop AI chips cheaper, faster than Nvidia's, executives say
Inside Amazon.com's chip lab in Austin, Texas, half a dozen engineers on a Friday afternoon put a closely guarded new server design through its paces. The server was packed with Amazon's artificial intelligence chips that compete with those from market leader Nvidia, Amazon executive Rami Sinno said on Friday, during a visit to the lab. Amazon is developing its own processors to limit its reliance on costly Nvidia chips - the so-called Nvidia tax - that power some of the artificial intelligence cloud business at its Amazon Web Services, the main growth driver.
Fortune
Startup with ‘radical’ concept for AI chips emerges from stealth with $15 million to try to challenge Nvidia
Fractile thinks it can deliver 100 times faster performance 10 times cheaper than GPUs.
Reuters
CrowdStrike says over 97% of Windows sensors back online
The outage happened because the advanced platform contained a fault that forced computers running Microsoft's Windows operating system to crash and show the so-called blue screen of death. Microsoft said on Saturday about 8.5 million Windows devices had been affected in the outage that had left flights grounded, forced broadcasters off air and left customers without access to services such as healthcare or banking."Our recovery efforts have been enhanced thanks to the development of automatic recovery techniques and by mobilizing all our resources to support our customers," Kurtz said in a post on LinkedIn.
Reuters
BofA payments app for businesses handled record $500 billion by mid-year
Bank of America’s corporate clients approved a record $500 billion in payments through its CashPro app by mid-year, up almost 40% versus the same period in 2023, the company said in a statement on Thursday. Payments on the app are projected to exceed $1 trillion this year from $802 billion last year, BofA said. Global payments revenues rose to more than $2.2 trillion in 2022, with commercial accounts generating 53% of the total, the report showed.
MediaOutReach
Southco Introduces A Flush-Mount E6-73 Constant Torque Hinge
HONG KONG SAR - Media OutReach Newswire - 26 July 2024 - Southco Asia Ltd., a subsidiary of Southco Inc., a leading global provider of engineered access solutions such as locks, latches, captive fasteners, electronic access solutions, and hinges/positioning technology, has developed a flush-mount version of its popular E6 constant torque hinge. The E6-73 Stainless Steel Constant Torque Hinge provides all the benefits of a torque hinge in a low-profile, corrosion-resistant package, making it an i
Evening Standard
SearchGPT unveiled: OpenAI is launching a search engine to compete with Google
Here’s how you can sign up to be among the first to try the new AI-powered search tool
Engadget
Samsung Galaxy Buds 3 and Galaxy Buds 3 Pro review: AirPods clones that actually deliver
The Galaxy Buds 3 series offers two sets of great earbuds, but Samsung borrowed heavily from Apple to design them.
Yahoo Finance Video
OpenAI announces testing of new search engine, SearchGPT
OpenAI — which Microsoft (MSFT) owns a 49% stake in — is stepping into the search engine game. In a move that could rival Alphabet's (GOOG, GOOGL) Google search, the artificial intelligence developer unveiled its SearchGPT engine. The Market Domination team analyzes OpenAI's blog post outlining what it hopes to achieve in this search platform. For more expert insight and the latest market action, click here to watch this full episode of Market Domination. This post was written by Luke Carberry Mogan.
The Guardian
North Korea-backed cyber espionage campaign targets UK military
National Cyber Security Centre warns of global hacking effort to obtain nuclear and defence intelligence
Engadget
OpenAI unveils SearchGPT, an AI-powered search engine
The launch of SearchGPT comes amid growing competition in AI-powered search.
South China Morning Post
Chinese AI start-up Baichuan raises US$700 million from Alibaba, Tencent, Xiaomi
Baichuan AI, one of China's four so-called artificial intelligence (AI) tigers, raised about 5 billion yuan (US$687.6 million) in a new funding round that valued the start-up at more than 20 billion yuan, the company said on Thursday. The Beijing-based firm's latest round was backed by some of the biggest names in Chinese technology, including Alibaba Group Holding, Tencent Holdings and Xiaomi, along with some state-backed funds. Alibaba owns the South China Morning Post. China International Cap
The Independent
CrowdStrike offers $10 gift card apology for $5bn outage
Some of the Uber Eats gift cards offered to partners were flagged as fraud
Sky News
£7.7 million bounty offered in hunt for members of North Korea-backed hacking group
The UK, US and South Korea have accused a North Korea-backed cyber group of carrying out an online espionage campaign to steal military and nuclear secrets. The "Andariel" group has been compromising organisations around the globe as it attempts to get hold of sensitive and classified technical information and intellectual property data, according to the UK's National Cyber Security Centre (NCSC). The centre, along with the FBI in the US and South Korea's national intelligence service, have issued a joint warning and advisory note about Andariel's actions.
USA TODAY
Get an Apple AirTag tracking device for the lowest price we've seen in months
Keep a watchful eye on your keys, wallet, luggage, and more with an Apple AirTag. Get the tracker on sale at Amazon for just $24, the lowest price we've seen in months.
Cosmo
Rosalía goes braless and *almost* frees the nip in a lace naked dress
Rosalía stepped out wearing a breathtaking naked dress at the Prelude to the Olympics in Paris. The design was a nude coloured see-through lace gown by Dior.
HuffPost
‘I Approve This Message’: Kamala Harris Instantly Uses Trump’s Own Words Against Him
That didn’t take long.
NY Daily News
Harris campaign roasts Trump as ‘old and quite weird’ after Fox News insults
Republican presidential candidate Donald Trump called in to Fox News Thursday, where he told supporters that presumptive Democratic nominee Kamala Harris is a “radical left, not very smart person” who’s part of a massive conspiracy to weaponize the nation’s legal system against him. Harris’ campaign fired back mere minutes later with an email blasting the “78-year-old convicted criminal’s Fox ...