Toutiao is making fake news to train its anti-fake news AI

TechNode

10 December 2017 at 9:05 pm

One of the world’s most popular news aggregator needs to make sure it doesn’t become a forum for fake news

Toutiao is making fake news to train its anti-fake news AI was written by Frank Hersey for TechNode.

Toutiao’s AI software did not generate this headline, but for the 20 million pieces of content that flow through the platform each day, headline generation and AB testing are just two of the AI services Toutiao uses to get more people tapping.

Speaking to foreign journalists for the first time as head of the Jinri Toutiao AI Lab and vice president of the app’s owner Bytedance, Dr. Ma Wei-Ying talked about the tech that his lab is working on, why it has a bot that generates fake news and what it knows about its users.

Jinri Toutiao is a news recommendation app that is trained and updated in real time on a user’s behavior. Unlike search engines, Ma pointed out, its search function is individual rather than one ranking for everyone.

“This is the democratization of content creation,” said Ma, putting Bytedance in line with other Chinese tech companies that have recently declared themselves as content companies.

“Toutiao is becoming a new information platform for people to find information and connect with information. People are using their smartphones not just to access information, but to create information. They don’t need their own website–they can use Toutiao to directly upload and publish the information and content they create.”

Also Read: ComfortDelGro acquires controlling stake in Uber Singapore’s car rental subsidiary

The tremendous amount of data generated by users and creators allows the training of neuro-network models. Applying AI to the data gathered is generating a better understanding of the world these users are in. “We are moving from a digital representation of the world to a semantic representation of the world”.

Ma believes the system is going to improve across the board. “Content creation will be fundamentally revolutionized in next few years” as AI allows the “mining of human intelligence to close the feedback loop” of each stage of the lifecycle of content creation, moderation, dissemination, and consumption. Here’s how.

Make fake news to beat fake news

Bytedance has a different approach to tackling fake news: writing it. The AI lab that Ma heads has developed a bot that uses the company’s growing database of real fake news stories to generate its own fake fake news. It then has another bot for detecting fake news which is trained by analyzing its counterpart’s fake feed, and by drawing on a matching database of real news.

“One is good at writing, which means this also helps us to advance machine writing, and the other is machine reading. These two can push each other to improve by using the label data and assimilated data through our algorithms,” said Ma.

Ma believes that having two competing algorithms allows them each to improve. Toutiao lets users report what they believe to be fake news and analyzes comments to detect whether they suggest the content might be fake. When the system identifies a piece of fake news that has got through, it will notify all who have read it that they had read something fake.

Bytedance is using this “dual-learning” technique in other ways. It machine translates news from Chinese into English, then has another program to translate that article from English into Chinese to improve both processes. Fake news can also be translated to allow the algorithms to train for Toutiao’s global expansion. Other aspects of global expansion are language-independent, such as video, meaning those algorithms have already been trained on large numbers of Chinese users.

In the future, the culmination of analyzing successful pieces, building a database of popular topics, and developing machine writing will mean Toutiao will be able to automatically generate articles for its readers on their favorite subjects.

Better algorithms, better articles

“We adjust our strategy every week. It’s a constant experiment,” said Ma.

The system is monitoring in real time and is also working to predict if a piece of content will be a success. Algorithms offer four headlines to article writers then conduct AB testing to determine which is having the most impact. But not all articles are subject to algorithms due to the computing power involved. Only when a piece starts to gain traction will it get extra help.

Machine learning is used for viral prediction. It compares incoming articles with previous content that has taken off and as the machine learning proves successful, the accuracy of the system increases with constant feedback.

Ma acknowledged that care has to be taken to prevent the algorithms from distorting the popularity of particular elements of content or stopping content from new users getting through who have yet to establish a positive profile from the system.

Automated sports commentary

Object recognition in video is also finely developed to fuel more personalisation. Bytedance is working on smarter, personalized sports coverage, explained Ma. The current one-feed-fits-all approach will be replaced with a tailored viewing experience when fan data recognizes an interest in, for example, a particular player.

Coverage will focus more on that player, with the end goal being a personalized, automated commentary and onscreen captions.

Location, location, location. And time.

Toutiao builds up an idea of users’ lives including their whereabouts and habits. As well as understanding what content the user is interested in, the AI adjusts recommendations based on current and historic location. Ma gave an example of this which shows the sophistication of the tool. Chinese people living in the US, using Toutiao as part of their everyday lives there, are generating a footprint.

Then suddenly Chinese New Year comes around and the location changes from the US to somewhere in China. The news may change accordingly there and then, but once the user heads back to the States, the software assumes that the user’s location at Chinese New Year was significant to them, and probably their hometown. Once back in the US, if any news stories crop up in their supposed hometowns, they will show up in the users’ feeds.

Also Read: Founder of SumoStory makes video to apologise for promoting clients on Forbes; PR firms respond

Time is used as a gauge for what is appropriate to send. Algorithms work out when a person is busy and so the app will not bombard them with too much content and will save it until they are free. On a larger scale, the data is providing profiles of cities and areas of cities in terms of people’s working habits.

On an individual scale, these patterns can suggest what a person’s occupation is, but the data is anonymised. The system generates a user ID per smartphone, made up of a billion factors and which only an algorithm can identify.

Moderation and government relations

In a separate briefing, Bytedance senior vice-president for corporate development Liu Zhen revealed that of the 20 million pieces of content uploaded to Toutiao each day, 90% are machine moderated. Meaning the other 2 million pieces are human-reviewed. Although Toutiao has been working on its moderation for five years, humans are and always will be needed, according to Ma.

“We have a very good communication channel between the company and the government. So far we’ve been working very hard because we are a new platform, a new kind of application exploring a new frontier. Things have been going quite smoothly because the communication channel is very open and very healthy,” said Ma.

—

The article Toutiao is making fake news to train its anti-fake news AI first appeared on Technode.

The post Toutiao is making fake news to train its anti-fake news AI appeared first on e27.

People
“Call Her Daddy'”s Alex Cooper Models Her Wedding Night Lingerie in Instagram Reveal: See the Racy Look
Cooper wore a sexy lacy bodysuit from SKIMS' Wedding Shop collection after marrying Matt Kaplan in Mexico
15 hours ago
Yahoo News Singapore
Fatal accident in Tampines: 42-year-old driver involved in crash charged with four offences, including dangerous driving causing death
After a fatal collision in Tampines, Muhammad Syafie Ismail, 42, faces four charges including dangerous driving causing death.
10 hours ago
Evening Standard
Liverpool: Darwin Nunez tipped to be sold by Arne Slot after 'unforgivable error'
The striker was again guilty of a huge miss as Everton all but ended Liverpool’s title hopes
7 hours ago
Cosmo
Sabrina Carpenter looks practically naked in completely see-through lace mini dress
Sabrina Carpenter went braless wearing the Mirror Palais Anemone Dress in butter featuring illusion tulle adorned with lace appliqués along the neckline and hem
2 hours ago
INSIDER
Malaysia might add a casino to boost troubled $100 billion mega-development Forest City
The casino, which would only be the second in Malaysia, could revive the struggling property.
5 hours ago
The Telegraph
Horse trainer accused of rape and murder found dead at home
A horse trainer who was accused of the murder and rape of a showjumper he was in an “illicit relationship” with has been found dead before the second day of his trial was set to begin.
16 hours ago
The Telegraph
Satellite images show Iran tried to cover up impact of Israeli missile strike
Iran replaced a destroyed radar installation within hours of an Israeli strike on an air base last week in an attempt to make it appear as though the damage had been minimal, it has been claimed.
18 hours ago
People
Kim Kardashian Reveals the Viral SKIMS Nipple Bra Was Modeled After Her Own Breasts
The bra was first released in October 2023
2 days ago
The Telegraph
Hezbollah launches deepest ever attack inside Israel
Hezbollah has launched a series of drone strikes against Israeli military bases, in its deepest attack inside Israel since the start of the war in Gaza.
2 days ago
BANG Showbiz
Megan Thee Stallion being sued for ‘forcing cameraman watch her having lesbian sex!’
In a suit being brought by her ex-cameraman, Megan Thee Stallion is being sued for allegedly creating a hostile work environment and forcing her former videographer to watch her having lesbian sex.
2 days ago
The Telegraph
Ukraine has only six months left
Last summer there were high expectations that Ukraine’s major counter offensive would succeed in driving Russian forces back, setting the stage for victory. That didn’t happen; instead the offensive faltered and gained little ground. This failure can be laid squarely at the feet of Western refusal to supply adequate military aid. The result was a silent backlash in domestic politics both sides of the Atlantic, which undoubtedly contributed to the US president’s failure to get a further aid packa
2 days ago
The Telegraph
The five horrible defensive mistakes that derailed Liverpool’s title bid
When you are fighting to stay in the title race and need a win at the home of your city rivals, you cannot get away with the calamitous defending that gifted Everton the lead against Liverpool.
17 hours ago
People
Kourtney Kardashian's Sexy Bikini Photo from Her 45th Birthday Leaves Husband Travis Barker Melting
Kardashian enjoyed a vacation in paradise with her husband and four kids in honor of "45 trips around the sun"
2 days ago
People
Christy Turlington Says Her Son's Rival Basketball Team Heckled Him by Passing Around Her Nude Photos
The supermodel is mom to son Finn, 18, and daughter Grace, 20, whom she shares with husband Edward Burns
19 hours ago
People
TikToker Xandra Pohl Sizzles in Racy Bikinis in “Sports Illustrated Swimsuit” Rookie Shoot: ‘Been My Dream’
Pohl joins Brittany Mahomes and seven Swim Search finalists in the 2024 Rookie class in the magazine’s 60th-anniversary issue, out May
19 hours ago
SETHLUI.COM
Sweet Garden Dining Cafe: No GST & service charge at ex-5-star hotel chef’s Western cafe
The post Sweet Garden Dining Cafe: No GST & service charge at ex-5-star hotel chef’s Western cafe appeared first on SETHLUI.com.
8 hours ago
The Telegraph
Seventy Israeli hostages have been killed, says captive
Around half of the remaining Israeli hostages abducted by Hamas have been killed in Gaza, an Israeli-American captive said in a rare proof-of-life video.
19 hours ago
Evening Standard
Manchester United 4-2 Sheffield United: Bruno Fernandes magic offers fresh Europa League hope
Red Devils three points clear of Newcastle in battle for top six
17 hours ago
Reuters
TikTok CEO expects to defeat US restrictions: 'We aren't going anywhere'
WASHINGTON (Reuters) -TikTok's chief executive said on Wednesday the social media company expects to win a legal challenge to block legislation signed into law by President Joe Biden that he said would ban its popular short video app used by 170 million Americans. "Rest assured - we aren't going anywhere," CEO Shou Zi Chew said in a video posted moments after Biden signed the bill that gives China-based ByteDance 270 days to divest TikTok's U.S. assets or face a ban. Biden's signing sets a Jan. 19 deadline for a sale - one day before his term is set to expire - but he could extend the deadline by three months if he determines ByteDance is making progress.
23 hours ago
Reuters
UPDATE 2-Malaysia ex-PM Mahathir facing anti-graft probe in a case involving his sons
Former Malaysian Prime Minister Mahathir Mohamad is among individuals being investigated in connection with a graft probe involving his sons, the head of Malaysia's Anti-Corruption Commission (MACC) said on Thursday. The investigation comes amid a widening crackdown on graft involving prominent political figures, including those seen as close to 98-year-old veteran leader Mahathir, a long-time foe of current Prime Minister Anwar Ibrahim.
5 hours ago