Speech recognition system can transcribe Singaporean lingo in real time

Wong Casandra

Senior Reporter

9 July 2018 at 9:14 am

A live demonstration of the code-switch speech recognition system developed by AI Singapore on 9 July, 2018. (PHOTO: Wong Casandra/Yahoo News Singapore)

“Hello, 是 police 吗? 啊, 我要报警…uh then they anyhow throw 那些 rubbish…can you all like hurry up come?”

This phone call could have been made by a Singaporean resident to the police. But it was actually a simulated emergency call, transcribed by a homegrown speech recognition system that is able to interpret Singaporean lingo on the fly.

Demonstrated live to the media on Monday (9 July), the system is being developed by the Republic’s national programme in artificial intelligence, AI Singapore (AISG), in its AI Speech Lab.

The speech engine, named Southeast Asian Mandarin-English (SEAME), is touted by its creators as adept at recognising “code-switching” – the practice of alternating between two or more languages in conversation.

The system now has a combined vocabulary of about 80,000 words, mostly in English and Mandarin. It can adapt to local accents of spoken English.

“Our technological breakthrough is the outcome of…research efforts in Singapore (that started a decade ago),” said National University of Singapore (NUS) Professor Li Haizhou, who co-leads the speech lab. “This technology performs better than commercial engines as it can accurately recognise conversations comprising words from different languages and solves a unique Singaporean problem.”

The system is currently “90 per cent accurate in a quiet environment”, according to Prof Li, who is with the Department of Electrical and Computer Engineering and the Department of Mechanical Engineering at NUS.

For now, its Hokkien, Malay, Tamil and Singlish lexicon is limited to basic phrases like “jiak ba bueh”, “hoh boh” (“Have you eaten?” and “How are you?” in Hokkien, respectively), “lah”, “loh”, as well as food and street names such as char kway teow, nasi lemak and Jalan Besar.

When more data is collected, Prof Li’s team may expand the system’s code-switching capabilities to include English and Tamil or English and Malay.

The team is working on a “stable” version for commercial use to be ready in 12 to 18 months. Their first partner? The Singapore Civil Defence Force (SCDF).

“Our first step is to make the ‘code-switching’ work (better),” said Prof Li. “Our next step is to work with partners to include vocabulary they would like to incorporate. For example, if it’s the SCDF, things will be concerning the deployment of medical services, or fire-related (incidents).”

The SCDF’s 995 Operations Centre receives close to 200,000 calls for assistance every year, according to Assistant Commissioner Daniel Seet, SCDF Director of Operations, and the new system, “if successful, will help (operators) reduce the time needed to log in information” relating to emergencies.

The team will use the upcoming months to harvest data from real calls made to the SCDF centre, said the lab’s co-lead Assistant Professor Chng Eng Siong, School of Computer Science & Engineering at the Nanyang Technological University (NTU).

“We will visit the site and collect the calls (excluding personal data)…as many as possible, hundreds of hours of SCDF calls. We will send people down (to the site) to transcribe and run our engine through their system to study our problems and errors,” said Asst. Prof Chng.

The system has been trained on up to about 2,000 hours of audio data, about half of it from Singapore sources, including conversations on the ground, audio clips from radio station 938NOW and videos by Singaporean YouTubers. In comparison, search giant Google has about 10,000 hours to train its AI system, said Asst. Prof Chng.

Over the last three to four years, the team has collected conversational data in Singapore and Penang, cities where code-switching between English and Mandarin is more common than elsewhere in the region.

Code-switching poses a huge challenge because the system has to deal with verbal input as if it were a single language, said Asst. Prof Chng. Different pronunciations of the same word pose further challenges, he added.
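To give a rough sense of what code-switching looks like in text form, the sketch below labels each token of a mixed utterance as English or Mandarin by its Unicode script and counts how often the speaker changes language. This is only an illustration and not AISG's method: the function names are hypothetical, it works on text that has already been transcribed rather than on audio, and it ignores Hokkien, Malay and Tamil entirely.

```python
import re
from itertools import groupby

# Hypothetical illustration: works on transcribed text, not audio.
# Matches either a single CJK character or a run of Latin letters.
TOKEN_PATTERN = re.compile(r"[\u4e00-\u9fff]|[A-Za-z]+")


def tokenize(utterance: str) -> list[str]:
    """Split an utterance into Latin words and individual CJK characters."""
    return TOKEN_PATTERN.findall(utterance)


def script_of(token: str) -> str:
    """Label a token 'zh' if it is a CJK character, otherwise 'en'."""
    return "zh" if "\u4e00" <= token[0] <= "\u9fff" else "en"


def language_runs(utterance: str) -> list[tuple[str, list[str]]]:
    """Group consecutive tokens that share the same script label."""
    tokens = tokenize(utterance)
    return [(label, list(group)) for label, group in groupby(tokens, key=script_of)]


def count_switches(utterance: str) -> int:
    """Count the points where the script label changes between tokens."""
    runs = language_runs(utterance)
    return max(len(runs) - 1, 0)


if __name__ == "__main__":
    call = "Hello, 是 police 吗? 啊, 我要报警"
    for label, tokens in language_runs(call):
        print(label, tokens)
    print("switches:", count_switches(call))
```

Run on the opening words of the simulated call, the sketch finds four language runs and three switch points. A real recogniser faces the much harder task of making these decisions from sound alone, which is why the team treats the mixed input as a single language with a combined vocabulary.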

When asked whether the system would be able to handle translation in future, Prof Li said, “If translation (capabilities) are needed, that can be done in the lab as well. We have the technology to do this.”

In due course, the system could be deployed at various government agencies and companies to assist frontline officers while they focus on customer service, as well as in areas such as voice data mining and live subtitling, said Prof Li.

“For public services, we cannot pump our conversations to Google. We don’t want our confidential conversations to go into a company’s database so we need to have our own Singapore-based speech recognition engine to serve our citizens,” he added.

AISG is investing $1.25 million to set up the speech lab, with four government agencies on board to match the investment, bringing total funding to $2.5 million over the next three years.

Located within the National University of Singapore’s Kent Ridge campus, the lab is already operational. It occupies a floor area of 125 sq m and will eventually have five AI engineers.

The lab marks AISG’s first major collaboration with multiple government agencies to design an AI system that could be deployed government-wide or nation-wide.

