Shang-Yi Chuang is a Machine Learning Researcher about Automatic Speech Recognition at the Siri team of Apple. She received her master’s degree in Computer Science from Cornell Tech. She has experience in international collaborations in the United States, Japan, and Taiwan.
Prior to Cornell Tech, she was working on artificial intelligence at Academia Sinica, Taiwan. Her research interests include natural language processing, speech processing, and computer vision. She collaborated with Prof. Yu Tsao and Prof. Hsin-Min Wang on audio-visual multimodal learning and data compression on speech enhancement. She was engaged in a project of a cross-lingual question answering system with Prof. Keh-Yih Su as well.
She was also conducting research about humanoid robots with Prof. Tomomichi Sugihara at Motor Intelligence, Osaka University. Her work was to improve the control system of a robot arm in order to create a safer and more comfortable human-robot coworking space.
M.Eng., Major in Computer Science, 2021 - 2022
Cornell Tech
B.S., Major in Mechanical Engineering, Minor in Electrical Engineering, 2012 - 2017
National Taiwan University
FrontierLab@OsakaU Program, 2016 - 2017
Osaka University
DTS is an API-based project that assists stock trading by providing registered users with stock prices and statistical analytics.
Going Everywhere is an NFT-based art project supported by the \Art Microgrant Award. A critical characteristic of blockchain is inerasability, and it can be leveraged to prove that something exists. The artist collected a set of photos of an identifier (Minccino) as a demonstration of where she existed across the physical world and launched the collection as an NFT. After putting the project on chain, the existence became eternal in the digital world.
iLAVSE is a deep-learning-based audio-visual project that addresses three practical issues often encountered in implementing AVSE systems, including the requirement for additional visual data, audio-visual asynchronization, and low-quality visual data.
TMSV is an audio-visual dataset based on the script of TMHINT (Taiwan Mandarin hearing in noise test).
LAVSE is a deep-learning-based audio-visual project that addresses additional processing costs and privacy problems.
Realizing human behavior on a robot arm based on a nonlinear reference shaping controller in order to create robots which can share a safe working environment with humans.
Practice of PCB designing and welding.
The ME Robot Cup is a traditional annual event at the Department of Mechanical Engineering, NTU.
Native
Advanced
Advanced