A study of irrelevant variability normalization based training and unsupervised online adaptation for LVCSR.

Yu Shi, Qiang Huo, Guangchuan Shi
Conference of the International Speech Communication Association 1357-1360

11
2010
MaP: A Matrix-based Prediction Approach to Improve Span Extraction in Machine Reading Comprehension

Tianrui Li, Yu Shi, Huaishao Luo, Linjun Shou
arXiv: Computation and Language

3
2020
Mixed-Lingual Pre-training for Cross-lingual Summarization

Chenguang Zhu, Xuedong Huang, Yu Shi, Ruochen Xu
arXiv: Computation and Language

16
2020
Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model

Liyang Lu, Sefik Eskimez, Yu Shi, Linjun Shou
arXiv: Computation and Language

6
2021
Speech-language Pre-training for End-to-end Spoken Language Understanding

Yao Qian, Naoyuki Kanda, Yu Shi, Michael Zeng
arXiv: Computation and Language

30
2021
Listen, Look and Deliberate: Visual Context-Aware Speech Recognition Using Pre-Trained Text-Video Representations

Jinyu Li, Yashesh Gaur, Yu Shi, Shahram Ghorbani
Spoken Language Technology Workshop 621-628

2021
A Segmentation Posterior Based Endpointing Algorithm

YanLu Xie, Yu Shi, Frank K. Soong, BeiQian Dai
International Conference on Acoustics, Speech, and Signal Processing 4 813-816

2007
Symbol graph based discriminative training and rescoring for improved math symbol recognition

Zhen Xuan Luo, Yu Shi, Frank K. Soong
International Conference on Acoustics, Speech, and Signal Processing 1953-1956

11
2008
A Study of Discriminative Training for HMM-Based Online Handwritten Chinese/Japanese Character Recognition

Yongqiang Wang, Qiang Huo, Yu Shi
International Conference on Frontiers in Handwriting Recognition 518-523

1
2010
A symbol graph based handwritten math expression recognition

Yu Shi, Frank K. Soong
International Conference on Pattern Recognition 1-4

3
2008
Robust voice activity detection based on noise eigenspace

Dongwen Ying, Yu Shi, Xugang Lu, Jianwu Dang
Acoustical Science and Technology 28 (6) 413-423

7
2007
Florence: A new foundation model for computer vision

Lu Yuan, Dongdong Chen, Yi-Ling Chen, Noel Codella
arXiv preprint arXiv:2111.11432

188
2021
Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition

Kenichi Kumatani, Robert Gmyr, Felipe Cruz Salinas, Linquan Liu
arXiv preprint arXiv:2112.05820

6
2021
Knowledge distillation for mixture of experts models in speech recognition

Felipe Cruz Salinas, Kenichi Kumatani, Robert Gmyr, Linquan Liu
SMPTE Journal

2022
Florence: A new foundation model for computer vision, 2021

Lu Yuan, Dongdong Chen, Yi-Ling Chen, Noel Codella
URL: https://arxiv.org/abs/2111.11432 2

6
Optimizing alignment of speech and language latent spaces for end-to-end speech recognition and understanding

Wei Wang, Shuo Ren, Yao Qian, Shujie Liu
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 7802-7806

12
2022
i-code: An integrative and composable multimodal learning framework

Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant
Proceedings of the AAAI Conference on Artificial Intelligence 37 (9) 10880-10890

31
2023
i-code v2: An autoregressive generation framework over vision, language, and speech data

Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant
arXiv preprint arXiv:2305.12311

2
2023
Improving readability for automatic speech recognition transcription

Junwei Liao, Sefik Eskimez, Liyang Lu, Yu Shi
ACM Transactions on Asian and Low-Resource Language Information Processing 22 (5) 1-23

54
2023
Improving zero-shot neural machine translation on language-specific encoders-decoders

Junwei Liao, Yu Shi, Ming Gong, Linjun Shou
2021 International Joint Conference on Neural Networks (IJCNN) 1-8

9
2021