Skip to content
View ShawnPi233's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing
  • BUPT
  • Beijing

Highlights

  • Pro

Block or report ShawnPi233

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
ShawnPi233/README.md

Bingsong Bai

I am a Foundation Model Algorithm Engineer at ModelBest (้ขๅฃๆ™บ่ƒฝ๏ผŒVoxCPM). I received my Master's degree from the School of Artificial Intelligence, Beijing University of Posts and Telecommunications (BUPT). My research interests lie in the intersection of Large Speech Models (LSM), Automatic Speech Recognition (ASR), Singing Voice Conversion (SVC), and Expressive Text-to-Speech (TTS).

Prior to joining ModelBest, I earned my Bachelor's degree in Computer Science and Technology from Ningbo University (Yangming Innovation Class). I have gained extensive industry experience through research and engineering internships at Zhipu AI (ๆ™บ่ฐฑAI่ฏญ้Ÿณ่พ“ๅ…ฅๆณ•), Tencent Music Entertainment (TME, Lyra Lab / ๅคฉ็ดๅฎž้ชŒๅฎค), and Momo (้™Œ้™Œ).

I have been awarded the Zhejiang Government Scholarship (3 times) and the BUPT First-Class Scholarship (2 times). My research has been accepted for top-tier conferences such as AAAI, Interspeech, ICASSP, and ISCSLP.

๐Ÿ”ฅ News

  • 2026.03: ๐Ÿš€ Joined ModelBest as a Large Speech Foundation Model Researcher.
  • 2026.01: ๐ŸŽ‰ One paper (SynParaSpeech) accepted by ICASSP 2026 as the first author!
  • 2025.12: ๐ŸŽ‰ One paper (HQ-SVC) accepted by AAAI 2026 as the first author!
  • 2025.10: ๐Ÿš€ Joined Zhipu AI as a Speech Large Model Research Intern.
  • 2025.07: ๐ŸŽธ Joined Tencent Music (QQ Music) focusing on multi-speaker conversational podcast TTS.
  • 2025.03: ๐Ÿ‘ซ Joined Momo focusing on paralinguistic TTS and understanding.
  • 2024.06: ๐ŸŽ‰ One paper (SPA-SVC) accepted by Interspeech 2024 as the first author.

๐Ÿ“‘ Selected Research Papers

๐Ÿ“Ž For a full list of publications, please visit my Google Scholar.

HQ-SVC: High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios, Bingsong Bai, et al., AAAI 2026. [CCF-A]

SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding, Bingsong Bai, et al., ICASSP 2026. [CCF-B]

SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion, Bingsong Bai, et al., Interspeech 2024. [CCF-B]

ExpressiveSinger: Synthesizing Expressive Singing Voice as an Instrument, Fengping Wang, Bingsong Bai, et al., ISCSLP 2024.

๐Ÿ—ฃ Large Speech Models & TTS

๐Ÿ† Awards & Honors

  • 2023, 2024: BUPT First-Class Academic Scholarship
  • 2020, 2021, 2022: Zhejiang Provincial Government Scholarship (3 consecutive years)
  • 2021: Mathematical Contest in Modeling (MCM) - International Second Prize
  • 2020: Contemporary Undergraduate Mathematical Contest in Modeling (CUMCM) - Provincial Second Prize

Pinned Loading

  1. HQ-SVC HQ-SVC Public

    Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)

    Python 105 6

  2. SynParaSpeech SynParaSpeech Public

    Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (ICASSP 2026)

    JavaScript 71 4