Song-Ze (Jimmy) Yu 游松澤

A musician and music/audio ML researcher, actively seeking a CS PhD position for Fall 2027.

I’m a Computer Science student at UC Berkeley and NTHU. My work focuses on music understanding for audio-language models, and controllability for music generation and audio effect design.

I work with BAIR under Prof. Trevor Darrell and David M. Chan, CNMAT under Prof. Carmine-Emanuele Cella, and the NTU Music and AI Lab under Prof. Yi-Hsuan (Eric) Yang.

My long-term goal is to build audio-language models (ALMs) that can perceive and reason about music, while giving musicians more direct and expressive ways to interact with generative systems.

Email Google Scholar GitHub LinkedIn Academic CV

Song-Ze Yu performing at a grand piano on stage

Before research

I performed 24 solo recitals and appeared on television more than 70 times across China and Taiwan. I still perform, compose, and produce; that perspective shapes the research questions and tools I choose to build.

Music CV YouTube (7K subscribers) Instagram

Song-Ze Yu playing piano for Jay Chou. Showing my composition to Jay Chou.

Selected Work

See more

2026 · Preprint · Music understanding · BAIR

PitchBench: Measuring Pitch Hearing in Audio-Language Models

Milan Liessens Dujardin*, Song-Ze Yu*, Craver Corbyn Thomas-Smith, David M. Chan, Karina Nguyen (* equal contribution)

Evaluating pitch hearing in audio-language models through controlled experiments across sequences, chords, instruments, and acoustic conditions.

Paper Dataset Code

2026 · DAFx · Controllability · CNMAT

InstructFX2FX: A Multi-Turn Text-to-Effect System for Sequential Audio Effect Refinement

Song-Ze Yu, Milan Liessens Dujardin, Yuxuan Cai, Wantong Zhang, Brian Cruz, Jeremy Wagner, Carmine-Emanuele Cella

Introduces sequential FX refinement: given the current effect state and a new instruction, update the sound while preserving what earlier instructions have already achieved. InstructFX2FX combines LLM planning with CLAP-guided optimization for iterative, multi-turn audio effect design.

Paper Project Code