I am currently a Research Scientist at DubGuild, where I work on improving generative spoken language models.

Prior to this, I received my Ph.D. in Computer Science at Nagoya University, Japan at Toda Laboratory under the supervision of Professor Tomoki Toda, where my research mainly focused on speech synthesis, voice conversion, and speech recognition. My Ph.D. work has been published in top speech and audio conferences/journals such as Interspeech, ICASSP, ASRU, SLT, and TASLP. I am also currently part of the peer-review committee for several academic conferences such as ASRU, SLT, ICASSP, Interspeech, IJCNN, and journals like IEEE JSTSP. I was also the main organizer of the Singing Voice Conversion Challenge in 2023 and 2025.

I have a deep international background now based in Japan, and having done my B.S. in the Philippines and done a research exchange in France. Outside of research, I like bouldering (check out this page) and learning Japanese.

Education

2023—2026

Nagoya University, Japan

Ph.D. Computer Science

Advisor: Prof. Tomoki Toda

Thesis: Speech Synthesis, Voice Conversion

2021—2023

Nagoya University, Japan

M.S. Computer Science

Advisor: Prof. Tomoki Toda

Thesis: Speech Recognition

2015—2020

Ateneo de Manila University, Philippines

B.S. Electronics Engineering

Thesis: Renewable Energy, Microgrid Optimization

2019

Institut Catholique d'Arts et Metiers Paris, France

Research Exchange Semester

Thesis: Renewable Energy, Microgrid Optimization

Publications

日本語音声基盤モデルをスケーリングさせ、TTS性能を見てみる (in Japanese)

Technical Report 2026

日本語音声基盤モデルをスケーリングさせ、TTS性能を見てみる (in Japanese)

長谷川 直哉, 相田 優希, 廣岡 聖司, 林 春太朗, Lester Phillip Violeta, 大嶽 匡俊

The Singing Voice Conversion Challenge 2025: From Singer Identity Conversion To Singing Style Conversion

ICASSP 2026

The Singing Voice Conversion Challenge 2025: From Singer Identity Conversion To Singing Style Conversion

Lester Phillip Violeta, Xueyao Zhang, Jiatong Shi, Yusuke Yasuda, Wen-Chin Huang, Zhizheng Wu, Tomoki Toda

Serenade: A Singing Style Conversion Framework Based on Audio Infilling

EUSIPCO 2025

Serenade: A Singing Style Conversion Framework Based on Audio Infilling

Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda

Electrolaryngeal Speech Intelligibility Enhancement through Robust Linguistic Encoders

ICASSP 2024

Electrolaryngeal Speech Intelligibility Enhancement through Robust Linguistic Encoders

Lester Phillip Violeta, Wen-Chin Huang, Ding Ma, Ryuichi Yamamoto, Kazuhiro Kobayashi, Tomoki Toda

Pretraining and Adaptation Techniques for Electrolaryngeal Speech Recognition

IEEE/ACM TASLP 2024

Pretraining and Adaptation Techniques for Electrolaryngeal Speech Recognition

Lester Phillip Violeta, Ding Ma, Wen-Chin Huang, Tomoki Toda

A Preliminary Investigation on Flexible Singing Voice Synthesis Through Decomposed Framework with Inferrable Features

Technical Report 2024

A Preliminary Investigation on Flexible Singing Voice Synthesis Through Decomposed Framework with Inferrable Features

Lester Phillip Violeta, Taketo Akama

The Singing Voice Conversion Challenge 2023

ASRU 2023

The Singing Voice Conversion Challenge 2023

Wen-Chin Huang, Lester Phillip Violeta, Songxiang Liu, Jiatong Shi, Tomoki Toda

An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-hearing

APSIPA 2023

An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-hearing

Lester Phillip Violeta, Tomoki Toda

Intermediate Fine-tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition

ICASSP 2023

Intermediate Fine-tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition

Lester Phillip Violeta, Ding Ma, Wen-Chin Huang, Tomoki Toda

Investigating Self-Supervised Pretraining Frameworks for Pathological Speech Recognition

Interspeech 2022

Investigating Self-Supervised Pretraining Frameworks for Pathological Speech Recognition

Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda