Lester Phillip Violeta

Lester Phillip Violeta

Ph.D. Candidate
Nagoya University, Japan

lpgvioleta [at] gmail [dot] com
Google Scholar
@lesterphv
github.com/lesterphillip
linkedin.com/in/lestervioleta

I am a final year (!) Ph.D. student at Nagoya University, Japan at Toda Laboratory under the supervision of Professor Tomoki Toda through a full scholarship from the Monbukagakusho Japanese Government. My research mainly focuses on speech synthesis, particularly with electrolaryngeal speech data and singing voice data.

My work has been published in top speech and audio conferences/journals such as Interspeech, ICASSP, ASRU, SLT, and TASLP. I have also been involved in several academic activities, where I was co-organizer of the recent Singing Voice Conversion Challenge in 2023 (and also co-organizing 2025!). I am also part of the peer-review committee for several academic conferences such as ASRU, SLT, ICASSP, Interspeech, IJCNN, and journals like IEEE JSTSP.

Aside from research, I also have various experiences in the engineering side, creating custom models and deploying these as products for companies. I am currently a part-time speech researcher at CoeFont in the voice conversion team. Previously, I was also a founding AI engineer at VoiceSwap.AI and have also worked on research internships at Sony Computer Science Laboratories Tokyo, NTT Media Intelligence Laboratories, and Hitachi Ltd. Thus, I have extensive experience in both the academic research and engineering sides of AI.

I have a deep international background now studying in Japan, and having done my B.S. in the Philippines and done a research exchange in France. Outside of programming, I like bouldering (check out this page) and learning Japanese.

Education

2021—Present

Nagoya University, Japan

Ph.D. in Computer Science

Advisor: Prof. Tomoki Toda

Thesis: Speech Recognition, Voice Conversion

2015—2020

Ateneo de Manila University, Philippines

B.S. Electronics Engineering

Thesis: Renewable Energy, Microgrid Optimization

2019

Institut Catholique d'Arts et Metiers Paris, France

Research Exchange Semester

Thesis: Renewable Energy, Microgrid Optimization

Publications

Serenade: A Singing Style Conversion Framework Based on Audio Infilling

Preprint 2025

Serenade: A Singing Style Conversion Framework Based on Audio Infilling

Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda

Electrolaryngeal Speech Intelligibility Enhancement through Robust Linguistic Encoders

ICASSP 2024

Electrolaryngeal Speech Intelligibility Enhancement through Robust Linguistic Encoders

Lester Phillip Violeta, Wen-Chin Huang, Ding Ma, Ryuichi Yamamoto, Kazuhiro Kobayashi, Tomoki Toda

Pretraining and Adaptation Techniques for Electrolaryngeal Speech Recognition

IEEE/ACM TASLP 2024

Pretraining and Adaptation Techniques for Electrolaryngeal Speech Recognition

Lester Phillip Violeta, Ding Ma, Wen-Chin Huang, Tomoki Toda

A Preliminary Investigation on Flexible Singing Voice Synthesis Through Decomposed Framework with Inferrable Features

Technical Report 2024

A Preliminary Investigation on Flexible Singing Voice Synthesis Through Decomposed Framework with Inferrable Features

Lester Phillip Violeta, Taketo Akama

The Singing Voice Conversion Challenge 2023

ASRU 2023

The Singing Voice Conversion Challenge 2023

Wen-Chin Huang, Lester Phillip Violeta, Songxiang Liu, Jiatong Shi, Tomoki Toda

An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-hearing

APSIPA 2023

An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-hearing

Lester Phillip Violeta, Tomoki Toda

Intermediate Fine-tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition

ICASSP 2023

Intermediate Fine-tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition

Lester Phillip Violeta, Ding Ma, Wen-Chin Huang, Tomoki Toda

Investigating Self-Supervised Pretraining Frameworks for Pathological Speech Recognition

Interspeech 2022

Investigating Self-Supervised Pretraining Frameworks for Pathological Speech Recognition

Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda

Experience

Nov. 2024 to Present

Researcher — CoeFont

Part-time, voice conversion team

Feb. 2024 to Nov. 2024

Founding AI Engineer — Voice-Swap.AI

Custom singing voice conversion models and speech synthesis for B2B customers

Oct. 2023 to Mar. 2024

Research Assistant — Sony CSL Tokyo

Manager: Dr. Taketo Akama

Research on singing voice synthesis systems

Mar. 2022

Research Intern — NTT Media Intelligence Laboratories

Manager: Dr. Atsushi Ando

Research on speech diarization systems

Jan. 2022 to Feb. 2022

Research Intern — Hitachi Ltd.

Manager: Dr. Takashi Sumiyoshi

Research on low-resourced speech recognition