招待講演

2026

山岡洸瑛, “分散マイクロフォンアレイのための時間差推定と信号の同期化,” in 応用音響研究会, May 2026.

2025

齋藤佑樹, “深層学習による音声の合成と評価,” in 第20回言語処理若手シンポジウム (YANS2025), Sep. 2025.

2024

岡本悠希, “環境音合成の研究動向,” in 電気情報通信学会信号処理研究会, Aug. 2024.
山岡洸瑛, “非同期分散マイクロフォンアレイの研究動向,” in 電気情報通信学会信号処理研究会, Aug. 2024.

2023

高道慎之介, “音声の基盤モデルが切り拓く未来,” in 第24回慶應科学技術展, Dec. 2023.
K. Seki, “Data Selection for Text-to-speech with Feedback from Automatic Evaluation of Naturalness on Synthetic Speech,” in Joint Workshop of VoicePersonae and ASVspoof 2023 (VoiceMOS mini workshop), Nov. 2023.
高道慎之介, “AI音声合成の技術動向,” in CEDEC 2023, Aug. 2023.
高道慎之介, “ここまで来た＆これから来る歌声合成・歌声変換技術,” in 音楽情報科学研究会夏のシンポジウム, Aug. 2023.
高道慎之介, “機械学習を用いた波形モデリング〜人間の音声の場合〜,” in 日本地球惑星科学連合2023年大会, May 2023.
高道慎之介, “音声合成技術の未来,” in G7群馬高崎デジタル・技術大臣会合開催記念事業「ぐんま Digital Land」, Mar. 2023.
高道慎之介, “音声合成の技術動向,” in 日本音響学会2023年春季研究発表会, Mar. 2023.
高道慎之介, “音声合成は次にどこに向かうのか,” in 言語処理学会第29回年次大会 (NLP2023), Mar. 2023.
高道慎之介, “音声処理を支えるデータベース,” in 明治大学先端メディアコロキウム, Jan. 2023.

2022

Y. Saito, “Towards human-in-the-loop DNN-based speech synthesis technologies,” in Seminar by IEEE NZ Signal Processing / Information Theory Joint Chapter and Acoustics Research Center, the University of Auckland, Dec. 2022.
T. Nakamura, “Signal-processing-inspired deep learning,” in Seminar by IEEE NZ Signal Processing / Information Theory Joint Chapter and Acoustics Research Center, the University of Auckland, Dec. 2022.
高道慎之介, “深層学習による音声合成の発展とその先,” in IBIS2022 企画セッション「マルチメディアと機械学習」, Nov. 2022.
高道慎之介, “知覚に伴う音声合成に向けた機械学習とコーパス,” in 日本音響学会第148回(2022年秋季)研究発表会, Sep. 2022.
中村友彦, “ウェーブレット変換と深層学習を融合した音源分離,” in 応用音響研究会（EA）, Aug. 2022.
Y. Saito, “Towards human-in-the-loop speech synthesis,” in Seminar by IEEE Systems, Man and Cybernetics Singapore Chapter, Chinese and Oriental Languages Information Processing Society Teochew Doctorate Society, Singapore, and Human Language Technology Lab., National University of Singapore, Aug. 2022.
高道慎之介, “開かれた音声情報処理のためのコーパス,” in 国立国語研究所言語資源ワークショップ, Aug. 2022.
D. Xin, “Maintaining data consistency in speech quality assessment and speech emotion recognition,” in Seminar by IEEE Systems, Man and Cybernetics Singapore Chapter, Chinese and Oriental Languages Information Processing Society Teochew Doctorate Society, Singapore, and Human Language Technology Lab., National University of Singapore, Aug. 2022.
高道慎之介, “Self-supervised neural voice remastering,” in 東京大学|Google パートナーシップスプリングシンポジウム, Apr. 2022.

2021

高道慎之介, “音声合成のコーパスをつくろう,” in Tokyo BISH Bash #5, Jun. 2021.
高道慎之介, “音声認識の基礎と音声合成,” in 日本ロボット学会第132回セミナー, Feb. 2021.
高道慎之介, “ここまで来た＆これから来る音声合成,” in 明治大学先端メディアコロキウム, Jan. 2021.
H. Saruwatari, “Multichannel audio source separation based on unsupervised and semi-supervised learning,” in Proceedings of Chinese Computer Federation, Jan. 2021.

2020

H. Saruwatari, “Multichannel audio source separation based on unsupervised and semi-supervised learning,” in Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Dec. 2020.

2019

高道慎之介, “人間を利用する音声言語処理の試み,” in 第242回自然言語処理研究会, Oct. 2019.
高道慎之介, “音声合成・変換の国際コンペティションへの参加を振り返って,” in FIT2019 企画セッション「コンペの覇者」, Sep. 2019.
猿渡洋, “教師無し学習に基づく自律的な音メディア信号処理とその応用,” in Software Defined Media シンポジウム2019, Jul. 2019.
高道慎之介, “統計的ボイチェン研究事情,” in GREE #VRSionUp!6, Jul. 2019.
Y. Takida, S. Koyama, N. Ueno, and H. Saruwatari, “Comparison of Interpolation Methods for Gridless Sound Field Decomposition Based on Reciprocity Gap Functional,” in Proceedings of International Congress on Sound and Vibration (ICSV), Montreal, Jul. 2019. (to appear)
高道慎之介, “音声コーパス設計と次世代の音声研究に向けた提言,” in 2019年6月音声研究会, Jun. 2019.
S. Takamichi, “Group-delay modelling based on deep neural network with sine-skewed generalized cardioid distribution,” in Proceedings of International Conference on Soft Computing & Machine Learning (SCML), Wuhan, China, Apr. 2019. (invited)

2018

高道慎之介 and 亀岡弘和, “音声分野における敵対的学習の可能性と展望,” in IBIS2018 企画セッション, Nov. 2018.
M. Une, Y. Saito, S. Takamichi, D. Kitamura, R. Miyazaki, and H. Saruwatari, “Generative approach using the noise generation models for DNN-based speech synthesis trained from noisy speech,” in Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Hawaii, Nov. 2018.
S. Koyama, “Sparsity-based sound field reconstruction,” in Tohoku Universal Acoustical Communication Month, Seminar on the spatial aspects of hearing and their applications, keynote lecture, Sendai, Oct. 2018.
高道慎之介, “分布あるいはモーメント間距離最小化に基づく統計的音声合成,” in 第18回ステアラボ人工知能セミナー招待講演, Oct. 2018.
S. Takamichi, “What can GAN and GMMN do for augmented speech communication?,” in GMI workshop, Hiroshima, Aug. 2018.
猿渡洋, “教師無し最適化に基づくブラインド音源分離とその応用,” in 東京大学先端研・富士電機第二回産学連携技術交流会, Feb. 2018.
高道慎之介, “騙し騙され音声合成,” in IPSJ-ONE, 情報処理学会第80回全国大会, 2018.

2017

S. Takamichi, “Modulation spectrum-based speech parameter trajectory smoothing for DNN-based speech synthesis using FFT spectra,” in Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Kuala Lumpur, Dec. 2017.
D. Kitamura, N. Ono, and H. Saruwatari, “Experimental analysis of optimal window length for independent low-rank matrix analysis,” in Proceedings of Proceedings of 25th European Signal Processing Conference, Greek island of Kos, Aug. 2017.
S. Koyama, N. Murata, and H. Saruwatari, “Effect of multipole dictionary in sparse sound field decomposition for super-resolution in recording and reproduction,” in Proceedings of International Congress on Sound and Vibration (ICSV), London, Jul. 2017.
高道慎之介, “最先端の統計的音声処理,” in 徳山高専テクノ・アカデミア技術セミナー, 2017.
高道慎之介, “深層学習を深く学習するための基礎,” in 日本音響学会2017年秋季研究発表会ビギナーズセミナー, 2017.
高道慎之介, “音声なりすまし検出技術とそれを騙す音声合成技術,” in セコムIS研究所トーク, 2017.
高道慎之介, “音声合成・変換あれこれ〜声のゆらぎの再現と音声翻訳応用〜,” in 第1回若手異分野交流研究会, 2017.
猿渡洋, “ブラインド音源分離　～時空間スモールデータの非ガウス・低ランクモデリングとその最適化の数理～,” in 人工知能学会合同研究会, 2017.
猿渡洋, “ブラインド音源分離再考 -時空間の非ガウス・スパース・低ランクモデリング-,” in 日本音響学会2017年春季研究発表会講演論文集, 2017, pp. 1–8-12.
亀岡弘和, 小野順貴, and 猿渡洋, “音響分野におけるブラインド適応信号処理の展開,” in 電子情報通信学会総合大会, 2017, no. AI-2-2.

2016

H. Nakajima, D. Kitamura, N. Takamune, S. Koyama, H. Saruwatari, Y. Takahashi, and K. Kondo, “Audio signal separation using supervised NMF with time-variant all-pole-model-based basis deformation,” in Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Jeju, Dec. 2016.
S. Takamichi, “Speech synthesis that deceives anti-spoofing verification,” in NII Talk, Dec. 2016.
高道慎之介, “なりすましセキュリティを騙す高品質音声合成・変換,” in 4th Microsoft Research Intern Alumni Networking, Dec. 2016.
高道慎之介, “統計的音声合成・変換の高品質化と言語教育応用,” in 京都大学学術情報メディアセンターセミナー, Nov. 2016.
高道慎之介, “統計的音声合成・変換の高品質化とその応用,” in 山梨大学テニュアトラック普及・定着事業第56回サイエンスカフェ講演会, Nov. 2016.
S. Koyama, N. Murata, and H. Saruwatari, “Super-resolution in sound field recording and reproduction based on sparse representation,” in Proceedings of 5th Joint Meeting of the Acoustical Society of America and Acoustical Society of Japan, Honolulu, Nov. 2016.
H. Saruwatari, K. Takata, N. Ono, and S. Makino, “Flexible microphone array based on multichannel nonnegative matrix factorization and statistical signal estimation,” in The 22nd International Congress on Acoustics (ICA2016), Sep. 2016, no. ICA2016-312.
S. Koyama, “Source-Location-Informed Sound Field Recording and Reproduction: A Generalization to Arrays of Arbitrary Geometry,” in Proceedings of 2016 AES International Conference on Sound Field Control, Guildford, Jul. 2016 [Online]. Available at: http://www.aes.org/e-lib/browse.cfm?elib=18303

2015

S. Koyama, A. Matsubayashi, N. Murata, and H. Saruwatari, “Sparse Sound Field Decomposition Using Group Sparse Bayesian Learning,” in Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Dec. 2015, pp. 850–855.
小山翔一, “スパース信号表現って何？,” in 日本音響学会秋季研究発表会, ビギナーズセミナー：今話題のあの技術ってどうなってるの？, Sep. 2015.
D. Kitamura, N. Ono, H. Sawada, H. Kameoka, and H. Saruwatari, “Relaxation of rank-1 spatial constraint in overdetermined blind source separation,” in in Proceedings of The 2015 European Signal Processing Conference (EUSIPCO2015), Nice, Sep. 2015, pp. 1271–1275.
H. Saruwatari, “Statistical-model-based speech enhancement With musical-noise-free properties,” in in Proceedings of 2015 IEEE International Conference on Digital Signal Processing (DSP2015), Singapore, 2015.
猿渡洋, “統計的バイノーラル信号表現とその音源分離への応用,” in 電子情報通信学会技術研究報告（電気／応用音響）, 2015.

2014

D. Kitamura, H. Saruwatari, S. Nakamura, Y. Takahashi, K. Kondo, and H. Kameoka, “Hybrid multichannel signal separation using supervised nonnegative matrix factorization with spectrogram restoration,” in Proceedings of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Siem Reap, Dec. 2014.
猿渡洋, “高次統計量は何を語る？～教師無し学習に基づく自律的な音メディア信号処理～,” in 音学シンポジウム2014, May 2014.
小山翔一, “音場再現技術の基本原理と展開,” in 音学シンポジウム2014, May 2014.

解説記事など

2025

齋藤佑樹, 中田亘, 山内一輝, and 朴浚鎔, “大規模事前学習モデルに基づく音声合成,” 日本音響学会誌, Oct. 2025.

2024

岡本悠希 and 井本桂右, “統計的手法による環境音・効果音合成,” 日本音響学会誌, vol. 80, no. 12, Dec. 2024.
齋藤佑樹, “共感的対話音声合成－より表現力豊かな音声対話システムに向けた音声合成－,” 日本音響学会誌, Dec. 2024.
佐伯高明 and 高道慎之介, “最先端の予測性能を持つ合成音声品質の自動評価システム UTMOS について,” 日本音響学会誌, Jul. 2024.

2021

高道慎之介, “2021年総合大会報告,” 電子情報通信学会情報・システムソサイエティ誌, May 2021. (to appear)
高道慎之介, “音声アバターを選ぶ時代～ボイスチェンジャー技術の動向～,” in 電気学会誌, 141, Feb. 2021, no. 2, pp. 93–96.

2019

高道慎之介, “位相のための確率分布と深層学習について教えて下さい,” 日本音響学会誌, vol. 75, no. 3, Mar. 2019.

2018

高道慎之介, “研究と実応用の話,” 日本音響学会誌, vol. 74, no. 1, Jan. 2018.
高道慎之介 and 戸田智基, “音声翻訳システムにおける音声変換の利用,” 日本音響学会誌, vol. 74, no. 9, 2018.

2017

小山翔一, “未来の音の収録・再生・編集技術の実現に向けて（創立100周年記念特集「基礎・境界」が支えた100年，これからの100年）,” 電子情報通信学会誌, vol. 100, no. 6, pp. 474–478, 2017.

2015

小山翔一, “スパース音場表現に基づく超解像型収音・再現 (\<小特集\>スパース表現に基づく音響信号処理),” 日本音響学会誌, vol. 71, no. 11, pp. 632–638, 2015.

2014

猿渡洋, “小特集「マイクロホンアレーの新しい技術展開」にあたって,” 日本音響学会誌, vol. 70, no. 7, pp. 371–372, Jul. 2014.
古家賢一 and 小山翔一, “波面合成技術の原理と応用 (特集: 立体音響技術),” 映像情報メディア学会誌, vol. 68, no. 8, pp. 621–624, 2014.
小山翔一, “サイバースペースにおける空間 (ちょっとしたエッセイ, コーヒーブレーク),” 日本音響学会誌, vol. 70, no. 4, p. 231, 2014.
猿渡洋, “逆フィルタによる音場再現技術 (特集: 立体音響技術),” 映像情報メディア学会誌, vol. 68, no. 8, pp. 612–615, 2014.

プレプリント

2020

N. T. Shinnosuke Takamichi Mamoru Komachi and H. Saruwatari, “JSSS: free Japanese speech corpus for summarization and simplification,” in arXiv, Oct. 2020.