Kohei Yatabe 研究室

主宰者：Kohei Yatabe

東京農工大学

AI 要約（直近 5 年の研究成果）

音声・音響信号処理における様々な逆問題を、最適化手法を用いて解く研究に取り組んでいます。具体的には、複数の音声混合から各音源を分離する問題、音声に残存する残響を除去する問題、破損した音声区間の復元といった、実環境での音響処理に必要な課題に着目しています。これらの課題に対して、時間周波数領域での信号表現や凸最適化、特に交互方向乗数法(ADMM)などの高速な最適化アルゴリズムを活用して、効率的な解法を開発しています。深層神経網(DNN)を音響信号処理に統合する研究も展開しており、従来のモデルベースの手法とDNNの学習能力を組み合わせることで、両者の長所を活かした手法の構築を目指しています。また、異なるサンプリング周波数で訓練された音響処理モデルの性能を維持するための技術開発にも取り組んでおり、実用的な信号処理システムの構築に向けて研究を進めています。さらに、音響計測の一般的な枠組みを提案し、通常の試験信号だけでなく音楽を含む任意の音を計測に用いる手法を開発しています。これらのツールは公開されており、研究者や教育の現場での広範な活用を想定した、実践的で応用性の高い研究を特徴としています。

※ AI（Claude）が、公開されている論文要旨から研究の問い・手法・主要な発見を事実情報として抽出・再構成して自動生成しています。誤りを含む可能性があるため、正確性は研究室公式情報でご確認ください。

外部リンク

研究成果（73 件）

[2026] Encoder-masking-decoder networks using orthogonal convolutional layer as invertible linear encoder
DOI: https://doi.org/10.1250/ast.e26.10
[2025] Acceleration of Optimization-based Structured Sparse Time-Frequency Analysis by ADMM
DOI: https://doi.org/10.1109/sampta64769.2025.11133562
[2025] Stride conversion algorithms for convolutional layers and its application to sampling-frequency-independent deep neural networks
DOI: https://doi.org/10.1016/j.sigpro.2025.110420
[2025] Fast and flexible algorithm for determined blind source separation based on alternating direction method of multipliers
DOI: https://doi.org/10.1250/ast.e25.46
[2025] Sound Safeguarding for Acoustic Measurement Using Any Sounds: Tools and Applications
DOI: https://doi.org/10.1109/gcce65946.2025.11275437
[2025] On the crossband filter representations of LTI systems: Algorithms and inversion
DOI: https://doi.org/10.1250/ast.e25.14
[2025] Determined Blind Source Separation Using Metric Projection and Proximity Operator of Log-Det Function Under Projection-back Constraint
DOI: https://doi.org/10.1109/sampta64769.2025.11133538
[2025] Gamma-von-Mises restricted Boltzmann machine and its application to audio modeling
DOI: https://doi.org/10.1250/ast.e24.95
[2025] Subband splitting: Simple, efficient and effective technique for solving block permutation problem in determined blind source separation
DOI: https://doi.org/10.1250/ast.e25.04
[2025] Local Equivariance Error-Based Metrics for Evaluating Sampling-Frequency-Independent Property of Neural Network
DOI: https://doi.org/10.23919/eusipco63237.2025.11226337

続きを表示（残り 63 件）

[2025] Single-channel blind dereverberation based on sparse matrix recovery with reweighting and accelerated alternating direction method of multipliers
DOI: https://doi.org/10.1250/ast.e24.119
[2025] All-pass filter simulating cochlear delay characteristics and its musical applications
DOI: https://doi.org/10.1250/ast.e25.07
[2024] Fringe pattern analysis based on the two-dimensional synchrosqueezing transform
DOI: https://doi.org/10.1364/ol.530258
[2024] Ptychographic phase retrieval via a deep-learning-assisted iterative algorithm
DOI: https://doi.org/10.1107/s1600576724006897
[2024] Proposal of Protocols for Speech Materials Acquisition and Presentation Assisted By Tools Based on Structured Test Signals
DOI: https://doi.org/10.1109/o-cocosda64382.2024.10800149
[2024] Neural Analog Filter for Sampling-Frequency-Independent Convolutional Layer
DOI: https://doi.org/10.1561/116.20230082
[2024] Subgradient-projection-based stable phase-retrieval algorithm for X-ray ptychography
DOI: https://doi.org/10.1107/s1600576724004709
[2024] Determined BSS by Combination of IVA and DNN via Proximal Average
DOI: https://doi.org/10.1109/icassp48485.2024.10448266
[2024] Harmonic/Percussive Source Separation Based on Anisotropic Smoothness of Magnitude Spectrograms via Convex Optimization
DOI: https://doi.org/10.1109/lsp.2024.3459811
[2024] Single-Channel Blind Dereverberation Based on Rank-1 Matrix Lifting in Time-Frequency Domain
DOI: https://doi.org/10.1109/icassp48485.2024.10446726
[2024] Drumhead tuning based on vibration mode visualization using Fourier transform profilometry
DOI: https://doi.org/10.1250/ast.e23.40
[2024] PHAIN: Audio Inpainting via Phase-Aware Optimization With Instantaneous Frequency
DOI: https://doi.org/10.1109/taslp.2024.3463415
[2023] HIGH-SPEED OPTICAL IMAGING AND SPATIO-TEMPORAL ANALYSIS OF SOUND SOURCES OF EDGE TONE PHENOMENA
DOI: https://doi.org/10.25144/14930
[2023] Does controller sound contain valuable information for video game scene analysis? Case study by character identification of Super Smash Bros. Ultimate
DOI: https://doi.org/10.1250/ast.e23.67
[2023] Modeling source directivity by solving inverse problems
DOI: https://doi.org/10.3397/in_2023_0894
[2023] Simultaneous Measurement of Multiple Acoustic Attributes Using Structured Periodic Test Signals Including Music and Other Sound Materials
DOI: https://doi.org/10.1109/apsipaasc58517.2023.10317411
[2023] Blind Source Separation Using Independent Low-Rank Matrix Analysis with Spectrogram-Consistency Regularization
DOI: https://doi.org/10.1109/apsipaasc58517.2023.10317156
[2023] Acoustic measurement framework for audio systems based on structured periodic test signals
DOI: https://doi.org/10.1109/gcce59613.2023.10315633
[2023] The frequency modulation transfer function provides supplemental measures for the objective evaluation of pitch extractors
DOI: https://doi.org/10.1121/10.0023432
[2023] Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations
DOI: https://doi.org/10.1109/waspaa58266.2023.10248089
[2023] Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides
DOI: https://doi.org/10.23919/eusipco58844.2023.10289819
[2023] On-Line Chord Recognition Using FifthNet with Synchrosqueezing Transform
DOI: https://doi.org/10.23919/eusipco58844.2023.10289838
[2023] Computationally efficient transparent sound source for the finite-difference time-domain method
DOI: https://doi.org/10.1250/ast.44.371
[2023] LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus
DOI: https://doi.org/10.21437/interspeech.2023-1584
[2023] Synthesizing Speech from ECoG with a Combination of Transformer-Based Encoder and Neural Vocoder
DOI: https://doi.org/10.1109/icassp49357.2023.10097004
[2023] Improving Phase-Vocoder-Based Time Stretching by Time-Directional Spectrogram Squeezing
DOI: https://doi.org/10.1109/icassp49357.2023.10095348
[2023] UPGLADE: Unplugged Plug-and-Play Audio Declipper Based on Consensus Equilibrium of DNN and Sparse Optimization
DOI: https://doi.org/10.1109/icassp49357.2023.10095928
[2023] Determination of microphone acoustic center from sound field projection measured by optical interferometry
DOI: https://doi.org/10.1121/10.0017246
[2023] Experimental observation of the sound field around a moving source using parallel phase-shifting interferometry
DOI: https://doi.org/10.3397/in_2022_0525
[2023] High-speed optical imaging and spatio-temporal analysis of sound sources of edge tone phenomena
DOI: https://doi.org/10.3397/in_2022_0613
[2023] Wavefit: an Iterative and Non-Autoregressive Neural Vocoder Based on Fixed-Point Iteration
DOI: https://doi.org/10.1109/slt54892.2023.10022496
[2023] Versatile Time-Frequency Representations Realized by Convex Penalty on Magnitude Spectrogram
DOI: https://doi.org/10.1109/lsp.2023.3303699
[2022] Safeguarding test signals for acoustic measurement using arbitrary sounds: Measuring impulse response by playing music
DOI: https://doi.org/10.1250/ast.43.209
[2022] Speckle holographic imaging of a sound field using Fresnel lenses
DOI: https://doi.org/10.1364/ol.469972
[2022] An objective test tool for pitch extractors' response attributes
DOI: https://doi.org/10.21437/interspeech.2022-800
[2022] On-line sound event localization and detection for real-time recognition of surrounding environment
DOI: https://doi.org/10.1016/j.apacoust.2022.108961
[2022] Convex-optimization-based post-processing for computing room impulse response by frequency-domain FEM
DOI: https://doi.org/10.1016/j.apacoust.2022.108988
[2022] Phase retrieval based on a total-variation-regularized Poisson model for X-ray ptychographic imaging of low-contrast objects
DOI: https://doi.org/10.1107/s1600576722005234
[2022] Underwater sound visualization and temperature measurement using high-speed interferometer
DOI: https://doi.org/10.1250/ast.43.177
[2022] Harmonic and Percussive Sound Separation Based on Mixed Partial Derivative of Phase Spectrogram
DOI: https://doi.org/10.1109/icassp43922.2022.9747057
[2022] APPLADE: Adjustable Plug-and-Play Audio Declipper Combining DNN with Sparse Optimization
DOI: https://doi.org/10.1109/icassp43922.2022.9747089
[2022] Acoustic Application of Phase Reconstruction Algorithms in Optics
DOI: https://doi.org/10.1109/icassp43922.2022.9747423
[2022] SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping
DOI: https://doi.org/10.21437/interspeech.2022-301
[2022] Window Functions With Minimum-Sidelobe Derivatives for Computing Instantaneous Frequency
DOI: https://doi.org/10.1109/access.2022.3161543
[2022] Sampling-Frequency-Independent Convolutional Layer and its Application to Audio Source Separation
DOI: https://doi.org/10.1109/taslp.2022.3203907
[2022] Online Phase Reconstruction via DNN-Based Phase Differences Estimation
DOI: https://doi.org/10.1109/taslp.2022.3221041
[2022] Wearable Seld Dataset: Dataset For Sound Event Localization And Detection Using Wearable Devices Around Head
DOI: https://doi.org/10.1109/icassp43922.2022.9746544
[2022] Visualization of sound wave from high-speed moving source
DOI: https://doi.org/10.1250/ast.43.339
[2022] EXPERIMENTAL OBSERVATION OF THE SOUND FIELD AROUND A MOVING SOURCE USING PARALLEL PHASE-SHIFTING INTERFEROMETRY
DOI: https://doi.org/10.25144/14194
[2021] Phase-recovery algorithm for harmonic/percussive source separation based on observed phase information and analytic computation
DOI: https://doi.org/10.1250/ast.42.261
[2021] Mixture of Orthogonal Sequences Made from Extended Time-Stretched Pulses Enables Measurement of Involuntary Voice Fundamental Frequency Response to Pitch Perturbation
DOI: https://doi.org/10.21437/interspeech.2021-2073
[2021] Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
DOI: https://doi.org/10.23919/eusipco54536.2021.9616166
[2021] Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
DOI: https://doi.org/10.23919/eusipco54536.2021.9615941
[2021] Sparse Distortionless Beamformer Based on Nonconvex Optimization
DOI: https://doi.org/10.23919/eusipco54536.2021.9615982
[2021] Simultaneous Declipping and Beamforming via Alternating Direction Method of Multipliers
DOI: https://doi.org/10.23919/eusipco54536.2021.9616089
[2021] Linear Multichannel Blind Source Separation based on Time-Frequency Mask Obtained by Harmonic/Percussive Sound Separation
DOI: https://doi.org/10.1109/icassp39728.2021.9413494
[2021] Sparse Time-Frequency Representation Via Atomic Norm Minimization
DOI: https://doi.org/10.1109/icassp39728.2021.9414921
[2021] Determined BSS Based on Time-Frequency Masking and Its Application to Harmonic Vector Analysis
DOI: https://doi.org/10.1109/taslp.2021.3073863
[2021] Gamma Boltzmann Machine for Audio Modeling
DOI: https://doi.org/10.1109/taslp.2021.3095656
[2021] Interactive and real-time acoustic measurement tools for speech data acquisition and presentation: Application of an extended member of time stretched pulses
[2021] Phase Retrieval in Acoustical Signal Processing
DOI: https://doi.org/10.1587/essfr.15.1_25
[2021] Cascaded All-Pass Filters with Randomized Center Frequencies and Phase Polarity for Acoustic and Speech Measurement and Data Augmentation
DOI: https://doi.org/10.1109/icassp39728.2021.9415057
[2021] Tools and practice for supporting recommended protocol for acoustic recording of speech data for high usability -- Application of a cascaded all-pass filters with randomized center frequencies and phase polarities

科研費（0 件）

まだデータがありません（KAKEN 取り込み後に表示）。

所属学会・役職（0 件）

まだデータがありません（学会データ連携後に表示）。

AI 要約（直近 5 年の研究成果）

外部リンク

関連研究室(8 件)

研究成果（73 件）

科研費（0 件）

所属学会・役職（0 件）