Session Program Day 3
Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2022
Session | Room | Chair | |
ThAM1-1 (Image Video Multimedia) | Chiang Mai 1 | Masaaki Ikehara | |
Date | Time | Title | Authors |
10 November 2022 | 9.00-9.20 | Single Image Raindrop Removal Using a Non-Local Operator and Feature Maps in the Frequency Domain | Shinya Ezumi; Masaaki Ikehara |
9.20-9.40 | Dual-Teacher Distillation for Low-Light Image Enhancement | Jeong-Hyeok Park; Tae-Hyeon Kim; Jong-Ok Kim | |
9.40-10.00 | Automatic Data Augmentation Method With Improved Interpretability for Image Classification in Computer Vision Applications | Dair Ungarbayev; Osman Demirel; Muhammad Tahir Akhtar | |
10.00-10.20 | Learning to Sharpen Partially Blurred Image via Iterative Blurred Region Mining and Recovery | Jung Yeh; Wen-Li Wei; Duan-Yu Chen; Jen-Chun Lin | |
10.20-10.40 | Shape-Bias Evaluation of Pretrained Models Using Image Decomposition | Akinori Iwata; Masahiro Okuda | |
10.40-11.00 | Proposal of Associative Watermarking Method | Ryoto Kanegae; Masaki Kawamura | |
Session | Room | Chair | |
ThAM1-2 (Speech, Language, and Audio 1) | Chiang Mai 2 | Ying Hu/ Toshio Irino | |
Date | Time | Title | Authors |
10 November 2022 | 9.00-9.20 | DMF-Net: A Decoupling-Style Multi-Band Fusion Model for Full-Band Speech Enhancement | Guochen Yu; Yuansheng Guan; Weixin Meng; Chengshi Zheng; Hui Wang; Yutian Wang |
9.20-9.40 | Speak Like a Dog: Human to Non-Human Creature Voice Conversion | Kohei Suzuki; Shoki Sakamoto; Tadahiro Taniguchi; Hirokazu Kameoka | |
9.40-10.00 | Pre-Trained Multimodal End-To-End Network for Spoken Language Assessment Incorporating Prompts | Binghuai Lin; Liyuan Wang | |
10.00-10.20 | Gated Fusion of Handcrafted and Deep Features for Robust Automatic Pronunciation Assessment | Binghuai Lin; Liyuan Wang | |
10.20-10.40 | Effective Data Screening Technique for Crowdsourced Speech Intelligibility Experiments: Evaluation With IRM-Based Speech Enhancement | Ayako Yamamoto; Toshio Irino; Shoko Araki; Kenichi Arai; Atsunori Ogawa; Keisuke Kinoshita; Tomohiro Nakatani | |
Session | Room | Chair | |
ThAM1-3 (Deep Learning: Algorithm, Implementations, and Applications) | Chiang Mai 3 | Kasemsit Teeyapan | |
Date | Time | Title | Authors |
10 November 2022 | 9.00-9.20 | Leveraging Pre-Trained Acoustic Feature Extractor for Affective Vocal Bursts Tasks | Bagus Tris Atmaja; Akira Sasou |
9.20-9.40 | Flow-Based Variational Sequence Autoencoder | Jen-Tzung Chien; Tien-Ching Luo | |
9.40-10.00 | Speech Intelligibility Prediction Through Direct Estimation of Word Accuracy Using Conformer | Naoyuki Kamo; Kenichi Arai; Atsunori Ogawa; Shoko Araki; Tomohiro Nakatani; Keisuke Kinoshita; Marc Delcroix; Tsubasa Ochiai; Toshio Irino | |
10.00-10.20 | DNN-Rule Hybrid Dyna-Q for Sample-Efficient Task-Oriented Dialog Policy Learning | Mingxin Zhang; Takahiro Shinozaki | |
10.20-10.40 | MoCoVC: Non-Parallel Voice Conversion With Momentum Contrastive Representation Learning | Kotaro Onishi; Toru Nakashika | |
10.40-11.00 | Controllable Voice Conversion Based on Quantization of Voice Factor Scores | Takumi Isako; Kotaro Onishi; Takuya Kishida; Toru Nakashika | |
Session | Room | Chair | |
ThAM1-4 (Biomedical Signal Processing and Systems) | Board Room 2 | Daranee Hormdee | |
Date | Time | Title | Authors |
10 November 2022 | 9.00-9.20 | Deep Adaptive Denoising Auto-Encoder Networks for ECG Noise Cancelation via Time-Frequency Domain | Amir Mohammadisarab; Poorya Aghaomidi; Jalil Mazloum; Mohammad Ali Akbarzadeh; Mahdi Orooji; Nader Mokari; Halim Yanikomeroglu |
9.20-9.40 | User-Item Recommendation Approaches to Detect Genomic Variant Interactions | Emma Andrade; Nicholas Tom; Mario Banuelos | |
9.40-10.00 | Teager Energy Cepstral Coefficients for Classification of Dysarthric Speech Severity-Level | Aastha Kachhi; Anand Therattil; Ankur T. Patil; Hardik B. Sailor; Hemant A. Patil | |
10.00-10.20 | Decoding Emotional Valence from EEG in Immersive Virtual Reality | Guanxiong Pei; Bingjie Li; Taihao Li; Ruohao Xu; Jianmin Dong; Jia Jin | |
10.20-10.40 | Design of A Wearable System for Hypoxic Training Management Using Blood Oxygenation and Heart Rate | Takuma Kitagawa; Toshitaka Yamakawa | |
10.40-11.00 | MedBERT: A Pre-Trained Language Model for Biomedical Named Entity Recognition | Charangan Vasantharajan; Kyaw Zin Tun; Ho Thi-Nga; Sparsh Jain; Tong Rong; Chng Eng Siong | |
Session | Room | Chair | |
ThAM1-5 (SS21: Recent Advances and Applications in Encrypted Domain) | Board Room 3 | Simying Ong | |
Date | Time | Title | Authors |
10 November 2022 | 9.00-9.20 | Encrypted JPEG Image Retrieval via Huffman-Code Based Self-Attention Networks | Zhixun Lu; Qihua Feng; Peiya Li |
9.20-9.40 | Reversible Data Hiding in Encrypted Text Using Paillier Cryptosystem | Asad Malik; Aeyan Ashraf; Hanzhou Wu; Minoru Kuribayashi | |
9.40-10.00 | Scrambling-Embedding in Partially-Encrypted Images | Koi Yee Ng, Simying Ong | |
10.00-10.20 | Image Classification Using Vision Transformer for EtC Images | Genki HAMANO; Shoko IMAIZUMI; Hitoshi KIYA | |
10.20-10.40 | Image Watermarking Based on Saliency Detection and Multiple Transformations | Ahmed Khan; KokSheik Wong; Vishnu Monn Baskaran | |
Session | Room | Chair | |
ThAM1-6 (SS19: Towards real-world human-centric acoustic signal processing) | Chiang Mai 4 | Sermsak Uatrongjit | |
Date | Time | Title | Authors |
10 November 2022 | 9.00-9.20 | A Fast Converge Spectral Modulation Sensitive Active Noise Control System | Kah-Meng Cheong; Yih Liang Shen; Tai-Shih Chi |
9.20-9.40 | Multimodal Forgery Detection Using Ensemble Learning | Ammarah Hashmi; Sahibzada Adil Shahzad; Wasim Ahmad;Chia Wen Lin;Yu Tsao;Hsin-Min Wang | |
9.40-10.00 | Speech Enhancement-Assisted Voice Conversion in Noisy Environments | Yun-Ju Chan; Chiang-Jen Peng; Syu-Siang Wang; Hsin-Min Wang; Yu Tsao; Tai-Shih Chi | |
10.00-10.20 | Effect of Noise on the Perceptual Contribution of Cochlea-Scaled Entropy and Speech Level in Mandarin Sentence Understanding | Weikang Wu; Shangdi Liao; Fei Chen | |
10.20-10.40 | EEG-Based Auditory Attention Detection With Estimated Speech Sources Separated From an Ideal-Binary-Masking Process | Lei Wang; Fei Chen | |
10.40-11.00 | Automatic Step Detection of Tandem Gait Test in Patients With Vestibular Hypofunction Using Wearable Sensors | Yi-Ju Huang; Chien-Pin Liu; Kuan-Chung Ting; Chia-Yeh Hsieh; Kai-Chun Liu; Chia-Tai Chan | |
Session | Room | Chair | |
ThAM1-7 (SS22: Recent Advances in Biometrics and Security) | Chiang Mai 5 | Koichi Ito | |
Date | Time | Title | Authors |
10 November 2022 | 9.00-9.20 | Continuous Authentication for Smartphones Using Face Images and Touch-Screen Operation | Shuto Kinoshita; Yuka Watanabe; Yasushi Yamazaki |
9.20-9.40 | Spoofing Attack Detection in Face Recognition System Using Vision Transformer With Patch-Wise Data Augmentation | Kota Watanabe; Koichi Ito; Takafumi Aoki | |
9.40-10.00 | A Simple and Accurate CNN for Iris Recognition | Shokei Kawakami; Hiroya Kawai; Koichi Ito; Takafumi Aoki; Yoshiko Yasumura; Masakazu Fujio; Yosuke Kaga; Kenta Takahashi | |
10.00-10.20 | Eyeglass Frame Segmentation for Face Image Processing | Kanta Miura; Takamichi Miyamoto; Kazuyuki Sakurai; Koichi Ito; Takafumi Aoki | |
10.20-10.40 | A Fair Model is Not Fair in a Biased Environment | Yuya Sato; Soshi Maeda; Muku Akasaka; Masakatsu Nishigaki; Tetsushi Ohki | |
Session | Room | Chair | |
ThAM1-8 (Other related speech processing) | Board Room 4 | Sansanee Auephanwiriyakul | |
Date | Time | Title | Authors |
10 November 2022 | 9.00-9.20 | Intelligibility Prediction of Enhanced Speech Using Recognition Accuracy of End-To-End ASR System | Kenichi Arai; Atsunori Ogawa; Shoko Araki; Keisuke Kinoshita; Tomohiro Nakatani; Naoyuki Kamo; Toshio Irino |
9.20-9.40 | Hi, KIA: A Speech Emotion Recognition Dataset for Wake-Up Words | Taesu Kim; SeungHeon Doh; Gyunpyo Lee; Hyeongseok Jeon; Juhan Nam; Hyeon-Jeong Suk | |
9.40-10.00 | Improving Speech Emotion Recognition via Fine-Tuning ASR With Speaker Information | Bao Thang Ta, Tung Lam Nguyen, Dinh Son Dang, Nhat Minh Le, Van Hai Do | |
10.00-10.20 | 3CMLF: Three-Stage Curriculum-Based Mutual Learning Framework for Audio-Text Retrieval | Yi-Wen Chao; Dongchao Yang; Rongzhi Gu; Yuexian Zou | |
Session | Room | Chair | |
ThPM1-1 (Image Video Multimedia) | Chiang Mai 1 | Masaki Kawamura | |
Date | Time | Title | Authors |
10 November 2022 | 12.30-12.50 | Neural Network Based Watermarking Trained With Quantized Activation Function | Shingo Yamauchi; Masaki Kawamura |
12.50-13.10 | A Multiframe Super-Resolution Pipeline for Sub-Image-Typed Light Field Data | Chien-Han Hsu; Yi-Hsien Lin; Yen-Po Lin; Yi-Chang Lu | |
13.10-13.30 | Restoring Edge and Color Using Weighted Near-Infrared Image and Color Transmission Maps for Robust Haze Removal | Onhi Kato; Akira Kubota | |
13.30-13.50 | Dense View Interpolation of 4D Light Fields for Real-Time Augmented Reality Applications | Hidemichi Yoshino; Kazuya Kodama; Takayuki Hamamoto | |
13.50-14.10 | Bolt Looseness Identification Using Faster R-CNN and Grid Mask Augmentation | Natchapon Panmatharit; Yuttapong Jiraraksopakun; Anek Siripanichgorn; Punnarai Siricharoen | |
14.10-14.30 | Large-Scale Blind Face Super-Resolution via Edge Guided Frequency Aware Generative Facial Prior Networks | Xi Cheng; Wan-Chi Siu; Jian Yang | |
Session | Room | Chair | |
ThPM1-2 (Speech, Language, and Audio 1) | Chiang Mai 2 | Takanobu Nishiura | |
Date | Time | Title | Authors |
10 November 2022 | 12.30-12.50 | Language-Based Audio Retrieval With Converging Tied Layers and Contrastive Loss | Andrew Koh; Chng Eng Siong |
12.50-13.10 | D²Net: A Denoising and Dereverberation Network Based on Two-Branch Encoder and Dual-Path Transformer | Liusong Wang; Wenbing Wei; Yadong Chen; Ying Hu | |
13.10-13.30 | Direct Speech-Reply Generation From Text-Dialogue Context | Kenichi Fujita; Yusuke Ijima; Hiroaki Sugiyama | |
13.30-13.50 | Sequence-Wise Optimization for Quasi-Harmonic Speech Waveform Modeling | Shaowen Chen; Tomoki Toda | |
13.50-14.10 | Lattice-Based Data Augmentation for Code-Switching Speech Recognition | Roland Hartanto; Kuniaki Uto; Koichi Shinoda | |
14.10-14.30 | Phase-Aware Audio Super-Resolution for Music Signals Using Wasserstein Generative Adversarial Network | Yanqiao Yan; Binh Thien Nguyen; Yuting Geng; Kenta Iwai; Takanobu Nishiura | |
Session | Room | Chair | |
ThPM1-3 (Deep Learning: Algorithm, Implementations, and Applications) | Chiang Mai 3 | Jen-Chun Lin | |
Date | Time | Title | Authors |
10 November 2022 | 12.30-12.50 | Speech Emotion Recognition Based on the Reconstruction of Acoustic and Text Features in Latent Space | Jennifer Santoso; Rintaro Sekiguchi; Takeshi Yamada; Kenkichi Ishizuka; Taiichi Hashimoto; Shoji Makino |
12.50-13.10 | A Light CNN With Split Batch Normalization for Spoofed Speech Detection Using Data Augmentation | Haojian Lin; Yang Ai; Zhenhua Ling | |
13.10-13.30 | On the Optimal Classifier for Affective Vocal Bursts and Stuttering Predictions Based on Pre-Trained Acoustic Embedding | Bagus Tris Atmaja; Zanjabila; Akira Sasou | |
13.30-13.50 | Nonlinear Residual Echo Suppression Based on Gated Dual Signal Transformation LSTM Network | Kai Xie; Ziye Yang; Jie Chen | |
13.50-14.10 | Adaptive End-To-End Text-To-Speech Synthesis Based on Error Correction Feedback From Humans | Kazuki Fujii; Yuki Saito; Hiroshi Saruwatari | |
14.10-14.30 | Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-To-Speech | Byoung Jin Choi; Myeonghun Jeong; Minchan Kim; Sung Hwan Mun; Nam Soo Kim | |
Session | Room | Chair | |
ThPM1-4 (SS07: Latest Wireless Technologies for Sensing and Communications) | Board Room 2 | Osamu Takyu | |
Date | Time | Title | Authors |
10 November 2022 | 12.30-12.50 | Performance Evaluation of FISTA With Constant Inertial Parameter | Kaito Kameda; Ryo Hayakawa; Kazunori Hayashi; Youji Iiguni |
12.50-13.10 | An Approximated ADMM Based Algorithm for \(\ell_1-\ell_2\) Optimization Problem | Rui Lin; Kazunori Hayashi | |
13.10-13.30 | Antenna Beamforming Selection With Low Complexity and High Exploitation of White Space in Frequency Spectrum Sharing | Kizuku Kawamura; Kohei Akimoto; Osamu Takyu | |
13.30-13.50 | Individual Memory Driven Transformer Deep Learning Model for Multi-Cell Massive MIMO Beam Prediction | Taisei Urakami; Haohui Jia; Na Chen; Minoru Okada | |
13.50-14.10 | Deep Unfolding-Aided Sum-Product Algorithm for Error Correction of CRC Coded Short Message | Qilin Zhang; Shinsuke Ibi; Takumi Takahashi; Hisato Iwai | |
14.10-14.30 | Successive Interference Cancellation for Signal Demodulation of Multiple LPWA Systems | Shinichiro Kakuda; Takeo Fujii; Shusuke Narieda | |
Session | Room | Chair | |
ThPM1-5 (SS08: Digital Convergence of 5G/B5G, AIoT and Security) | Board Room 3 | Kampol Woradit | |
Date | Time | Title | Authors |
10 November 2022 | 12.30-12.50 | Evaluation of Voice Service in LEO Communication With 3GPP PUSCH Repetition Enhancement | Shou-Hong Liu; Chun-Tai Liu; Wei-Hung Chou; JenYi Pan |
12.50-13.10 | Modeling of Malware Diffusion With Mobile Devices in Intermittently Connected Networks | Hideyoshi Miura; Shoya Abukawa; Tomotaka Kimura; Kouji Hirata | |
13.10-13.30 | Software Defined Radio Access Network Sharing by Multi-Operator Core Networks | Wen-Ping Lai; Wen-Ru Chen; Ming-Jay Lai; Hong-Lun Lai; Chia-Ying Lin; Po-Chen Tseng | |
13.30-13.50 | Machine Learning Based End-To-End Constellation Training for Communication Systems | Po-Chiang Lin | |
13.50-14.10 | Flow-Based DDoS Detection Using Deep Neural Network With Radial Basis Function Neural Network | Ting-Chung Leung; Lee Chung-Nan | |
14.10-14.30 | Implement a Continuous Learning Model to Detect Different Types of DDoS Attacks With Hierarchical Temporal Memory | Hung Manh Nguyen; Yu-Kuen Lai | |
Session | Room | Chair | |
ThPM1-6 (SS23: Selected Papers from APSIPA Workshop in Hanoi, Vietnam) | Chiang Mai 4 | Nguyen Linh Trung | |
Date | Time | Title | Authors |
10 November 2022 | 12.30-12.50 | Dynamic Hand Gesture Recognition From Egocentric Videos Based on SlowFast Architecture | Ha-Dang Ho, Hong-Quan Nguyen, Thuy-Binh Nguyen, Sinh-Thuong Vu, Thi-Lan Le |
12.50-13.10 | Deep Learning-Based Signal Detection for Dual-Mode Index Modulation 3D-OFDM | Dang-Y Hoang, Tien-Hoa Nguyen, Vu-Duc Ngo, Trung Tan Nguyen†, Nguyen Cong Luong, Thien Van Luong | |
13.10-13.30 | A Comparison of Feature Selection and Feature Extraction in Network Intrusion Detection Systems | Tuan-Cuong Vuong, Hung Tran, Mai Xuan Trang, Vu-Duc Ngo, Thien Van Luong | |
13.30-13.50 | Deep Neural Network-Based Detector for Single-Carrier Index Modulation NOMA | Toan Gian, Vu-Duc Ngo,Tien-Hoa Nguyen, Trung tan Nguyen, Thien Van Luong | |
13.50-14.10 | Vibration Measurement Using Spatial Shifting Coherent Digital Holography | Long Hai Ngo; Quang Duc Pham | |
14.10-14.30 | Robust Online Tucker Dictionary Learning From Multidimensional Data Streams | Le Trung Thanh; Tran Trong Duy; Karim Abed-Meraim; Nguyen Linh Trung; Adel Hafiane | |
Session | Room | Chair | |
ThPM1-7 (SS06: Adversarial Attacks and Defense) | Chiang Mai 5 | Minoru Kuribayashi | |
Date | Time | Title | Authors |
10 November 2022 | 12.30-12.50 | Survey on Vision Based Fake News Detection and Its Impact Analysis | Mehul S Raval; Mohendra Roy; Minoru Kuribayashi |
12.50-13.10 | StyleGAN Encoder-Based Attack for Block Scrambled Face Images | AprilPyone MaungMaung; Hitoshi Kiya | |
13.10-13.30 | On the Adversarial Transferability of ConvMixer Models | Ryota Iijima; Miki Tanaka; Isao Echizen; Hitoshi Kiya | |
13.30-13.50 | Detection and Correction of Adversarial Examples Based on JPEG-Compression-Derived Distortion | Kenta Tsunomori; Yuma Yamasaki; Minoru Kuribayashi; Nobuo Funabiki; Isao Echizen | |
13.50-14.10 | Defense Against Adversarial Examples Using Beneficial Noise | Param Raval; Harin Khakhi; Minoru Kuribayashi; Mehul S. Raval | |
14.10-14.30 | Privacy Protection Against Automated Tracking System Using Adversarial Patch | Hiroto Takiwaki; Minoru Kuribayashi; Nobuo Funabiki; Mehul Shirishchandra Raval | |
Session | Room | Chair | |
ThPM1-8 (Industrial Forum "New era opened by AI-based image processing) | Board Room 4 | Jangwoo Kwon | |
Date | Time | Title | Authors |
10 November 2022 | 12.30-14.30 | Towards Best Possible Deep Learning Acceleration on the Edge – A Compression-Compilation Co-Design Framework | Yanzhi Wang, Northeastern University, Chairman and former CEO of CoCoPIE Inc., USA |
Empowering Future Pathology with Artificial Intelligence | Shuhao Wang, Co-founder and CTO of Thorough Future, China | ||
Session | Room | Chair | |
ThPM2-1 (Image Video Multimedia) | Chiang Mai 1 | Nam Ik Cho | |
Date | Time | Title | Authors |
10 November 2022 | 15.00-15.20 | Syllable Analysis Data Augmentation for Khmer Ancient Palm Leaf Recognition | Nimol Thuon; Jun Du; Jianshu Zhang |
15.20-15.40 | Multi-Class Vehicle Counting System for Multi-View Traffic Videos | Wichukorn Kuntintara; Kanokphan Lertniphonphan; Punnarai Siricharoen | |
15.40-16.00 | Table Structure Recognition Based on Grid Shape Graph | Eunji Lee; Junhyeong Kwon; Haeyoon Yang; Jaewoo Park; Soonyoung Lee; Hyung Il Koo; Nam Ik Cho | |
16.00-16.20 | Feature Distillation Network for Multi-Band NIR Colorization | Tae-Sung Park; Tae-Hyeon Kim; Jong-Ok Kim | |
16.20-16.40 | Blur Detection for Surveillance Camera System | Yikun Pan, Sik-Ho Tsang, Yui-Lam Chan, Daniel P.K. Lun | |
16.40-17.00 | Lip Sync Matters: A Novel Multimodal Forgery Detector | Sahibzada Adil Shahzad; Ammarah Hashmi; Sarwar Khan; Yan-Tsung Peng; Yu Tsao; Hsin-Min Wang | |
Session | Room | Chair | |
ThPM2-2 (Speech, Language, and Audio 1) | Chiang Mai 2 | Kittichai Wantanajittikul | |
Date | Time | Title | Authors |
10 November 2022 | 15.00-15.20 | Frame-Level Matching Scheme Using Posteriorgram Probability Distance of Spoken Data to Improve Search Accuracy of Spoken Term Detection | Reo Minakawa; Kazunori Kojima; Shi-wook Lee; Yoshiaki Itoh |
15.20-15.40 | Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech Synthesis | Yuta Matsunaga; Takaaki Saeki; Shinnosuke Takamichi; Hiroshi Saruwatari | |
15.40-16.00 | Using Perceptual Quality Features in the Design of the Loss Function for Speech Enhancement | Nicholas Eng; Yusuke Hioka; Catherine I Watson | |
16.00-16.20 | Correlation Loss for MOS Prediction of Synthetic Speech | Beibei Hu; Qiang Li | |
16.20-16.40 | Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation | Chunyu Qiang; Peng Yang; Hao Che; Jinba Xiao; Xiaorui Wang; Zhongyuan Wang | |
16.40-17.00 | Classification of Short Audio Acoustic Scenes Based on Data Augmentation Methods | Xuan Zhang; Yunfei Shao; Junjie Xu; Yong Ma; Wei-Qiang Zhang | |
Session | Room | Chair | |
ThPM2-3 (Deep Learning: Algorithm, Implementations, and Applications) | Chiang Mai 3 | Kasemsit Teeyapan | |
Date | Time | Title | Authors |
10 November 2022 | 15.00-15.20 | Improving Unsupervised Anomalous Sound Detection Performance of Autoencoder and Its Variant With Pretrained Deep Belief Network | Yufeng Deng; Jia Liu; Wei-Qiang Zhang |
15.20-15.40 | ASGAN-VC: One-Shot Voice Conversion With Additional Style Embedding and Generative Adversarial Networks | Wei-Cheng Li; Tzer-Jen Wei | |
15.40-16.00 | Fusing Multiple Bandwidth Spectrograms for Improving Speech Enhancement | Hao Shi; Yuchun Shu; Longbiao Wang; Jianwu Dang; Tatsuya Kawahara | |
16.00-16.20 | End-To-End Two-Dimensional Sound Source Localization With Ad-Hoc Microphone Arrays | Yijun Gong; Shupei Liu; Xiao-Lei Zhang | |
16.20-16.40 | Exploring Speaker Age Estimation on Different Self-Supervised Learning Models | Tuan Duc Truong; Tran The Anh; Eng-Siong Chng | |
16.40-17.00 | Mandarin Singing Voice Synthesis With Denoising Diffusion Probabilistic Wasserstein GAN | Yin-Ping Cho; Yu Tsao; Hsin-Min Wang; Yi-Wen Liu | |
Session | Room | Chair | |
ThPM2-4 (SS18: Metaverse: Future of Internet) | Board Room 2 | Navadon Khunlertgit | |
Date | Time | Title | Authors |
10 November 2022 | 15.00-15.20 | Physiological Study on the Effect of Game Events in Response to Player's Laughter | Mikito Fukuda; Yoshiko Arimoto |
15.20-15.40 | Development of a Virtual Telecommunication System Research Laboratory | Siwanart Jearavongtakul; Imran Saeed Mirza; Lunchakorn Wuttisittikulkij; Pruk Sasithong; Suebphong Noisri; Pisit Vanichchanunt | |
15.40-16.00 | Camera-Based Log System for Human Physical Distance Tracking in Classroom | Somrudee Deepaisarn; Angkoon Angkoonsawaengsuk; Charn Arunkit; Chayud Srisumarnk; Krongkan Nimmanwatthana; Nanmanas Linphrachaya; Nattapol Chiewnawintawat; Rinrada Tanthanathewin; Sivakorn Seinglek; Suphachok Buaruk; Virach Sornlertlamvanich | |
16.00-16.20 | Detecting Replay Attacks Using Single-Channel Audio: The Temporal Autocorrelation of Speech | Shih-Kuang Lee; Yu Tsao; Hsin-Min Wang | |
Session | Room | Chair | |
ThPM2-5 ( Wireless Communication and networking) | Board Room 3 | Poompat Saengudomlert | |
Date | Time | Title | Authors |
10 November 2022 | 15.00-15.20 | Automatic Detection of Dimmable Pulse Position Modulation for Visible Light Communication | Poompat Saengudomlert; Karel Sterckx |
15.20-15.40 | Estimation of Angular Power Spectrum Using Multikernel Adaptive Filtering | Eiji Ninomiya; Masahiro Yukawa; Renato L. G. Cavalcante; Lorenzo Miretti | |
15.40-16.00 | Novel Smart Sectoring and Beam Designs in mmWave Broadcast Channels | Yan-Yin He; Shang-Ho (Lawrence) Tsai; Jen-Ming Wu | |
16.00-16.20 | New Methods for Fast Detection for Embedded Cognitive Radio | Grégoire de Broglie; Louis Morge-Rollet; Denis Le Jeune; Frédéric Le Roy; Christian Roland; Charles Canaff; Jean-Philippe Diguet | |
Session | Room | Chair | |
ThPM2-6 (SS23: Selected Papers from APSIPA Workshop in Hanoi, Vietnam) | Chiang Mai 4 | Nguyen Linh Trung | |
Date | Time | Title | Authors |
10 November 2022 | 15.00-15.20 | Needle Localization and Segmentation for Radiofrequency Ablation of Liver Tumors Under CT Image Guidance | Le Quoc Anh; Luu Manh Ha; Theo van Walsum; Adriaan Moelker; Dao Viet Hang; Pham Cam Phuong; Vu Duy Thanh |
15.20-15.40 | End-To-End Visual-Guided Audio Source Separation With Enhanced Losses | Duc-Huy Pham; Quang-Anh Do; Thanh Thi-Hien Duong; Thi-Lan Le; Phi Le Nguyen | |
15.40-16.00 | Automated Classification of Lung Injury From X-Ray Images Using Deep Learning Network | Huy Le; Thanh-Ha Do | |
16.00-16.20 | AI-Based Video Analysis for Traffic Monitoring | Bui Son Tung; Phung The Ngoc; Do Duy Thanh; Nguyen Hong Thinh | |
16.20-16.40 | Adaptive Filtering-Based Heavy-Noise Removal in Born Iterative Method | Tran Quang-Huy; Luong Thi Theu; Nguyen Canh Minh; Duc-Nghia Tran; Duc-Tan Tran | |
16.40-17.00 | A Novel Deep Learning-Based Approach for Sleep Apnea Detection Using Single-Lead ECG Signals | Anh-Tu Nguyen; Thao Nguyen; Huy-Khiem Le; Huy-Hieu Pham; and Cuong Do | |
Session | Room | Chair | |
ThPM2-7 (SS15: Advanced Sensing Technologies using Wireless Signal) | Chiang Mai 5 | Kampol Woradit | |
Date | Time | Title | Authors |
10 November 2022 | 15.00-15.20 | Multi-Resolution GPR Clutter Suppression Method Based on Low-Rank and Sparse Decomposition | Yanjie Cao; Xiaopeng Yang; Tian Lan |
15.20-15.40 | Indoor Human Motion Recognition Method Based on Kernel-Distance Doppler Velocity Estimation and Lightweight Network | Weicheng Gao; Xiaopeng Yang; Xiaodong Qu; Jiancheng Liao; Zixiang Yin; Ding Zhang | |
15.40-16.00 | Mainlobe Interference Suppression Method Based on Blocking Matrix Preprocessing With Low Sidelobe Constraint | Meng Haoyu; Qu Xiaodong; Zhang Xingyu; Li Wolin; Zhang Zhengyan; Yang Xiaopeng | |
16.00-16.20 | Continuous Tracking of Indoor Human Targets Based on Millimeter Wave Radar | Meiqiu Jiang; Shisheng Guo; Haolan Luo; Guolong Cui | |
16.20-16.40 | Reconfigurable Intelligent Surfaces Aided WiFi Imaging | Ying He; Dongheng Zhang; Yan Chen | |
16.40-17.00 | Continuous User Authentication Using WiFi | Pengcheng Huang; Dongheng Zhang; Ruixu Geng; Yan Chen | |
CONFERENCE FORMAT
The conference is planned to be in presence. However, if there are some travel restrictions for some authors at the time, we will allow them to upload their videos for the oral presentation. The presenter must attend the session online for Q&A. This will however mean that there will be no live streaming of the conference presentations, as in the hybrid conference. For more information please contact: apsipa2022@gmail.com