Program
Thursday, November 1
Thursday, November 1 7:00 - 8:00
Registration
Thursday, November 1 8:00 - 8:20
Opening Ceremony
Thursday, November 1 8:20 - 9:20
Keynote speaker #1: Automatic speaker recognition: current trends and challenges
Keynote Speakers Automatic speaker recognition: current trends and challenges by Prof. Tomoko Matsui, The Institute of Statistical Mathematics, Tokyo, Japan
Abstract: Speaker recognition (SR) is a technique to automatically identify or verify a speaker using the speech features extracted from voice data. In the past decades, interest in SR has increased as a technique for biometric authentication and for user friendly interface design. SR research started in the 1930's with the application of statistical machine learning methods including Gaussian mixture models, factor analysis models, kernel machines and so on. Recently deep neural networks have been successfully utilized. In this talk, the current trends and challenges in SR are introduced as well as some future perspectives.
Bio: She received the Ph.D. degree from the Computer Science Department, Tokyo Institute of Technology, Tokyo, Japan, in 1997. From 1988 to 2002, she was with NTT, where she worked on speaker and speech recognition. From 1998 to 2002, she was with the Spoken Language Translation Research Laboratory, ATR, Kyoto, Japan, as a Senior Researcher and worked on speech recognition. From January to June 2001, she was an Invited Researcher in the Acoustic and Speech Research Department, Bell Laboratories, Murray Hill, NJ, working on finding effective confidence measures for verifying speech recognition results. She is currently a Professor in the Institute of Statistical Mathematics, Tokyo, working on statistical modeling for speech and speaker recognition applications. Prof. Matsui received the paper award of the Institute of Electronics, Information, and Communication Engineers of Japan (IEICE) in 1993.
Thursday, November 1 9:20 - 10:20
Keynote speaker #2: Proximity-based Federation of Smart Objects: Its Formal Modelling and Application Framework
Abstract: In the age of smart phones, IC cards, and IoT, we are surrounded by a huge number of smart objects, i.e., intelligent devices with wireless communication capabilities ranging from peer-to-peer to cellphone communications. Some of them are wearable or in-vehicle ones. However, it is often pointed out both by theoreticians and by practitioners that the lack of a formal computation model and an application framework capable of context modeling and complex application scenario description to cover the application diversity of smart objects and their federations is the main reason why most existing applications essentially still remain within the scope of the three stereotyped scenarios, i.e., the location transparent service continuation and the location, situation-aware service provision, and dynamic federation among smart objects through the Internet, i.e., their web-based federation. The first one focuses on the ubiquity of services, while the second focuses on the context-dependent services. This talk reviews the speaker's formal modeling of complex application scenarios using autonomic proximity-based federation among smart objects, and application framework to develop complex application scenarios, and tries to open a new vista of smart objects and their federation.
Bio: Yuzuru Tanaka has been a professor emeritus of Hokkaido University (2013- ), the research supervisor of the JST CREST Program on Big Data Applications (2013-2021), an MI research advisor of Research and Services Division of Materials Data and Integrated System (MaDIS) at National Institute of Materials Science (NIMS) (2017- ), and visiting professors of National Institute of Informatics (NII) (2004- ), Institute of Catalysis at Hokkaido University (2017- ), and Hokkaigakuen University (2017- ). He had been a full professor of computer architecture at the Department of Electrical Engineering (1990-2003), then of knowledge media architecture at the Department of Computer Science (2004-2017), Hokkaido University, and the founding director of Meme Media Laboratory (1995-2013), Hokkaido University. He was also a full professor of Digital Library, Graduate School of Informatics, Kyoto University (1998-2000) in parallel. His research areas covered multiprocessor architectures, database schema-design theory, database machine architectures, full text search of document image files, and automatic cut detection in movies and full video search. His current research areas cover meme media architectures, knowledge federation frameworks, proximity-based federation of smart objects, their application to digital libraries, e-Science, clinical trials, materials informatics, and social cyber-physical systems. He worked as a visiting research fellow at IBM T.J. Watson Research Center (1985-1986), an affiliated scientist of FORTH in Crete (2010- ), and a series editor of Springer's LNAI (lecture Notes in Artificial Intelligence). He has been involved in EU's FP6 Integrated Project ACGT (Advancing Clinico-Genomic Trials on Cancer), FP7 Best Practice Network Project ASSETS (Advanced Search Services and Enhanced Technological Solutions for the European Digital Library), and FP7 Large Integration Project p-medicine (personalized medicine).
Thursday, November 1 10:20 - 10:45
Coffee break
Thursday, November 1 10:45 - 12:00
Computer Vision and Pattern Recognition
Natural Language Processing and Text Mining
Thursday, November 1 12:00 - 1:15
Lunch break
Thursday, November 1 1:15 - 2:40
Computer Vision and Pattern Recognition
- 1:15 Protozoa Identification using 3D Geometric Multiple Color Channel Local Feature
- 1:36 Coding distortion modelling method for local image perception
- 1:57 Novel Skeleton-based Action Recognition Using Covariance Descriptors on Most Informative Joints
- 2:18 An improved QR decomposition for color image watermarking
Thursday, November 1 1:15 - 3:15
Natural Language Processing and Text Mining
- 1:15 Building a Named Entity Annotated Bilingual English-Vietnamese Corpus
- 1:39 Cross-Language Aspect Extraction for Opinion Mining
- 2:03 Regularizing Forward and Backward Decoding to Improve Neural Machine Translation
- 2:27 Integrating Word Embeddings into IBM Word Alignment Models
- 2:51 Towards State-of-the-art Baselines for Vietnamese Multi-document Summarization
Thursday, November 1 3:15 - 3:40
Coffee break
Thursday, November 1 3:40 - 5:30
Knowledge Discovery and Data Mining
- 3:40 Discovering Topic Evolution In Heterogeneous Bibliographic Network
- 4:02 Information gain Aggregation-based Approach for Time Series Shapelets Discovery
- 4:24 W-PathSim++: the novel approach of topic-driven similarity search in large-scaled heterogeneous network with the support of Spark-based DataLog
- 4:46 Anomaly Detection in POSTFIX mail log using Principal Component Analysis
- 5:08 Combining Apache Spark & OrientDb to Find the Influence of a Scientific Paper in a Citation Network
Thursday, November 1 3:40 - 5:20
Knowledge, Data, and Soft Computing
- 3:40 Adaptive Fractional Terminal Sliding Mode Controller for Active Power Filter Using Fuzzy-Neural-Network
- 4:05 An Improved Human Activity Recognition by Using Genetic Algorithm to Optimize Feature Vector
- 4:30 The Internet-of-Things based Fall Detection Using Fusion Feature
- 4:55 Context learning for Bone Shadow Exclusion in CheXNet Accuracy Improvement
Thursday, November 1 6:30 - 9:00
Banquet
At the conference venue
Friday, November 2
Friday, November 2 8:00 - 9:00
Keynote speaker #3: Audio/speech information hiding based on human auditory characteristics
Abstract: Audio information hiding (AIH) has recently been focused on as a state-of-the-art technique enabling copyrights to be protected and defended against attacks and tampering of audio/speech content. This technique has aimed at embedding codes as watermarks to protect copyrights in audio/speech content, which are inaudible to and inseparable by users, and at detecting embedded codes from watermarked signals. It has also aimed at verifying whether it can robustly detect embedded codes from watermarked signals (robust or fragile), whether it can blindly detect embedded codes from watermarked signals (blind or non-blind), whether it can completely restore watermarked signals to the originals by removing embedded codes from them (reversible or irreversible), and whether it can be secure against the publicity of algorithms employed in public or private methods. AIH methods, therefore, must satisfy some of the five following requirements to provide a useful and reliable form of watermarking: (a) inaudibility (inaudible to humans with no sound distortion caused by the embedded data), (b) robustness (not affected when subjected to techniques such as data compression and malicious attacks), (c) blind detectability (high possibility of detecting the embedded data without using the original or reference signal), (d) confidentiality (secure and undetectable concealment of embedded data), and (e) reversibility (removable embedded data from the watermarked signal and/or enable watermarking to be re-edited). In this talk, historical and typical AIH methods are introduced and pointed out drawbacks. Then our proposed methods based on human auditory characteristics (cochlear delay, adaptive phase modulation, singular spectrum analysis with psychoacoustic model, and formant enhancement) are introduced.
Bio: Masashi Unoki received his M.S. and Ph.D. in Information Science from the Japan Advanced Institute of Science and Technology (JAIST) in 1996 and 1999. His main research interests are in auditory motivated signal processing and the modeling of auditory systems. He was a Japan Society for the Promotion of Science (JSPS) research fellow from 1998 to 2001. He was associated with the ATR Human Information Processing Laboratories as a visiting researcher from 1999-2000, and he was a visiting research associate at the Centre for the Neural Basis of Hearing (CNBH) in the Department of Physiology at the University of Cambridge from 2000 to 2001. He has been on the faculty of the School of Information Science at JAIST since 2001 and a full professor. Dr. Unoki received the Sato Prize from the Acoustical Society of Japan (ASJ) in 1999, 2010, and 2013 for Outstanding Papers and Best Paper Award from the Institute of Electronics, Information and Communication Engineers in 2017. Currently, he is an associate editor of Applied Acoustics and an Editor in chief of the ASJ/Acoustical Science and Technology.
Friday, November 2 9:00 - 9:20
Coffee break
Friday, November 2 9:20 - 10:40
Knowledge, Data, and Soft Computing
- 9:20 A combination of geometric buffer technique and ray based interactive methods for multi-objective evolutionary algorithms
- 9:40 Compositions of Partial Fuzzy Relations Employing the Lower Estimation Approach
- 10:00 Towards Thermal Region of Interest for Human Emotion Estimation
- 10:20 ArcViz: An Extended Radial Visualization for Classes Separation of High Dimensional Data
Friday, November 2 9:20 - 10:50
Learning, Prediction, and Recognition
- 9:20 Robust Loss Functions: Defense Mechanisms for Deep Architectures
- 9:42 Predicting user's action on emails: improvement with ham rules and real-world dataset
- 10:05 Genre Classification using Feature Extraction and Deep Learning Techniques
- 10:27 A Deep Learning Study of Aspect Similarity Recognition
Friday, November 2 9:20 - 10:20
Poster Session
- LSTM Easy-first Dependency Parsing with Pre-trained Word Embeddings and Character-level Word Embeddings in Vietnamese
- Key-value based data hiding method for NoSQL database
- A New Formula for Vietnamese Text Readability Assessment
- A Comparative Study on Detection and Estimation of a 3-D Object Model in a Complex Scene
- Finding babies on social media: A case of named entity recognition in Vietnamese documents
- Document Ontology Evolution for Large Datasets
- Combining Deep Feature and Handcrafted Features for Material Classification
- Adaptive Fuzzy Logic Control to Enhance Pitch Angle Controller for Variable-Speed Wind Turbines
- Analysis of MPPT characteristics according to the control period
Friday, November 2 10:50 - 12:20
Machine Learning - Methods and Applications
- 10:50 Feature normalization for similarity calculations in matrix factorization method
- 11:12 Ontology-based Recommender System for the Million Song Dataset Challenge
- 11:35 Building a specific amino acid substitution model for dengue viruses
- 11:57 Comparison of the Most Influential Missing Data Imputation Algorithms for Healthcare
Software Engineering
- 10:50 A Transformation-Based Method for Test Case Automatic Generation from Use Cases
- 11:12 Building Adaptive Software Architectures with Useful and Available Elements for Adaptation
- 11:35 An approach for testing model transformations
- 11:57 On Improvement of Assume-Guarantee Verification Method for Timed Component-Based Software
Friday, November 2 12:20 - 1:30
Lunch break
Friday, November 2 1:30 - 3:20
Special Session #1: Intelligent Software and Knowledge Representation
Nowadays there are many practical applications in Artificial Intelligence and the Internet. The Intelligent Software has been created to better serve the increasing needs of the people. These kinds of software have been applied in a wide range of areas such as Expert Systems, Intelligent Problems Solver, Decision Support Systems and another Intelligent Systems. Knowledge Representation (KR) and Reasoning are exciting and well- established fields of research in Intelligent Software. KR is key drivers of innovation in computer science. In recent years, KR has also derived challenges from new and emerging fields including the semantic web, computational biology, and the development of software agents.
This special session aims to bring together researchers and developers with all these different backgrounds to discuss and exchange new ideas and the latest achievements in building Intelligent Software and methods for KR
- 1:30 A Novel Method for Time Series Anomaly Detection based on Segmentation and Clustering
- 1:52 Assessment of georeferencing methods on MODIS Terra/Aqua and VIIRS NPP satellite images in Vietnam
- 2:14 Criteria of a Knowledge model for an Intelligent Problems Solver in Education
- 2:36 Combining Fuzzy Set - Simple Additive Weight and Comparing With Grey Relational Analysis For Student's Competency Assessment In The Industrial 4.0
- 2:58 Applying Multi-CNNs model for detecting abnormal problem on chest x-ray images
Friday, November 2 1:30 - 3:35
Special Session #3: Deep Learning and Applications to Language Processing
The session focuses on developing deep learning models and applications to real-world problems relevant to the field of language processing. Deep learning has made many breakthroughs in various areas of computer science such as computer vision, speech, and natural language processing (NLP). In the last few years, various models have been proposed for different data representations make deep learning transited successfully from research to applications. In the context of this session, we will explore and discuss both new models and applications of deep learning in natural, programming language, and speech processing.
- 1:30 Variants of Long Short-Term Memory for Sentiment Analysis on Vietnamese Students' Feedback Corpus
- 1:55 A study on integrating distinct classifiers with bidirectional LSTM for Slot Filling task
- 2:20 Combining Advanced Methods in Japanese-Vietnamese Neural Machine Translation
- 2:45 A Joint Model of Term Extraction and Polarity Classification for Aspect-based Sentiment Analysis
- 3:10 Encoding Local Contexts of Sentences with Convolutions on pq-Gram Representations of Dependency Trees
Friday, November 2 3:35 - 3:55
Coffee break
Friday, November 2 3:55 - 5:35
Special Session #3: Deep Learning and Applications to Language Processing
The session focuses on developing deep learning models and applications to real-world problems relevant to the field of language processing. Deep learning has made many breakthroughs in various areas of computer science such as computer vision, speech, and natural language processing (NLP). In the last few years, various models have been proposed for different data representations make deep learning transited successfully from research to applications. In the context of this session, we will explore and discuss both new models and applications of deep learning in natural, programming language, and speech processing.
- 3:55 Shallow and Deep Learning Architecture for Pests Identification on Pomelo Leaf
- 4:15 Enhancing Performance of Lexical Entailment Recognition for Vietnamese based on Exploiting Lexical Structure Features
- 4:35 Vietnamese Diacritics Restoration Using Deep Learning Approach
- 4:55 Deep Learning in Cross-Lingual English-Vietnamese Sentiment Classification
- 5:15 Enhanced Fashion Attribute Learning Framework adapts to Attributes' inner-group Correlations and Imbalanced Data
Special Session #4: Cyber Security Threats and Countermeasures
- 3:55 Modeling The Causes Of Terrorism From Media News: An Innovative Framework Connecting Impactful Events With Terror Incidents
- 4:20 Detect Wi-Fi Network Attacks Using Parallel Genetic Programming
- 4:45 An Efficient Approach for Electronic Voting Scheme without An Authenticated Channel
- 5:10 Comparison of Three Deep Learning-based Approaches for IoT Malware Detection