Proceedings of the Annual Conference of JSAI

Roadmap for Emotionally Enriched Interaction with Large Language Model

Focusing on the mechanism of fear and motivation by the amygdala

Takashi OMORI, Yoshimasa TAWATSUJI, Tatsuya MIYAMOTO, Yuta ASHIHARA, N ...

Session ID: 1Q4-OS-7b-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1Q4OS7b02

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Humans realize smooth communication by estimating and predicting the emotions of others and controlling and uttering their own emotions appropriately. The construction of a ``human-like'' agent that appropriately handles the emotional dynamics of self and others is an important issue in HAI. With the progress of large-scale language models in recent years, it has become possible to generate richly expressive sentences based on statistical learning, instead of the conventional stereotyped sentence generation. In addition, in order to realize human-like emotional interaction, it is desirable to refer to the brain when considering its function realization. From the above, by appropriately adding a brain organ model centered on the amygdala, which appropriately captures the dynamics of one's own and others' emotions, to a large-scale language model, it is possible to realize interactions that increase the level of emotional satisfaction of the interlocutor. Will. In this presentation, based on the functions necessary for interaction based on emotion, we will overview the level of emotion model. Next, we summarize the current state of language models and emotion research. Based on this, we propose a roadmap regarding the potential of emotion models in the amygdala function and how to fuse large-scale language models and emotion models in the future.

View full abstract

Download PDF (453K)
Understanding Language Instructions that Include the Vocabulary of Unobserved Objects by Integrating a Large Language Model and a Spatial Concept Model

Shoichi HASEGAWA, Ryosuke YAMAKI, Akira TANIGUCHI, Yoshinobu HAGIWARA, ...

Session ID: 1Q4-OS-7b-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1Q4OS7b03

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

For a robot to assist people in home environments, it is important to handle the vocabulary of unobserved objects while learning the knowledge of places. It is assumed that there exist objects that the robot did not observe through its sensors during learning. For such a case, the robot is expected to perform household tasks on language instructions that include the vocabulary of these objects. We propose a method that integrates a large language model and a spatial concept model to enable the robot to understand language instructions that include the vocabulary of unobserved objects while learning places. Even if the objects that the user instructed the robot to search for are not included in a training dataset during learning, the number of room visits during object search can be expected to reduce by combining the inference of these models. We validated our method in an experiment in which a robot searched for unobserved objects in a simulated environment. The results showed that our proposed method could reduce the number of room visits during the search compared to the baseline method.

View full abstract

Download PDF (786K)
Proposition of affordance extraction from Foundation model for autonomous agent

Reo KOBAYASHI, Yukie NAGANO, Yuya OSAKI, Daiki TAKAMURA, Sawako TAJIMA ...

Session ID: 1Q4-OS-7b-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1Q4OS7b04

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

The creature can perceive various information intuitively from physical entities, understand their surroundings, and act adaptively. In planning for autonomous agents, utilizing affordances like living organisms to adapt to the environment and achieve goals efficiently is effective. Therefore, the purpose of this study was to extract affordance information from large-scale language models. Large-scale language models have learned knowledge from a vast amount of text written by humans and can output new text using that knowledge. Thus, it is considered that large-scale language models contain common sense and implicit knowledge that humans possess. In this study, we analyzed the output from the large-scale language model GPT-3 and constructed a knowledge network by extracting knowledge from it. The experiments showed that using this knowledge network enables the acquisition of affordances similar to those of humans.

View full abstract

Download PDF (462K)
Leveraging BERT for Text Classification: A Hands-on Guide to Transfer Learning

Using Hugging Face to Fine-tune BERT for High-Performance Text Classification

Yang WANG

Session ID: 1Q5-OS-29-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1Q5OS2901

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Transfer learning is a powerful technique that allows a model trained on one task to be fine-tuned on a different but related task. In this presentation, we will explore how to use transfer learning to perform text classification using the BERT model and it's variaty from HuggingFace. BERT (Bidirectional Encoder Representations from Transformers) is a pre-trained model that has been shown to achieve state-of-the-art results on a wide range of natural language understanding tasks. By fine-tuning BERT on a labeled dataset of text classification, we can quickly and easily train a high-performance model with minimal data and computational resources. We will demonstrate how to fine-tune BERT using the Hugging Face library and provide tips and best practices for getting the most out of this powerful technique. Attendees will leave with a solid understanding of how to use transfer learning for text classification and the knowledge to implement their own text classification models using BERT. We will also show how to rapidly implement this with open source MLflow and Transformers.

View full abstract

Download PDF (235K)
Appliciation of Transfer Learning for cross domain recommendation system

Use cross domian transfer learning for recommendations

Leo MAO

Session ID: 1Q5-OS-29-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1Q5OS2903

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

In the big data era, a good Machine learning model requires massive training data with labels. When there is less data available in a target domain, Cross-domain recommendations are useful to leverage richer data from a source domain to improve performance of the recommendation. Cross-domain recommendation has gained lots of interest in recent years. In this talk, we will talk about the overview of CDR, what are the existing CDR approaches, demonstrate a hands-on application for user profile prediction using CDR, as well as the challenges and future directions.

View full abstract

Download PDF (232K)
Transfer Learning for Image Clustering

Yulan yan YAN

Session ID: 1Q5-OS-29-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1Q5OS2904

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

As the development of deep learning models are getting more and more popular, for industries, building good deep learning models case by case is still very expensive. At the meantime, with more and more models are being published to the public through different platforms, transfer learning which is used for transferring knowledge (i.e., the feature encoding) from the pre- trained model to a new model is becoming a technology for practical usage. This work introduces an approach to use a pre- trained deep learning model to compute features for downstream models. As an application of this approach, it demonstrate how to use transfer learning to improve training performance for image clustering by combining transfer learning for vectorization of image data and k-means for image clustering. To demonstrate the effect, we used the tf_flowers dataset, and ResNet50 for featurization of images.

View full abstract

Download PDF (448K)
101 ELSI of Educational Technology Using Student Educational Data

Kei KANO, Nobutsugu KANZAKI, Atsuo KISHIMOTO, Takayuki GOTO, Hitoshi S ...

Session ID: 1R3-OS-15-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R3OS1501

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

EdTech, which utilizes educational data, is being implemented in society in the United States and other countries. In Japan, a roadmap for the utilization of educational data has been formulated. The EdTech widely ranges from already implemented to emerging. We will explore ELSI based on an analytical framework of three categories: legal normative fundamental principles, legal normative regulation, and cultural backgrounds.

View full abstract

Download PDF (779K)
Classification of Educational Technology from ELSI Issues 101

Takayuki SHIOSE, Masayuki MURAKAMI, Takayuki GOTO, Eri MIZUMACHI, Kei ...

Session ID: 1R3-OS-15-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R3OS1502

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

The GIGA School Project has accelerated the use of ICT in education. While there are great expectations to be gained from the utilization of learning data, there are also great ethical and legal concerns. As experts in ELSI, we have organized ELSI based on the classification of EdTech into non-AI and AI systems. In this manuscript, we categorize EdTech into (1) digital teaching materials, (2) classroom support and learning support, and (3) learning management and school administration support, and organize the mapping between each basic technology and ELSI.

View full abstract

Download PDF (555K)
Issues in the ELSI on EdTech and AI in Education

Nobutsugu KANZAKI, Goro HORIGUCHI, Takayuki GOTO, Atsuo KISHIMOTO

Session ID: 1R3-OS-15-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R3OS1503

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

This paper aims to explore the issues that require consideration in discussions surrounding Ethical, Legal, and Social Implications (ELSI) of digitizing education including the datatification of education, the utilization of educational data, EdTech, and AI in Education (AIED). The discussions will encompass the handling of personal information and the treatment of learners based on the data obtained. However, these issues are not exhaustive. To elaborate, we will identify ELSI issues pertaining to EdTech and AIED from the perspectives of Ethical, Legal, and Social implications, and evaluate the relationships and differences between these three perspectives. Specific issues such as advocacy, the right to free-ride, the right to education, and social acceptance will also be examined. Finally, based on the results of the social acceptance survey, the conditions necessary for the social adoption of EdTech and AIED are identified.

View full abstract

Download PDF (377K)
A Comparative Study on the ELSI Issues of EdTech in the U.S. and Japan

Yuko FUJIMURA, Hitoshi SATO, Satoshi TAKAHASHI

Session ID: 1R3-OS-15-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R3OS1504

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Japan has lagged behind other developed countries in the use of ICT at schools for a while. However, in the past couple of years, the opportunities of the use of ICT in schools are rapidly increasing with the development of educational policies such as educational responses to new coronavirus infections and “personalized learning”. In the midst of these situations, ELSI's response to the further development of ICT utilization and EdTech will be an urgent issue. In the U.S., a leading EdTech country, the utilization of learning data in schools is being promoted. On the other hand, a number of the ELSI cases, including the appropriateness and reliability of the data use or personal data transfer without permission, have been emergent in the country. There were problematic cases of data transfer in public schools to the army recruiters and the polices. Especially, the polices made a list of children who were supposed to be committed the crimes based on academic achievement at schools. To avoid those cases, stricter ELSI protections have been constructed in the federal act and regulations, as well as the voluntary guidelines developed by non-governmental organizations. The reporters will show the expected ethical, legal, and social issues of the EdTech policy in Japan from the perspective and experience of cases in the U.S.

View full abstract

Download PDF (395K)
Preliminary Evaluation of Learning Support System Capable of Automatically Generating Program Trace Exercise

Tomohiro MOGI, Yuichiro TATEIWA, Takahito TOMOTO, Takako AKAKURA

Session ID: 1R4-OS-10a-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R4OS10a01

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Program tracing is an effective method to overcome ambiguous understanding in programming education. To provide explicit program tracing tasks, a learning support system has been developed. Positive results were demonstrated in evaluation experiments using this system. However, the system only handles pre-registered questions and feedback by the creator, making it difficult to be used in a more general context. To address this, the authors developed a system that can automatically generate problems and feedback by analyzing arbitrary programs, enabling learners to think about errors in their own programs and conduct debugging exercises. This paper describes the evaluation experiments using the developed system and its results, which revealed areas requiring improvement in future development.

View full abstract

Download PDF (484K)
Implementation of Quality Indicator based on Strategy in Code Sharing Platform and its Evaluation

Shintaro MAEDA, Kento KOIKE, Takahito TOMOTO

Session ID: 1R4-OS-10a-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R4OS10a02

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Refinement activities that bring code closer to better are important in programming learning.We have developed a code-sharing platform with a mechanism for sharing only code that is close in level. The authors believe that sharing codes that are close to the learners' strategy is beneficial in refinement activities, and have proposed a method to evaluate the strategy of codes and a method to calculate the closeness of strategy as similarity.In this paper, we implemented a filtering method that displays only codes that exceed a certain level of similarity to the ranking, and conducted an experiment to compare the ranking of the proposed method with that of the conventional method, targeting expert programmers. The results suggest that learning by using the ranking of the proposed method encourages code refinement activities.

View full abstract

Download PDF (1834K)
Utilizing Learners' Behavioral History to Enhance Programming Exercise Support

Satoru KOGURE, Tomoki IKEGAME, Yasuhiro NOGUCHI, Koichi YAMASHITA, Rai ...

Session ID: 1R4-OS-10a-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R4OS10a03

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

We are developing a system that automatically collects the history of students' program editing, compilation, and execution history during programming exercises, recognizes the history and its temporal changes, and detects impasses in the exercises. Using the results, we have also developed a system to support individual advice for students who are stuck in a programming exercise. In this study, we use machine learning to detect whether a student is still stuck or not after the next 10 minutes from 20 minutes of automatically collected exercise activity history. We employed several machine learning methods to identify the impasse using the training data from the exercise history collected in previous exercises. Although the dataset is relatively small, an F value of 0.95 was obtained as the impasse detection rate by learning with a decision tree.

View full abstract

Download PDF (342K)
Evaluation of the Code Block Programming and the Data Utilization Module Compliant with the PCI Specification

Yoshimitsu MIYAZAWA

Session ID: 1R4-OS-10a-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R4OS10a04

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

In 2021, the national center for university entrance examinations developed the code block programming and the data utilization modules compliant with the portable custom interaction specification formulated by the 1EdTech consortium. These modules can be used to create and present programming and data usage questions in computer-based testing. We demonstrate the effectiveness of the modules through experiments.

View full abstract

Download PDF (618K)
Proposal of a problem space exploration model based on knowledge structures and constraint consideration update formulas for arithmetic word problems

Sho YAMAMOTO, Yoshimasa TAWATSUJI, Tsukasa HIRASHIMA

Session ID: 1R4-OS-10a-05
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R4OS10a05

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

The authors have proposed a knowledge structure for arithmetic sentences and have attempted to estimate the learner's comprehension state based on this structure. The construction of a learner model that represents the learner's comprehension state is an important element in the study of Intelligent Tutoring System. machine learning methods such as Knowledge Tracing describe the learner's state transitions using probabilistic models, while the problem exploration process is not explicitly not explicitly described. In contrast, this study defines problem making as a search of the problem space based on knowledge structures and expresses these transition rules by the degree to which they take into account the constraints of the knowledge structures. This is a new understanding state estimation model that combines a semantic description of search with a quantitative update formula for the degree of consideration. This paper reports the implemented search model and its simulated results based on real logs.

View full abstract

Download PDF (507K)
Prediction of Responses of Learners based on Large Language Models

Yo EHARA

Session ID: 1R5-OS-10b-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R5OS10b01

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

This study addresses interpretable methods for the learner response prediction task that uses the training data of the question text for each question and its responses to predict the response to new question-learner pairs.

View full abstract

Download PDF (589K)
Item Difficulty Constrained Uniform Adaptive Testing for Reducing Bias of Item Exposure

Wakaba KISHIDA, Kazuma FUCHIMOTO, Yoshimitsu MIYAZAWA, Maomi UENO

Session ID: 1R5-OS-10b-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R5OS10b02

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Computerized adaptive testing tends to select and present items frequently with high discrimination parameters. This tendency leads to bias of item exposure. To address this shortcoming, we propose difficulty constrained uniform adaptive testing. During the first stage, the method selects and presents the optimal item from a uniform item group generated by a state-of-the-art uniform test assembly technique. In the second stage, the method selects and presents the optimal item with a difficulty parameter value within the neighborhood of the examinee's ability estimate from the whole item pool. Numerical experiments results underscore the effectiveness of the proposed method.

View full abstract

Download PDF (702K)
Development of an Automatic Generation System for Auxiliary Problems Based on Causality of Force for Mechanics

Nonoka AIKAWA, Shintaro MAEDA, Kento KOIKE, Takahito TOMOTO, Tomoya HO ...

Session ID: 1R5-OS-10b-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R5OS10b03

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

In learning, learners sometimes make mistakes on the same problem repeatedly and get stuck. Helping stalled learners with auxiliary problems can be effective. Auxiliary problems are problems that help the learner understand the original problem. The learner who is presented with an auxiliary problem can also notice errors in the original problem while solving the problem. The authors have been working on the automatic generation of auxiliary problems for mechanics. Specifically, we have studied ``how to generate problems with consistent deletion'' based on the causal inference theory of force and motion by Mizoguchi et al. and created rules for the automatic generation of auxiliary problems. In this paper, we implement the rules in a system and develop a system that can generate auxiliary problems automatically.

View full abstract

Download PDF (1104K)
Difficulty-Controllable Question Generation of Reading Comprehension using Deep Neural Networks and Item Response Theory

Suzuki AYAKA, Uto MASAKI

Session ID: 1R5-OS-10b-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R5OS10b04

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Question generation (QG) for reading comprehension, a technology for automatically generating questions related to given reading passages, has been used for educational purposes. Recently, QG methods based on deep neural networks have succeeded in generating fluent questions that are pertinent to given reading passages. However, conventional methods focus only on generating questions and cannot generate answers to them. Furthermore, they ignore the relation between question difficulty and learner proficiency, making it hard to determine an appropriate difficulty for each learner. To resolve these problems, we propose a new method for generating question–answer pairs that considers their difficulty, estimated using item response theory. The proposed difficulty controllable generation is realized by extending two pre-trained transformer models, namely, BERT and GPT-2. Experimental results show that our method can generate fluent question-answer pairs with arbitrary difficult.

View full abstract

Download PDF (441K)
Development of estimation method for learner's emotional concealment during learning using biometric information

Koichi SHINOHARA, Tatsunori MATSUI, Keiichi MURAMATSU

Session ID: 1R5-OS-10b-05
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1R5OS10b05

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

It is important to understand the mental state of the learner in the teaching/learning process, but sometimes the learner intentionally does not express negative emotions. In this study, intentionally not expressing such negative emotions is called emotional concealment, and we examined the possibility of estimating it from the learner's biometric information. In particular, we focused on the feature points of the learner's face, and attempted to clarify the relationship between microexpressions and emotion concealment. Using the results, we quantified emotion concealment from time-series data of biometric information, and tried to detect it as an abnormal value. In addition, we examined the development of feedback generation that tells the teacher that the teacher is hiding emotions using the results.

View full abstract

Download PDF (905K)
Query-focused sentence compression based on grammatical and semantic constraints

Ritsuki HAYASHI, Yoshihide KATO, Shigeki MATSUBARA

Session ID: 1T3-GS-6-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T3GS601

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Sentence compression focusing on user queries is useful for presenting the results of Web search. Previous methods construct a sentence compression model using training data consisting of source sentences, queries, maximum lengths of compressed sentences and compressed sentences, but it is costly to create such data, and in practice they only use pseudo-created training data. In this paper, we propose a sentence compression method that does not require such training data. The proposed method generates candidate compressed sentences based on dependency structures, and selects compressed sentences that satisfy grammatical and semantic constraints. Since the candidate compressed sentences are selected under the constraint that they contain queries, query-focused sentence compression can be achieved.

View full abstract

Download PDF (308K)
Stylistic Features that Create Characterization for Chatbots with Character-Based Speech

Tomowa HODOSAWA, Hajime MURAI

Session ID: 1T3-GS-6-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T3GS602

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

In recent years, chatbots have come to be used in a variety of settings. However, current chatbots generally respond in a uniform manner, and are not able to respond in a diverse manner like humans. One method to solve this problem is to add characterization to chatbots' speech. Although there have been attempts to add characterization to chatbot speech, they do not necessarily reflect the characteristics of actual character speech, and it has not been fully clarified which elements of speech contribute to the development of characterization. Therefore, in this study, we extracted stylistic features of character utterances of "Uma Musume Pretty Derby," which is a popular sales title in the character business. In order to extract features, we compared the speech of the characters with that of ordinary women, targeting frequently occurring words, parts of speech, topics, and sentence-final expressions, in order to extract speech that is unique to the characters. As a result, it became clear that stylistic features corresponding to the setting of each character were found in personal expressions, emotional verbs, and sentence-final expressions with particles and auxiliary verbs.

View full abstract

Download PDF (388K)
L1 Vocabulary 3D Map (L1 Map): A Visual Map of L1 Word Relations in English through Word Embedding

Development and Verification using Elementary School-Level English Words for Learners in Japan

Jun KANEKO, Takashi OTSUKI, Takayuki SAKAGUCHI, Jesse SOKOLOVSKY

Session ID: 1T3-GS-6-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T3GS603

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Achieving native-speaker proficiency is a formidable task for language learners. What can be done to draw closer to a native-speaker’s own sense of language? The present project attempted to provide one answer to that question through the creation of an L1 vocabulary 3D map, referred to below as L1 Map, which shows the relationships between words in visual form. A set of 185 words was drawn from the elementary school-level English textbook Let’s Try! 1. Each word in the set was associated with added-word vectors with 300 dimensions from the pretrained model for English, fastText, a library for the learning of word embeddings. To make L1 Map useable as a visual tool for learning, the 300 dimensions in these word vectors were reduced to three dimensions using UMAP (Uniform Manifold Approximation and Projection for Dimension Reduction). K-means clustering analysis was performed on the data with three dimensions. The results indicate that L1 Map may be a useful tool for learners of English. Additionally, L1 Map turns fastText’s inability to distinguish between multiple meanings into a strength, offering visualizations of word origins.

View full abstract

Download PDF (884K)
Emotion-Response Prediction model in multi-turn conversations

Yuka OZEKI, Shuhei TATEISHI, Yasuhito OHSUGI, Yoshihisa KANOU, Makoto ...

Session ID: 1T3-GS-6-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T3GS604

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Emotions are important in analyzing human conversations. The current response selection methods, however, do not handle emotions in response selection. This paper proposes a model that simultaneously learns emotions expressed with speakers' multi-turn utterances and their responses using predicted emotion sequence. Evaluation using the MELD dataset showed that our model achieved higher accuracies in predicting emotions and responses compared with the methods that learn response selections without emotion analysis do.

View full abstract

Download PDF (567K)
Prompt Optimization for Training Generalizable Language Models

Masaru ISONUMA, Junichiro MORI, Ichiro SAKATA

Session ID: 1T3-GS-6-05
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T3GS605

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Recently, instruction tuning has been attracting significant attention as a method for training generalizable language models (e.g., ChatGPT). Although various prompts have been manually created for instruction tuning, it has not been clarified what kind of prompts are optimal for obtaining cross-task generalization ability. This study presents \emph{instruction optimization}, which optimizes training prompts by leveraging bilevel optimization, and we clarify what kind of prompts are optimal for instruction tuning. Experimental results demonstrate that instruction optimization enhances the diversity of prompts and improves the generalization performance in a zero-shot setting, whereas using the same examples rather than a variety of exemplars is more effective in a few-shot setting.

View full abstract

Download PDF (513K)
Analysis of web service usage patterns and habituation

Makoto TAKEUCHI, Yukie SANO

Session ID: 1T4-GS-4-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T4GS401

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

It is known that individual behavior patterns are heterogeneous temporally with regard to various types of human behavior. The distributions of Inter-Event Times (IET) of such event time series show a long-tail distribution like a power-law distribution. In particular, it has been reported that there are individual differences in the shape of the IET distributions of physical action event time series (e.g., arm movements) and smartphone touch event time series, but what the differences in the IET distributions mean has not been clarified. We focused on the click event time series of Web services and confirmed that the statistical pattern including the IET distribution for each user changes over time. The results of our analysis suggest that this change in the statistical pattern can be interpreted as habituation to the operation of the Web service. Our results are important in that they provide insight into the origin of temporal heterogeneity common to various human behaviors. Furthermore, our results could be utilized for the improvement of the user experience of web services.

View full abstract

Download PDF (433K)
Classification of Agitation Tweets for the Purpose of Detecting "Agitation" Tweets in SNS

Masaki TOMITA, Hajime MURAI

Session ID: 1T4-GS-4-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T4GS402

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

In recent years, trouble on social networking services has become an issue. One of the causes is "agitation. However, "agitation" in SNS has not been defined in detail, and accurate detection has not been achieved. In this study, we classify agitation expressions in order to achieve accurate detection of "agitation" in SNS. As target data, we collected tweets that were judged by the analyst to be agitation using the Twitter API, extracted and annotated the means, intentions, and topics considered to be constituent elements. The χ-square test and factor analysis were conducted using the statistical data of the components included in each tweet. In the factor analysis, factors such as promotion, judgment, slander, mounting, and inducement were obtained. The obtained results show the characteristics of the components of "agitation" and their relationships in SNS, and are considered to be useful for classifying agitated tweets.

View full abstract

Download PDF (394K)
Improvement of Transformer-based Time Series Forecasting Model using Attention in Frequency Domain

Sohei KODAMA, Takuya MATSUZAKI

Session ID: 1T5-GS-2-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T5GS201

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

The purpose of this research is to enable long-term forecasting of time series data with multiple seasonal variations with a low amount of calculation and high accuracy. We use FEDformer (Zhou et al., 2022) as a baseline model. Since FEDformer performs attention in the frequency domain, it is possible to capture the periodicity even when there are multiple seasonal variations. In addition, it is designed to lighten the calculation by sampling frequency components when performing matrix calculation in the frequency domain. However, because Zhou et al. neglected an important condition in this sampling, the reduction of computational cost is small. We demonstate that, by sampling the frequency component based on the amplitude, it is possible to maintain accuracy with a small number of samples. As a result, in the long-term forecasting of time series data with multiple seasonal variations, we achieved higher accuracy than other models with less computational cost.

View full abstract

Download PDF (331K)
The use of quantum machine learning and tensor network for Japanese stock return predictions

Nozomu KOBAYASHI, Yoshiyuki SUIMON, Kouichi MIYAMOTO, Kosuke MITARAI

Session ID: 1T5-GS-2-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T5GS202

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

With the development of quantum technology, the application of quantum computer to machine learning has gained attention. Tensor network, one of quantum-inspired algorithms has also been applied to machine learning and successfully solves various kinds of tasks. In this work, with the aim of evaluating their applicability for real world problems, we employ quantum neural network, one of quantum machine learning algorithms, and matrix product state, a well studied tensor network to predict Japanese stock returns. Based on their predictions, we further conduct the investment simulation and measure the performance, comparing with benchmarks, linear regression and classical neural network models. Our experimental result shows the matrix product state model outperforms other models. On the other hand, while performances of quantum and classical neural network models vary depending on the market conditions, the quantum model has the better performance than the classical one in the latest market environment.

View full abstract

Download PDF (628K)
Performance Evaluation of Anomaly Communication Detection using BERT for Feature Extraction of Packet Payload

Tomokatsu TAKAHASHI, Yuuki YAMANAKA, Takuya MINAMI, Yoshiaki NAKAJIMA

Session ID: 1T5-GS-2-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T5GS203

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Anomaly Communication Detection is important to ensure the safety of industrial control systems (ICS). However, it is difficult to create detection rules for all the various communication protocols used in an ICS, including proprietary ones. Therefore, anomaly communication detection using Bidirectional Encoder Representations for Transformers (BERT) for feature extraction of packet payload has attracted attention, since it learns the characteristics of packet payloads without prior knowledge and can handle a wide range of protocols. In this paper, we conduct experiments to investigate the features and usefulness of this method. Specifically, we (1) measure the detection performance of random rewriting of payloads of typical protocols and (2) confirm the performance improvement by applying an overdetection correction technique. Through these experiments, we demonstrate the performance of anomaly communication detection using BERT for feature extraction of packet payloads and consider its effectiveness.

View full abstract

Download PDF (241K)
Semi-supervised Continual Learning with Pseudo-labels Based on the Output of a Deep Neural Network

Hirono KAWASHIMA, Jin NAKAZAWA

Session ID: 1T5-GS-2-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1T5GS204

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

We address semi-supervised continual learning for learning with labeled data, which is assumed to be abundant in existing continual learning research, in situations where it is not sufficiently available in the real world. We propose a semi-supervised continual learning method that uses soft outputs of a deep neural network as pseudo-labels for an image classification task, and define a semi-supervised continual learning scenario. In the experiments, the proposed and compared methods are used in several continual learning scenarios and evaluated based on the step-by-step accuracy and final accuracy as the new class increases, as well as the average accuracy over all steps.

View full abstract

Download PDF (1044K)
Semi-Autoregressive Transformer for Sign Language Production

Ehssan WAHBI, Masayasu ATSUMI

Session ID: 1U3-IS-2a-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U3IS2a01

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Sign language production (SLP) aims to generate sign language frame sequences from the corresponding spoken language text sentences. Existing approaches to SLP either rely on autoregressive models that generate the target sign frames sequentially, suffering from error accumulation and high inference latency, or non-autoregressive models that attempt to accelerate the process by producing all frames parallelly, which results in the loss of generation quality. To optimize the trade-off between speed and quality, we propose a semi-autoregressive model for sign language production (named SATSLP), which maintains the autoregressive property on a global scale but generates sign pose frames parallelly on a local scale, thus combining the best of both methods. Furthermore, we reproduced the back-translation transformer model, in which a spatial-temporal graphical skeletal structure is encoded to translate to text for evaluation. Results on the PHOENIX14T dataset show that SATSLP outperformed the baseline autoregressive model in terms of speed and quality.

View full abstract

Download PDF (470K)
Improving the efficiency of training physics-informed neural networks using active learning

Yuri AIKAWA, Naonori UEDA, Toshiyuki TANAKA

Session ID: 1U3-IS-2a-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U3IS2a02

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

PINN is a PDE solver realized as a neural network by incorporating the PDEs to be satisfied into the network as physical constraints. In this study, focusing on how to select the collocation points, we propose an active learning method to improve the efficiency of PINN learning. The proposed method uses variational inference based on dropout learning to evaluate the uncertainty of the solution estimate by PINN and defines an acquisition function for active learning based on the uncertainty. Then, by probabilistically sampling collocation points using the acquisition function, a reasonable solution can be obtained faster than random sampling. We demonstrate the effectiveness of the method using Burgers’ equation and the convection equation. We also show experimentally that the choice of the collocation points can affect the loss function, the fitting of initial and boundary conditions, and the sensible balance of PDE constraints.

View full abstract

Download PDF (3085K)
One-class Damage Detector Using Fully-Convolutional Data Description for Prognostics

Takato YASUNO, Masahiro OKANO, Riku OGATA, Junichiro FUJII

Session ID: 1U3-IS-2a-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U3IS2a03

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

It is important for infrastructure managers to maintain a high standard to ensure user satisfaction during a lifecycle of infrastructures. Surveillance cameras and visual inspections have enabled progress toward automating the detection of anomalous features and assessing the occurrence of the deterioration. Frequently, collecting damage data constraints time consuming and repeated inspections. One-class damage detection approach has a merit that only the normal images enables us to optimize the parameters. Simultaneously, the visual explanation using the heat map enable us to understand the localized anomalous feature. We propose a civil-purpose application to automate one-class damage detection using the fully-convolutional data description (FCDD). We also visualize the explanation of the damage feature using the up-sampling-based activation map with the Gaussian up-sampling from the receptive field of the fully convolutional network (FCN). We demonstrate it in experimental studies: concrete damage and steel corrosion and mention its usefulness and future works.

View full abstract

Download PDF (1314K)
Gated Variable Selection Neural Network for Multimodal Sleep Quality Assessment

Yue CHEN, Takashi MORITA, Tsukasa KIMURA, Takafumi KATO, Masayuki NUMA ...

Session ID: 1U3-IS-2a-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U3IS2a04

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Sleep quality can be affected by several factors, such as sleep environment, lifestyles and so on. Existing sleep quality evaluation methods did not consider the impact of these factors. This research proposed a novel deep learning architecture with multiple-factors for sound-based sleep quality assessment. Utilizing sleep sound for sleep quality evaluation is low-cost and contactless, also, sound data can reflect several physical behaviors such as snore, cough and body movements, which are important when human experts manually evaluate sleep quality. This research utilized VAE-LSTM to learn sleep patterns in sleep sound and applied Gated Variable Selection Network (GVSN) to select useful information in factors. We recorded whole night sleep sounds of more than 100 subjects by microphone at home and collected questionnaires for experiment. The results show that the proposed method can perform accurate sleep quality prediction as well as factor importance analysis.

View full abstract

Download PDF (746K)
Performance Evaluation of SORT Algorithms in Tracking of Fish in a School

Alin KHALIDUZZAMAN, Takato SHIBAYAMA, Hitoshi HABE, Takayuki NIIZATO, ...

Session ID: 1U3-IS-2a-05
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U3IS2a05

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Real-time detection and tracking of fish in a school might have multifold functions to contribute to collective behavior analysis and precision fish farming practices like individual monitoring for growth, anomaly detection, population counting, feed management, and guided and directional control for various measurements. Although the method called SORT (simple online and real-time tracking) and its extensions (e.g., SORT, OC-SORT) are widely used for tracking humans (e.g., pedestrians), such algorithms might have a great challenge to track objects with a similar appearance (e.g., animals). Among them, tracking fish in a school is much more difficult because of their similarity in size, shape, and appearance. Therefore, this research aims to compare the performance of multiple object tracking (MOT) methods, specifically SORT and its latest extension, OC-SORT, to find suitable algorithms for further applied research on various physical and behavioral measurements of fish for individual, collective behavioral analysis, and precision fish farming practices in the future.

View full abstract

Download PDF (308K)
The impact of sentiment scores extracted from product descriptions on customer purchase intention

Yi SUN, Yukio OHSAWA

Session ID: 1U4-IS-1a-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U4IS1a01

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

A key challenge for e-commerce platforms is how to build trust between buyers and sellers and help buyers make better purchasing decisions. In this regard, researchers are interested in addressing the information asymmetry between buyers and sellers. In this study, we focus on featured products that are often sold at a higher price than the original price and examine whether signals hidden in the seller's presentation of such products can mitigate this information asymmetry. To do so, we compute a sentiment score for each product presentation text based on word frequency through text analysis. Finally, we drop this sentiment score into a logistic regression model to see if these variables can significantly influence buyers' purchase intentions as signals. In conclusion, we find that the calculated sentiment scores can have a significant impact on customers' purchase intentions and can be regarded as a new signal to reduce information asymmetry between buyers and sellers.

View full abstract

Download PDF (554K)
Knowledge-aware attentional neural network for explainable recommendation

Yun LIU, Jun MIYAZAKI, Ryutaro ICHISE

Session ID: 1U4-IS-1a-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U4IS1a02

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

We propose a knowledge-aware attentional neural network (KANN) for dealing with recommendation tasks by extracting knowledge entities from user reviews and capturing understandable interactions between users and items at the knowledge level. The proposed KANN can not only capture the inner attention among user (item) reviews but also compute the outer attention values between users and items before generating corresponding latent vector representations. These characteristics enable the explicit preferences of users for items to be learned and understood. Furthermore, our results and analyses highlight the relatively high effectiveness and reliability of KANN for explainable recommendation. Our code is publicly released at https://github.com/liuyuncoder/KANN.

View full abstract

Download PDF (432K)
FinKG-JP: A Japanese Financial Knowledge Graph

Natthawut KERTKEIDKACHORN, Rungsiman NARARATWONG, Ziwei XU, Ryutaro IC ...

Session ID: 1U4-IS-1a-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U4IS1a03

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Financial data contains a vast amount of information relating to financial terms and business entities. Without properly organizing the data, it is difficult for artificial intelligence (AI) systems to utilize financial knowledge effectively and efficiently. In this paper, we, therefore, present our ongoing work on constructing a Japanese Financial Knowledge Graph to aid the financial domain AI applications.

View full abstract

Download PDF (588K)
Incorporating Domain-Specific Traits into Personality-Aware Recommendations for Financial Applications

Takehiro TAKAYANAGI, Kiyoshi IZUMI

Session ID: 1U4-IS-1a-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U4IS1a04

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

The use of general personality traits, specifically the Big-Five personality traits, in recommendation systems has been widely explored and adopted in various fields such as music, film, and literature. However, research on personality-aware recommendations in specific domains, such as finance and education, where domain-specific psychological traits such as risk tolerance and behavioral biases play a crucial role in explaining user behavior, remains limited. To bridge this gap, this study investigates the effectiveness of personality-aware recommendations in financial stock recommendation tasks. Firstly, the paper demonstrates the utility of general personality traits in financial stock recommendations. Secondly, this paper shows that incorporating domain-specific psychological traits along with general personality traits enhances the performance of the recommendation system. Thirdly, we propose a personalized stock recommendation model that incorporates both general personality traits and domain-specific psychological traits as well as interaction data, resulting in superior performance compared to baseline models.

View full abstract

Download PDF (460K)
Customer Clustering and Customer-Specific Product Recommendation by Distance from Core Customer Group in a Multidimensional Space of Purchase Behavioral Features

Kazuhiro KOIKE, Daishi SAGAWA, Kenji TANAKA

Session ID: 1U4-IS-1a-05
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U4IS1a05

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Recommending attractive products is generally implemented in internet shopping to increase customers’ purchasing desire. When conducting such sales promotion measures, rather than treating all customers equally, a higher degree of effectiveness can be expected when grouping customers by industry sector and their purchasing behavior features and then implementing sales promotions tailored to each group. Furthermore, customizing the promotion for each customer will improve the effectiveness of the measures even further. The method proposed in this study first extracts a core group of customers for each industry sector and clusters other customers according to their distance from the core group in a multidimensional space of dozens of purchase behavior feature variables such as purchase amount, frequency, and the number of categories. Collaborative filtering is then performed within each customer cluster to recommend products suitable for each customer. We confirmed that the proposed method improves LTV by recommending specialized products to customers in clusters with low LTV.

View full abstract

Download PDF (354K)
Comparing Feature Extraction Methods for Sarcasm Detection in Twitter

Jenq-Haur WANG, Rahmat Fadli ISNANTO

Session ID: 1U5-IS-2b-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U5IS2b01

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Sarcasm detection is a challenging task, which identifies expressions that have the opposite meaning of what is written. Most previous works only measure sentiment polarity in sentences. However, more features are needed for improving the result. In this paper, we intend to compare different feature extraction methods including n-gram, sentiment, punctuation, and part of speech features for sarcasm detection. Firstly, sarcastic data are collected using Twitter API, and preprocessed by removing all the hashtags, mentions and URLs. Then, after all features were extracted, they are combined by One Hot Encoding. Finally, we use two classification methods: Support Vector Machine and Logistic Regression for comparison. In our experimental results, n-gram feature gives the best performance compared to the other individual features. Support Vector Machine gives a better performance than logistic regression with an F1-measure of 79.64%. This shows the potential of combining different features for sarcasm detection.

View full abstract

Download PDF (272K)
Multilevel Sentence Embeddings for Personality Prediction

Paolo TIROTTA, Akira YUASA, Masashi MORITA

Session ID: 1U5-IS-2b-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U5IS2b02

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Representing text into a multidimensional space can be done with sentence embedding models such as Sentence-BERT (SBERT). However, training these models when the data has a complex multilevel structure requires individually trained class-specific models, which increases time and computing costs. We propose a two step approach which enables us to map sentences according to their hierarchical memberships and polarity. At first we teach the upper level sentence space through an AdaCos loss function and then finetune with a novel loss function mainly based on the cosine similarity of intra-level pairs. We apply this method to three different datasets: two weakly supervised Big Five personality dataset obtained from English and Japanese Twitter data and the benchmark MNLI dataset. We show that our single model approach performs better than multiple class-specific classification models.

View full abstract

Download PDF (552K)
Enhancing Financial Question Answering with Structured Knowledge

Rungsiman NARARATWONG, Natthawut KERTKEIDKACHORN, Ziwei XU, Ryutaro IC ...

Session ID: 1U5-IS-2b-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U5IS2b03

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Answering questions involving financial documents using a language model requires the ability to recognize tabular and textual data, as well as numerical reasoning. This article explains the challenges, recent progress, and our approach to tackling this problem by incorporating external structured knowledge. We also introduce our financial knowledge graph (KG) linking companies to people, industries, and facts extracted from public financial filings. The KG is part of our work to advance machine-learning models for more complex financial questions beyond the scope of the previous models and datasets.

View full abstract

Download PDF (448K)
Improving Financial Terminologies Recognition regarding Morphological Inflection

Ziwei XU, Rungsiman NARARATWONG, Natthawut KERTKEIDKACHORN, Ryutaro IC ...

Session ID: 1U5-IS-2b-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U5IS2b04

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Recognizing financial terminologies from text is essential for key information retrieval and content understanding. In general, financial terminologies do not appear in single-token form but are composed of several tokens. Also, in terminologies, a proper name might have diverse expressions, like abbreviations and morphological inflection, which sacrifice the recognition performance on recall. In this paper, along with transformer-based language models, i.e. XLM-Roberta, we propose a mechanism to train the neural classifier to distinguish terminologies from plain text, by learning from the sequential tags of targeting tokens. Initially, the targeting tokens are from a list of terminologies. To involve the diverse expressions, we inventively generate different morphologies of terminologies and utilize them to extend the targeting tokens. The experiments' results prove that this mechanism shows a convincing improvement in identifying financial terms from plain text.

View full abstract

Download PDF (286K)
Predicting CTR of Responsive Search Ads Using Handcrafted Features

Melvin Charles Ortua DY

Session ID: 1U5-IS-2b-05
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_1U5IS2b05

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

In this paper, I demonstrate that a reasonably sized set of handcrafted features (866, applied to titles and description texts separately) plus encoded metadata can be used to predict the click-through rates of the dynamic Responsive Search Ad format, exceeding the performance of some fine-tuned Transformer-based large language models at a fraction of the training cost.

View full abstract

Download PDF (263K)
Efficient generation of magic square using multi-stage evolutionary strategy cascaded by satisfiable individuals for magic sum

Kazuki TAKEMI, Takuto SAKUMA, Shohei KATO

Session ID: 2A1-GS-2-01
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_2A1GS201

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

The magic square has been pursued as a theme of recreational mathematics since ancient times. Most of existing methods are rule-based methods, so only a few special magic squares can be generated. This paper proposes a multi-stage evolutionary strategy that hierarchically classifies the constraints of magic squares into three stages of rows, columns, and diagonals, and uses the individuals satisfying each layer's constraints as the initial individuals for the next layer. A magic square is a state in which the sum of each element in a square matrix in rows, columns, and diagonals is equal to a constant value. A square matrix that satisfies only the constraints in rows and columns is called a semi-magic square. In this research, the semi-magic square was divided into two layers, and by adding a lower layer that generates individuals satisfying only the row constraints, the need for exploration in the row direction was eliminated, resulting in a significant reduction in the number of searches. Experiments compared the execution time and generation number between the proposed method and previous study, and achieved a significant reduction of the execution time.

View full abstract

Download PDF (512K)
Acquisition of Characteristic Tag Tree Patterns with Wildcards using Edit Distance and Evolutionary Learning

Mutsuki TAN, Tetsuhiro MIYAHARA, Yusuke SUZUKI, Tetsuji KUBOYAMA, Tomo ...

Session ID: 2A1-GS-2-02
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_2A1GS202

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Machine learning of characteristic structures from tree structured data is studied intensively. We propose a method for acquiring characteristic tag tree patterns with wildcards from positive and negative tree structured data, by using tree edit distance and evolutionary learning. We report preliminary experimental results on our evolutionary learning method.

View full abstract

Download PDF (431K)
Self-Examination Mechanism: Lightweight Defense Mechanism against Adversarial Examples using Explainable AI

Sora SUEGAMI, Yutaro OGURI, Zaiying ZHAO, Yu KAGAYA, Koki MUKAI, Shun ...

Session ID: 2A1-GS-2-03
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_2A1GS203

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Deep learning-based image classification models are vulnerable to adversarial examples (AEs). Existing defense methods have improved the classification accuracy for AEs, but the classification accuracy for clean images without perturbations decreases. To solve this problem, we propose a new defense mechanism called self-examination mechanism. In the proposed method, the input image is first classified. Then, the inference process of the classification model is verified using SHapley Additive exPlanations (SHAP), a method of explainable AI. If the input image is abnormal, the classification is performed again based on the output of SHAP. Thus, misclassification of AEs can be prevented without significantly reducing the classification accuracy of clean images. Evaluations on ResNet and WideResNet trained with CIFAR10 demonstrate that our method improves the accuracy for AEs and hardly reduces the accuracy for clean images.

View full abstract

Download PDF (495K)
Action evaluation of multiple soccer players in continuous state space based on deep reinforcement learning

Hiroshi NAKAHARA, Kazushi TSUTUSI, Kazuya TAKEDA, Keisuke FUJII

Session ID: 2A1-GS-2-04
Published: 2023
Released on J-STAGE: July 10, 2023

DOIhttps://doi.org/10.11517/pjsai.JSAI2023.0_2A1GS204

CONFERENCE PROCEEDINGS FREE ACCESS

Show abstractHide abstract

Advances in measurement technology have made it possible to acquire various data during a match, and advanced data analysis is being used to plan team tactics, and evaluate and coach players. Analysis of invasive sports such as soccer is difficult because the game situation is continuous in time and space, and multiple agents individually recognize the game situation and make decisions. In the previous study using deep reinforcement learning, which is one of the representative agent modelings, they have often considered the team as one agent and evaluated the players and teams who hold the ball in each discrete event. Therefore it was difficult to evaluate the behavior of multiple players, including players far from the ball, in a spatio-temporally continuous state space. In this study, based on a deep reinforcement learning model with a discrete action space in a continuous state space that mimics Google Research Football (a reinforcement learning platform for soccer), the actions in actual games are evaluated by estimating the action-value function of multiple players. In the experiment, the calculated player evaluation index was verified using the data of one season of a team in the J-League.

View full abstract

Download PDF (573K)

Register with J-STAGE for free!