Investigation of efficient prompt engineering for video retrieval
-
- Suzuki Kazuya
- Faculty of Informatics, Meisei University
-
- Chiba Akihiro
- Faculty of Informatics, Meisei University
-
- Ueki Kazuya
- Faculty of Informatics, Meisei University
Bibliographic Information
- Other Title
-
- 映像検索における効率的なプロンプトエンジニアリングの検討
Abstract
In recent years, Vision and Language technologies have been developed. In addition, the field of video retrieval has been developing along with it. In this study, we examined the accuracy of using Vision and Language techniques for video retrieval by changing the input query sentences. We conduct two types of validation: validation 1 “Adding a phrase at the beginning of a query sentence” and validation 2 “Adding an important word in a query sentence at the end or the beginning of a sentence”. As a result, validation 2 “Adding important words in the query sentence to the end or the beginning of the sentence,” was effective. “Adding the important words at the end of the sentence” improved the accuracy in many patterns. The result of validation 1 “Appending a phrase to the beginning of a query sentence” varied depending on the phrase. The phrases that improved accuracy were particularly effective for CLIP and SLIP. When two query sentences and a prompt-engineered sentence were used in the search, accuracy was improved in more query sentences, although the number of cases where accuracy was greatly improved was reduced.
Journal
-
- Reports of the Technical Conference of the Institute of Image Electronics Engineers of Japan
-
Reports of the Technical Conference of the Institute of Image Electronics Engineers of Japan 22.04 (0), 102-105, 2023
The Institute of Image Electronics Engineers of Japan
- Tweet
Details 詳細情報について
-
- CRID
- 1390298986213401088
-
- ISSN
- 27589218
- 02853957
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
-
- Abstract License Flag
- Disallowed