Japanese Sentence Dataset for Lip-reading

Takeshi Saitoh, Tatsuya Shirakata

doi:10.23919/mva51890.2021.9511353


Description

This paper addresses lip-reading of Japanese sentences. Research on English sentences is active because extensive datasets are available, but no sufficient dataset of Japanese sentences has been released. The paper therefore builds a Japanese sentence dataset. A Transformer model is used for the recognition task. Three recognition target levels are set (phoneme, mora, and vowel), and recognition experiments show that each level can be recognized.
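To make the three target levels concrete, the sketch below derives mora, phoneme, and vowel label sequences from a hiragana string. It is an illustration only, not the authors' labeling pipeline: this record does not give the paper's label inventories, so the kana table `KANA_TO_ROMAJI`, the romaji-style phoneme symbols, and the function names are all hypothetical, and the table covers only the few kana used in the examples.

```python
# Hypothetical, deliberately tiny kana table (assumption, not from the paper).
# A real system would cover the full syllabary and handle particles, long
# vowels, and geminate consonants.
KANA_TO_ROMAJI = {
    "あ": "a", "い": "i", "う": "u", "え": "e", "お": "o",
    "が": "ga", "り": "ri", "と": "to", "ん": "N",
    "きょ": "kyo",
}
SMALL_KANA = set("ゃゅょ")   # small kana attach to the preceding kana
VOWELS = set("aiueo")


def split_morae(text):
    """Split a hiragana string into morae, merging small kana backwards."""
    morae = []
    for ch in text:
        if ch in SMALL_KANA and morae:
            morae[-1] += ch
        else:
            morae.append(ch)
    return morae


def mora_to_phonemes(mora):
    """Split one mora into an optional consonant part and a vowel."""
    romaji = KANA_TO_ROMAJI[mora]
    if len(romaji) > 1 and romaji[-1] in VOWELS:
        return [romaji[:-1], romaji[-1]]
    return [romaji]  # bare vowel or moraic nasal "N"


def label_sequences(text):
    """Return the label sequence for each of the three target levels."""
    morae = split_morae(text)
    phonemes = [p for m in morae for p in mora_to_phonemes(m)]
    vowels = [p for p in phonemes if p in VOWELS]
    return {"mora": morae, "phoneme": phonemes, "vowel": vowels}


if __name__ == "__main__":
    for level, labels in label_sequences("ありがとう").items():
        print(level, labels)
```

Under these assumptions the same utterance yields the longest sequence at the phoneme level and the smallest label set at the vowel level, which is one way to read the three levels as different granularities of the same recognition task.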

Journal

2021 17th International Conference on Machine Vision and Applications (MVA) 1-5, 2021-07-25

IEEE

Details

CRID: 1871991017890222208
DOI: 10.23919/mva51890.2021.9511353
Data Source: OpenAIRE
