Development of Neural Network for Incomplete Data Set, CQSAR: Compensation Quantitative Structure-Activity Relationships

  • AOYAMA Tomoo
    Faculty of Engineering, University of Miyazaki
  • KAMBE Junko
    Faculty of Media Communication, Edogawa University
  • NAGASHIMA Umpei
    Research Institute of Computational Science, National Institute of Advanced Industrial Science and Technology
  • UMENO Hidenori
    Faculty of Engineering, Kumamoto University

Bibliographic Information

Other Title
  • 欠測データ集合を扱う神経回路網法CQSAR:Compensation Quantitative Structure-Activity Relationshipsの開発
  • ケッソク データ シュウゴウ オ アツカウ シンケイ カイロモウホウ CQSAR Compensation Quantitative Structure Activity Relationships ノ カイハツ

Search this article

Abstract

Multi-layer neural networks are used for the multi regression analysis of many kinds of phenomena whose expressions are unknown. The application fields are medicine designs and environmental problems. We often find incomplete parts in descriptors, which make the precision of the analysis lower. Moreover, the incomplete parts make the linked parts of other descriptors invalid. We often cannot calculate the multi regression analysis, therefore, we wish to eliminate the erroneous effects. In the paper, we discuss some approaches to eliminate such effects, and derive a method based on neural networks, which compensates defect descriptors. We call the method the compensation quantitative structure-activity relationships method (CQSAR).<BR>The first step of CQSAR is to interpolate defective parts of descriptors. There are various interpolation methods. The selection depends on the number of data and characters of the data set. We introduce 3 kinds of methods; these are, variable absolute-function fitting, parametric observed-vector method, and interpolation by using a 3-layer neural network and an arithmetic progression. The first 2 methods are for small data set. The neural network approach is for general purpose, which requires over 7 data. If we can get over 11 data, the approach also gives partial derivative coefficients also. We confirm the effects in numerical calculations.<BR>The second step of CQSAR is to calculate the multi regression analysis based on the non-linear fitting functions of neural networks. We evaluate propagations of error caused by interpolations in the first step, and show that the error does not increase. In the evaluation, we give a new reconstruction learning, and show the effectiveness of the CQSAR method under a defect ratio of 50%.

Journal

Citations (2)*help

See more

References(14)*help

See more

Details 詳細情報について

Report a problem

Back to top