Fujitsu Develops World's First AI technology to Accurately Capture Characteristics of High-Dimensional Data Without Labeled Training Data
Expected to contribute to improved accuracy for a variety of AI technologies
News Facts:
- New technology developed for high-dimensional data, including images, network access, and medical data - Tested against international benchmarks for detecting anomaly data in different fields, achieving state-of-the-art accuracy with up to 37% improvement over error rates of anomaly detection for conventional deep-learning techniques - Fujitsu hopes to apply in future to improve accuracy for a variety of AI technologies
KAWASAKI, Japan, Jul 13, 2020 - (JCN Newswire) - Fujitsu Laboratories, Ltd. has developed the world's first AI technology that accurately captures essential features, including the distribution and probability of high-dimensional data in order to improve the accuracy of AI detection and judgment.
| Fig 1 (Example of error detection): Incorrect decisions due to unquantified empirical methods |
| Fig 2 Improvement of error rate when this technology is applied to abnormality detection |
| Fig. 3 Theoretical framework for acquisition of distribution and probability faithful to data characteristics inspired by information compression technology |
| Fig. 4 Deep learning technology to obtain dimensional reduction transformation distribution/probability |
High-dimensional data, which includes communications networks access data, types of medical data, and images remain difficult to process due to its complexity, making it a challenge to obtain the characteristics of the target data. Until now, this made it necessary to use techniques to reduce the dimensions of the input data using deep learning, at times causing the AI to make incorrect judgments.
Fujitsu has combined deep learning technology with its expertise in image compression technology, cultivated over many years, to develop an AI technology that makes it possible to optimize the processing of high-dimensional data with deep learning technology, and to accurately extract data features. It combines information theory used in image compression with deep learning, optimizing the number of dimensions to be reduced in high-dimensional data and the distribution of the data after the dimension reduction by deep learning.
Akira Nakagawa, (Associate fellow) of Fujitsu Laboratories commented, "This represents an important step to addressing one of the key challenges in the AI field in recent years: capturing the probability and distribution of data. We believe that this technology will contribute to performance improvements for AI, and we're excited about the possibility of applying this knowledge to improve a variety of AI technologies."
Details of this technology will be presented at the International Conference on Machine Learning "ICML 2020 (International Conference on Machine Learning 2020)" on Sunday, July 12.
Development Background
In recent years, there has been a surge in demand for AI-driven big data analysis in various business fields. AI is also expected to help support the detection of anomalies in data to reveal things like unauthorized attempts to access networks or abnormalities in medical data for thyroid values or arrhythmia data.
Challenges
Data used in many business operations is high-dimensional data. As the number of dimensions of data increases, the complexity of calculations required to accurately characterize the data increases exponentially, a phenomenon is widely known as the "Curse of Dimensionality"(1). In recent years, a method of reducing the dimensions of input data using deep learning has been identified as a promising candidate for helping to avoid this problem. However, since the number of dimensions is reduced without considering the data distribution and probability of occurrence after the reduction, the characteristics of the data have not been accurately captured, and the recognition accuracy of the AI is limited and misjudgment can occur (Figure 1). Solving these problems and accurately acquiring the distribution and probability of high-dimensional data remain important issues in the AI field.
About the Newly Developed Technology
Fujitsu has developed the world's first AI technology that accurately captures the characteristics of high-dimensional data without labeled training data.
Fujitsu tested the new technology against benchmarks for detecting data abnormalities in different fields, including communication access data distributed by the International Society for Data Mining "Knowledge Discovery and Data Mining (KDD)", thyroid gland numerical data and arrhythmia data distributed by the University of California, Irvine. The newly developed technology successfully achieved the world's highest accuracy in all data with up to a 37% improvement over conventional deep-learning-based error rates. Since this technology solves one of the fundamental challenges in the field of AI, which is how to accurately capture the characteristics of data, it is expected to prove an important development to unlocking a wide range of new applications.
The technical features of the developed technology are as follows.
1. Proof of theory that accurately captures the characteristics of data
In compression of image and audio data, which are both high-dimensional data consisting of several thousand to several million dimensions, the distribution and occurrence probability of the data have been clarified through many years of research, and methods for reducing the number of dimensions by means of discrete cosine transform(2) and other methods optimized for these known distributions and probabilities have already been established. It has been theoretically proven that the amount of compressed data information can be minimized when the degradation between the original image/sound and the restored image/sound is suppressed to a constant level by restoring data using the distribution of data after dimension reduction and the probability of occurrence. Inspired by image compression theory, Fujitsu has proved a new mathematical theory for the first time in the world that, for high-dimensional data with unknown distribution and probability, such as communication network access data and medical data, the dimensionality of the data is reduced by an auto-encoder(3), which is a neural network, and when the data is restored, the degradation between the original high-dimensional data and the restored data is kept to a constant value while the amount of information after the dimensionality reduction is minimized, enabling the characteristics of the original high-dimensional data to be accurately captured and the dimensionality to be reduced to a minimum.
2. Dimension reduction technology using deep learning
In general, deep learning can determine the combination of parameters that minimize the objective cost even in complex problems by defining the objective cost that need to be minimized. Using this feature, Fujitsu introduced parameters to control both the auto-encoder which reduces the dimension of data and the distribution of data after dimensionality reduction. Our method calculates the amount of information after compression as an objective cost and optimized it through deep learning. This allows the dimensionally reduced distribution and the probability of the data to be accurately characterized when optimized according to the mathematical theory described in 1 above.
Going forward, Fujitsu will promote the practical application of the newly-developed technology, with the aim of putting it into practical use by the end of fiscal 2021, and will apply it to even more AI technologies.
(1) Curse of Dimensionality phenomenon describing exponential increase in computational complexity as the number of dimensions of data increases. (2) Discrete cosine transform a type of Fourier transform that transforms an image or audio signal into the intensity of a frequency component. (3) Autoencoder a neural network-based unsupervised dimensional compression technique.
About Fujitsu
Fujitsu is the leading Japanese information and communication technology (ICT) company offering a full range of technology products, solutions and services. Approximately 130,000 Fujitsu people support customers in more than 100 countries. We use our experience and the power of ICT to shape the future of society with our customers. Fujitsu Limited (TSE:6702) reported consolidated revenues of 3.9 trillion yen (US$35 billion) for the fiscal year ended March 31, 2020. For more information, please see www.fujitsu.com.
About Fujitsu Laboratories
Founded in 1968 as a wholly owned subsidiary of Fujitsu Limited, Fujitsu Laboratories Ltd. is one of the premier research centers in the world. With a global network of laboratories in Japan, China, the United States and Europe, the organization conducts a wide range of basic and applied research in the areas of Next-generation Services, Computer Servers, Networks, Electronic Devices and Advanced Materials. For more information, please see: http://www.fujitsu.com/jp/group/labs/en/.
Source: Fujitsu Ltd Sectors: Cloud & Enterprise, Artificial Intel [AI]
Copyright ©2024 JCN Newswire. All rights reserved. A division of Japan Corporate News Network.
|
Latest Release
Mitsubishi Shipbuilding Receives Order from the University of Tokyo for "MiPoLin" Power Prediction and Lines Selection System Mar 28, 2024 16:45 JST
| Mitsubishi Logisnext Completes Demonstration of Automated Truck Loading, Leading to Start of Actual Operations in Japan Mar 28, 2024 13:52 JST
| Toyota Releases Sales, Production, and Export Results for February 2024 Mar 28, 2024 13:35 JST
| TANAKA to Install 500 kW Fuel Cell System to Promote the Use of Hydrogen Energy at Production Plants Mar 28, 2024 03:00 JST
| PEVE to change name to TOYOTA BATTERY Co., Ltd. and produce batteries for a wide range of electric vehicles Mar 27, 2024 15:13 JST
| NTT and Olympus Begin World's First Joint Demonstration Experiment of Cloud Endoscopy System Mar 27, 2024 15:00 JST
| TANAKA Holdings Announces Green Loan Financing for Construction of New Head Office Building Mar 27, 2024 03:00 JST
| JFE Steel and Hitachi Jointly Started Providing Solutions for the Steel Industry Mar 26, 2024 19:04 JST
| La Banque Postale and JCB join forces to elevate payments experience for travellers in France Mar 26, 2024 15:00 JST
| Fujitsu Tech Leverages AI and Underwater Drone Data for 'Ocean Digital Twin' Mar 26, 2024 10:24 JST
| NEC develops marketing strategy planning & effectiveness simulation technology using generative AI Mar 25, 2024 10:08 JST
| Toyota to Open New Tokyo Head Office in Shinagawa in FY2030 Mar 22, 2024 16:15 JST
| Hitachi Selected as CDP Supplier Engagement Leader for the Third Consecutive Year Mar 22, 2024 16:04 JST
| Mitsubishi Motors Celebrates Production of 100,000th fully electric minivehicle Mar 22, 2024 15:34 JST
| JCB and AEON Credit Service Indonesia Launch the AEON JCB Precious Card Mar 22, 2024 15:00 JST
| JCB Issues White Paper on Calculating CO2 Emissions by Payment Method in Japan Mar 22, 2024 12:00 JST
| NEC and NTT successfully conduct first-of-its-kind long-distance transmission experiment over 7,000km using 12-core optical fiber Mar 22, 2024 08:38 JST
| Lifenet and Eisai Co-Develop Dementia Insurance "be" Mar 21, 2024 17:36 JST
| Toyota: Clarification of the Roles of and Expectations for Outside Executives, Revision of the Independence Assessment Criteria, and the Changes to Members of the Board of Directors and the Audit and Supervisory Board Members following the 120th Ordinary General Shareholders' Meeting Mar 21, 2024 16:47 JST
| Sales of Evolved GR Yaris to Start in April, While Purchasing Lotteries for WRC Driver-supervised Special Editions Start Today Mar 21, 2024 16:31 JST
|
More Latest Release >>
|