3
Mathematicians from RUDN University and the Free University of Berlin proposed a new way of using neural networks for working with noisy high-dimensional data

Mathematicians from RUDN University and the Free University of Berlin proposed a new way of using neural networks for working with noisy high-dimensional data

Mathematicians from RUDN University and the Free University of Berlin have proposed a new approach to studying the probability distributions of observed data using artificial neural networks. The new approach works better with so-called outliers, i.e. input data objects that deviate significantly from the overall sample.

The restoration of the probability distribution of observed data by artificial neural networks is the most important part of machine learning. The probability distribution not only allows us to predict the behaviour of the system under study, but also to quantify the uncertainty with which forecasts are made. The main difficulty is that, as a rule, only the data are observed, but their exact probability distributions are not available. To solve this problem, Bayesian and other similar approximate methods are used. But their use increases the complexity of a neural network and therefore makes its training more complicated.

RUDN University and the Free University of Berlin mathematicians used deterministic weights in neural networks, which would help overcome the limitations of Bayesian methods. They developed a formula that allows one to correctly estimate the variance of the distribution of observed data. The proposed model was tested on different data: synthetic and real; on data containing outliers and on data from which the outliers were removed. The new method allows restoration of probability distributions with accuracy previously unachievable.

The mathematicians of RUDN University and the Free University of Berlin used deterministic weights for neural networks and used the networks outputs to encode the distribution of latent variables for the desired marginal distribution. An analysis of the training dynamics of such networks allowed them to obtain a formula that correctly estimates the variance of observed data, despite the presence of outliers in the data. The proposed model was tested on different data: synthetic and real. The new method allows restoring probability distributions with higher accuracy compared with other modern methods. Accuracy was assessed using the AUC method (area under the curve is the area under the graph that allows making assessment of the mean square error of the predictions depending on the sample size estimated by the network as “reliable”; the higher the AUC score, the better the predictions).

The article was published in the journal Artificial Intelligence.

International Projects View all
International scientific cooperation View all
16 Oct
530 applications, 90 young scientists from 30 countries. Darya Nazarova, a postgraduate student of RUDN Faculty of Economics, traveled 11,276 km from Moscow to Sao Paulo for the International Scientific School on Technological and Innovation Strategies and Economic Development Policy at the University of Campinas (UNICAMP). Darya Nazarova, a young RUDN scientist, writes about scientific research, rafting and the country of eternal carnival.
52
Similar newsletter View all
16 Oct
Green Diplomacy Center opened in RUDN

A Center for Green Diplomacy was created based on the RUDN Institute of Environmental Engineering. Among the goals is the integration of the results of scientific and practical activities into the development of international relations in the environmental sphere. The center's specialists will also accompany the corporate sector in solving various environmental problems.

78
19 Apr
A huge pizza and a jug of water, why should 5G networks be sliced? The winners of RUDN science competition explain

RUDN summarized the results of the scientific competition "Project Start: work of the science club ". Students of the Faculty of Physics, Mathematics and Natural Sciences have created a project for a managed queuing system using a neural network to redistribute resources between 5G segments. How to increase flexibility, make the network fast and inexpensive and reach more users — tell Gebrial Ibram Esam Zekri ("Fundamental Computer Science and Information Technology", Master's degree, II course) and Ksenia Leontieva ("Applied Mathematics and Computer Science", Master's degree, I course).

157
19 Apr
Lyricists and physicists are now on equal terms: the first humanitarian laboratory opened in RUDN

What is your first association with the word “laboratory”? Flasks and beakers? Microscopes and centrifuges? Yes, many of us would answer the same way.

202
Similar newsletter View all